Hands-on Genomics Lab
Since the completion of the Human Genome Project in 2003, genomic data has grown explosively, fueled by a dramatic drop in the cost of DNA sequencing: from $3B for the first genome to under $1,000 per genome today.
This workshop focuses on applying Apache Spark and related projects to life science challenges, covering GATK4 pipelines, genotype-phenotype association tests, and population-scale risk modeling.
9:00-9:50 Opening Remarks, Customer Use Case, and Setup
10:00-10:45 Accelerating Variant Calls with Apache Spark
10:45-11:30 Characterizing Genetic Variants with Spark SQL
11:45-12:15 Disease Risk Scoring with Machine Learning