Monday, February 26, 2018
1:15 - 2:15 PM
Minot Room, 5th Floor
Countway Library, HMS
Pizza lunch will be provided (available at 1:00 PM)
Professor, Medicine, Harvard Medical School
Associate Biostatistician, Channing Laboratory, Brigham and Women’s Hospital
Semantically rich interfaces for cloud-scale genomics
Matching statistically and computationally efficient inference methods to problems in genome-scale biology is intrinsically difficult. The difficulty is compounded by rapid changes in both domains: computational environments for data management and statistics are evolving both technically and conceptually, and readouts in genome-scale biology are growing in diversity, size, and complexity.
This talk will review some approaches under development in the Bioconductor project to streamline exploratory data analysis for cloud-scale genomic data. Apart from fostering scalable access to resources of arbitrary size, important concerns include the definition and use of biologically meaningful filters and covariates for integrative modeling. Examples will be drawn from TCGA, the 10x million neuron dataset, and GTEx.