140.688.01 STATISTICS FOR GENOMICS
- Rafael Irizarry
- Andrew Jaffe
Covers the basics of R software and the key capabilities of the Bioconductor project (a widely used open source and open development software project for the analysis and comprehension of data arising from high-throughput experimentation in genomics and molecular biology and rooted in the open source statistical computing environment R), including importation and preprocessing of high-throughput data from microarrays and other platforms. Also introduces statistical concepts and tools necessary to interpret and critically evaluate the bioinformatics and computational biology literature. Includes an overview of of preprocessing and normalization, statistical inference, multiple comparison corrections, Bayesian Inference in the context of multiple comparisons, clustering, and classification/machine learning.
Upon successful completion of this course, students will be able to: 1) Understand the basics of how microarray technology works; 2) Understand and critique existing methodology for the analysis of microarray data; 3) Write R code to import and analyze microarray data.
- Tuesday 1:30 - 2:50
- Thursday 1:30 - 2:50