Course Description
This graduate course is part of a first-year training program for Genetics & Genomics Scholars. It provides a broad understanding of how to apply principles of data science to the large, multi-faceted datasets that are central to modern genetics and genomics, with a focus on applying these principles to the analysis of genetic and genomic data. Students will develop basic skills for reproducible research, including project organization, version control, and test-evaluate-diagnose development. While exploring the universe of genetics and genomics analysis packages, students will focus on the R data-science platform. They will develop their skills in common genetics and genomics analyses, including RNA-seq differential expression and population genetics statistics. The final product for the course will be a collaborative, small-group project consisting of a defined analysis of a genetic or genomic dataset of the group's choice.