Skip Navigation

Course Catalog

140.629.01 Data Science for Public Health II

Department:
Biostatistics
Term:
4th term
Credits:
4 credits
Academic Year:
2019 - 2020
Location:
East Baltimore
Class Times:
  • Tu Th,  8:30 - 9:50am
Auditors Allowed:
Yes, with instructor consent
Grading Restriction:
Letter Grade or Pass/Fail
Contact:
Brian Caffo
Course Instructor :
Resources:
Prerequisite:

140.628, prior programming experience, precalculus mathematics

Description:

Presents the basics of data science using the R programming language. Teaches basic unix, version control, graphing and plotting techniques, creating interactive graphics, web app development, reproducible research tools and practices, resampling based statistics and artificial intelligence via deep learning, focusing on practical

implementation specifically tied to computational tools and core

fundamentals necessary for practical implementation. Culminates with a web app development project chosen by student (who will come out of this course sequence well-equipped to tackle many of the data science problems that they will see in their

research).

Learning Objectives:

Upon successfully completing this course, students will be able to:

  1. Demonstrate proficiency in data-oriented R programming
  2. Practice basic data cleaning in R
  3. Implement and demonstrate proficiency in tidyverse commands
  4. Implement plotting and interactive graphics tools on novel data sets
  5. Implement artificial intelligence programs on novel data sets
  6. Create a web application
  7. Implement resampling-based statistics
  8. Synthesize concepts of machine learning overfitting
  9. Synthesize concepts of probabilistic inference
Methods of Assessment:

This course is evaluated as follows:

  • 33% Homework
  • 33% Weekly Quizzes
  • 33% Final Capstone Project

Multiterm:
Instructor Consent:

No consent required

Special Comments:

Part 1 necessitates enrollment in Part II; grades given at end of Part II.