# 140.653.01 METHODS IN BIOSTATISTICS III

Department:
Term: 3rd term
Credits: 4 credits
Contact: Hong Kai Ji
Course Instructor:
Description:

Focuses on regression analysis for continuous and discrete data, and data analyses that integrate the methods learned in 140.651-652. Regression topics include simple linear regression; a matrix formulation of multiple linear regression; inference for coefficients, predicted values, and residuals; tests of hypotheses; graphical displays and regression diagnostics; specific models, including polynomial regression, splines, one- and two-way ANOVA; variable selection non-parametric regression; log-linear models for incidence rates and contingency tables; logistic regression; and generalized linear models.

Old Learning Objective:

Students who master the content of this course will be able to: 1) Formulate a scientific question about the relationship of a continuous response variable Y and predictor variables X in terms of the appropriate linear regression model; 2) Interpret the meaning of regression coefficients in scientific terms as if for a substantive journal (2.1 Explicitly define the epidemiologic terms “confounding” and “effect modification” in terms of multiple regression coefficients); 3) Develop graphical and/or tabular displays of the data to display the evidence relevant to describing the relationship of Y with one X controlling for others (3.1 Use an adjusted variables plot to explain the meaning of a multiple regression coefficient); 4) Estimate the model using a modern statistical package such as STATA or R and interpret the results for substantive colleagues (4.1 Derive the least squares estimators for the linear model and the distribution of coefficients, predicted values, residuals and linear functions of them); 5) Check the major assumptions of the model including independence and model form (mean, variance and distribution of residuals) and make changes to the model or method of estimation and inference to appropriately handle violations of standard assumptions (5.1 Use weighted least squares for situations with unequal variances; 5.2 Use robust variance estimates for violations of independence or variance or distributional assumptions; 5.3 Use regression diagnostics to prevent a small fraction of observations from having undue influence on the results); 6) Write a methods and results section for a substantive journal, correctly describing the regression model in scientific terms and the method used to specify and estimate the model. Correctly interpret the regression results to answer the specific substantive questions posed in scientific terms that can be understood by substantive experts; 7) Critique the methods and results from the perspective of the statistical methods chosen and alternative approaches that might have been.

New Learning Objective(s):
Upon successfully completing this course, students will be able to:
Formulate a scientific question about the relationship of a continuous response variable Y and predictor variables X in terms of the appropriate linear regression model
Interpret the meaning of regression coefficients in scientific terms as if for a substantive journal (2.1 Explicitly define the epidemiologic terms “confounding” and “effect modification” in terms of multiple regression coefficients)
Develop graphical and/or tabular displays of the data to display the evidence relevant to describing the relationship of Y with one X controlling for others (3.1Use an adjusted variables plot to explain the meaning of a multiple regression coefficient)
Estimate the model using a modern statistical package such as STATA or R and interpret the results for substantive colleagues 4.1 Derive the least squares estimators for the linear model and the distribution of coefficients, predicted values, residuals and linear functions of them
Check the major assumptions of the model including independence and model form (mean, variance and distribution of residuals) and make changes to the model or method of estimation and inference to appropriately handle violations of standard assumptions 5.1 Use weighted least squares for situations with unequal variances; 5.2 Use robust variance estimates for violations of independence or variance or distributional assumptions; 5.3 Use regression diagnostics to prevent a small fraction of observations from having undue influence on the results
Write a methods and results section for a substantive journal, correctly describing the regression model in scientific terms and the method used to specify and estimate the model. Correctly interpret the regression results to answer the specific substantive questions posed in scientific terms that can be understood by substantive experts
Critique the methods and results from the perspective of the statistical methods chosen and alternative approaches that might have been

Methods of Assessment: Student evaluation based on problem sets, a midterm exam, and a final exam
Location: East Baltimore
Class Times:
• Tuesday 10:30 - 11:50
• Thursday 10:30 - 11:50
Enrollment Minimum: 10
Instructor Consent: No consent required
Prerequisite:

140.652

Auditors Allowed: Yes, with instructor consent