Eric J. Tchetgen Tchetgen

Eric J. Tchetgen Tchetgen
  • University Professor
  • Professor of Biostatistics in Biostatistics and Epidemiology
  • Professor of Statistics and Data Science

Contact Information

  • office Address:

    407 Academic Research Building
    265 South 37th Street
    Philadelphia, PA 19104

Research Interests: Semiparametric theory, nonparametric statistics, causal inference, missing data, and epidemiologic methods.

Overview

Education

Ph.D., 2006, Harvard University
B.S., 1999, Yale University

Research

My primary area of interest is in semi-parametric efficiency theory with application to causal inference, missing data problems, statistical genetics and mixed model theory. In general, I work on the development of statistical and epidemiologic methods that make efficient use of the information in data collected by scientific investigators, while avoiding unnecessary assumptions about the underlying data generating mechanism.

 

 

 

Continue Reading

Research

Teaching

Current Courses

  • STAT9220 - Advanced Causal Inference

    This course will provide an in depth investigation of statistical methods for drawing causal inferences from complex observational studies and imperfect randomized experiments. Formalization will be given for key concepts at the foundation of causal inference, including: confounding, comparability, positivity, interference, intermediate variables, total effects, controlled direct effects, natural direct and indirect effects for mediation analysis, generalizability, transportability, selection bias, etc.... These concepts will be formally defined within the context of a counterfactual causal model. Methods for estimating total causal effects in the context of both point and time-varying exposure will be discussed, including regression-based methods, propensity score techniques and instrumental variable techniques for continuous, discrete, binary and time to event outcomes. Mediation analysis will be discussed from a counterfactual perspective. Causal directed acyclic graphs (DAGs) and associated nonparametric structural equations models (NPSEMs) will be used to formalize identification of causal effects for static and dynamic longitudinal treatment regimes under unconfoundedness and unmeasured confounding settings. This formalization will be used to define, identify and make inferences about the joint effects of time-varying exposures in the presence of (possibly hidden) time-dependent covariates that are simultaneously confounders and intermediate variables. These methods include g-estimation of structural nested models, inverse probability weighted estimators of marginal structural models, and g-computation algorithm estimators. Credible quasi-experimental causal inference methods will be described, leveraging auxiliary variables such as instrumental variables, negative control variables, or more broadly confounding proxy variables. Quasi-experimental methods discussed will include the control outcome calibration approach, proximal causal inference, difference-in-differences and related generalizations of these methods. Semiparametric efficiency and the prospects for doubly robust inference will feature prominently throughout the course, including methods that combine modern semiparametric theory and machine learning techniques.

    STAT9220001 ( Syllabus )

Past Courses

  • BSTA6990 - Lab Rotation

    Student lab rotation.

  • BSTA8990 - Pre-Dissertation Lab Rot

  • PSYC6120 - Int To Nonp & Loglin Mod

    An applied graduate level course for students who have completed an undergraduate course in basic statistical methods. Covers two unrelated topics: loglinear and logit models for discrete data and nonparametric methods for nonnormal data. Emphasis is on practical methods of data analysis and their interpretation. Primarily for doctoral students in the managerial, behavioral, social and health sciences. Permission of instructor required to enroll.

  • STAT3990 - Independent Study

    Written permission of instructor and the department course coordinator required to enroll in this course.

  • STAT5010 - Int To Nonp & Loglin Mod

    An applied graduate level course for students who have completed an undergraduate course in basic statistical methods. Covers two unrelated topics: loglinear and logit models for discrete data and nonparametric methods for nonnormal data. Emphasis is on practical methods of data analysis and their interpretation. Primarily for doctoral students in the managerial, behavioral, social and health sciences. Permission of instructor required to enroll.

  • STAT9210 - Observational Studies

    This course will cover statistical methods for the design and analysis of observational studies. Topics will include the potential outcomes framework for causal inference; randomized experiments; matching and propensity score methods for controlling confounding in observational studies; tests of hidden bias; sensitivity analysis; and instrumental variables.

  • STAT9220 - Advanced Causal Inference

    This course will provide an in depth investigation of statistical methods for drawing causal inferences from complex observational studies and imperfect randomized experiments. Formalization will be given for key concepts at the foundation of causal inference, including: confounding, comparability, positivity, interference, intermediate variables, total effects, controlled direct effects, natural direct and indirect effects for mediation analysis, generalizability, transportability, selection bias, etc.... These concepts will be formally defined within the context of a counterfactual causal model. Methods for estimating total causal effects in the context of both point and time-varying exposure will be discussed, including regression-based methods, propensity score techniques and instrumental variable techniques for continuous, discrete, binary and time to event outcomes. Mediation analysis will be discussed from a counterfactual perspective. Causal directed acyclic graphs (DAGs) and associated nonparametric structural equations models (NPSEMs) will be used to formalize identification of causal effects for static and dynamic longitudinal treatment regimes under unconfoundedness and unmeasured confounding settings. This formalization will be used to define, identify and make inferences about the joint effects of time-varying exposures in the presence of (possibly hidden) time-dependent covariates that are simultaneously confounders and intermediate variables. These methods include g-estimation of structural nested models, inverse probability weighted estimators of marginal structural models, and g-computation algorithm estimators. Credible quasi-experimental causal inference methods will be described, leveraging auxiliary variables such as instrumental variables, negative control variables, or more broadly confounding proxy variables. Quasi-experimental methods discussed will include the control outcome calibration approach, proximal causal inference, difference-in-differences and related generalizations of these methods. Semiparametric efficiency and the prospects for doubly robust inference will feature prominently throughout the course, including methods that combine modern semiparametric theory and machine learning techniques.

  • STAT9620 - Adv Methods Applied Stat

    This course is designed for Ph.D. students in statistics and will cover various advanced methods and models that are useful in applied statistics. Topics for the course will include missing data, measurement error, nonlinear and generalized linear regression models, survival analysis, experimental design, longitudinal studies, building R packages and reproducible research.

  • STAT9910 - Sem in Adv Appl of Stat

    This seminar will be taken by doctoral candidates after the completion of most of their coursework. Topics vary from year to year and are chosen from advance probability, statistical inference, robust methods, and decision theory with principal emphasis on applications.

  • STAT9950 - Dissertation

    Dissertation

  • STAT9990 - Independent Study

    Written permission of instructor and the department course coordinator required to enroll.

Awards And Honors

  • Co-winner of the Rousseeuw Prize for Statistics, 2022
  • Myrto Lefkopoulou Distinguished Lectureship, 2020
  • Co-winner of the Society of Epidemiologic Research and American Journal of Epidemiology Article of the Year, 2014 Description

    For the paper, “Assessment and indirect adjustment for confounding by smoking in cohort studies using relative hazards model” with David Richardson, Steve Cole
    and Dominique Laurier.

  • Career Incubator Award, Harvard School of Public Health, 2013
  • Co-winner of the Kenneth Rothman Epidemiology Prize, 2011 Description

    For the paper, “The use of negative controls to detect confounding and other sources of error in experimental and observational science.” with Marc Lipsitch and Ted Cohen.

  • Best Poster Award: Gene Environment Initiative Symposium, Boston, MA, 2008
  • Yerby Fellowship, Harvard School of Public Health, 2006
  • Mars Scholar, Yale University, 1995

In the News

Knowledge @ Wharton

Activity

In the News

Election Statistics

Wharton experts discuss the statistics, polling data, and momentum factors influencing the 2024 presidential election.Read More

Knowledge @ Wharton - 2024/11/6
All News