431 Academic Research Building
265 South 37th Street
Philadelphia, PA 19104
Research Interests: genomics, change-point methods, empirical bayes estimation, model and variable selection, scan statistics, statistical modeling
Links: CV, Lab Website
Dr. Zhang is a Ge Li and Ning Zhao Professor of Statistics in The Wharton School at University of Pennsylvania. Her research focuses primarily on the development of statistical methods and computational algorithms for the analysis of data from high-throughput biological experiments. She has made contributions to copy number and structural variant detection, to the modeling and estimation of intra-tumor genetic heterogeneity, and to the modeling and analysis of single-cell and spatial genomic data. In Statistics, she has made contributions to change-point analysis, variable selection, and model selection.
Dr. Zhang obtained her Ph.D. in Statistics in 2005 from Stanford University. After one year of postdoctoral training at University of California, Berkeley, she returned to the Department of Statistics at Stanford University as Assistant Professor in 2006. She received the Sloan Fellowship in 2011, and formally moved to University of Pennsylvania with tenure in 2012. She was awarded the Medallion Lectureship by the Institute of Mathematical Statistics in 2021 and the P.R. Krishnaiah Memorial Lectureship in 2023. Her work has been funded by grants from the NSF, NIH, and Mark Foundation. At Penn, she is a member of the Abramson Cancer Center and the Graduate Group in Genomics and Computational Biology, and Senior Fellow of Institute of Biomedical Informatics. Dr. Zhang currently serves as the Vice Dean of the Wharton Doctoral Program.
Here are some of Dr. Zhang’s representative publications, categorized by topic (ǂalphabetical ordering, *corresponding author):
Computational methods for single cell data denoising, batch correction, and transfer learning
Computational methods for integrative multi-omic modeling of single cell data
Computational methods for analysis of spatial genomic data and integration of spatial and single cell data.
DNA copy number estimation, variant detection and inference
Intra-tumor heterogeneity and cancer genomics.
Change-point detection and scan statistics
General multiple testing control, high-dimensional inference
For a complete overview of Dr. Zhang’s publications, funded grants, and teaching, mentoring, and service work, see her CV above.
You can find the latest updates on my research on my lab website:
For a complete list of my publications and funded grants, the most trustworthy source is my CV (see link above). The searchable publication list below is only updated once per year.
This seminar-based course provides students with the opportunity to hone their data science skills and gain practical experience by working with a community organization on a data science problem of interest to the organization. Students will gain skills in problem formulation, collaboration with community organizations and communication of data science results. Students will work in groups on a data science problem of interest to a community organization.
STAT9915301 ( Syllabus )
Independent Study allows students to pursue academic interests not available in regularly offered courses. Students must consult with their academic advisor to formulate a project directly related to the student’s research interests. All independent study courses are subject to the approval of the AMCS Graduate Group Chair.
Study under the direction of a faculty member.
Lab rotation
Pre-dissertation lab research
Ph.D. students enroll in this course after passing their candidacy exam. They work on their dissertation full-time under the guidance of their dissertation supervisor and other members of their dissertation committee.
The goal of this course is to introduce students to the R programming language and related eco-system. This course will provide a skill-set that is in demand in both the research and business environments. In addition, R is a platform that is used and required in other advanced classes taught at Wharton, so that this class will prepare students for these higher level classes and electives.
The goal of this course is to introduce students to the R programming language and related eco-system. This course will provide a skill-set that is in demand in both the research and business environments. In addition, R is a platform that is used and required in other advanced classes taught at Wharton, so that this class will prepare students for these higher level classes and electives.
This is a course that prepares 1st year PhD students in statistics for a research career. This is not an applied statistics course. Topics covered include: linear models and their high-dimensional geometry, statistical inference illustrated with linear models, diagnostics for linear models, bootstrap and permutation inference, principal component analysis, smoothing and cross-validation.
This seminar will be taken by doctoral candidates after the completion of most of their coursework. Topics vary from year to year and are chosen from advance probability, statistical inference, robust methods, and decision theory with principal emphasis on applications.
This seminar-based course provides students with the opportunity to hone their data science skills and gain practical experience by working with a community organization on a data science problem of interest to the organization. Students will gain skills in problem formulation, collaboration with community organizations and communication of data science results. Students will work in groups on a data science problem of interest to a community organization.
Dissertation
Written permission of instructor and the department course coordinator required to enroll.
Wharton’s Cait Lamberton breaks down the demise of fabric and crafts chain Joann and why it’s so challenging for legacy brands to stay solvent.…Read More
Knowledge @ Wharton - 2025/03/18