Yuqi Gu

Email: yuqi.gu@columbia.edu.
Address: Room 928 SSW, 1255 Amsterdam Avenue, New York, NY 10027

I am an Assistant Professor in the Department of Statistics at Columbia University. I am also a member of the Data Science Institute. Before joining Columbia in 2021, I spent a year as a postdoc at Duke University, mentored by David B. Dunson. In 2020 I received a Ph.D. in Statistics from the University of Michigan, advised by Gongjun Xu. In 2015 I received a B.S. in Mathematics from Tsinghua University. My first name can be pronounced as /ju:-tʃi:/. My name in Chinese is 顾雨琦.

My research develops statistical theory and methodology for uncovering \textbf{latent structure} in complex data. A unifying theme is to make latent structure and representation learning identifiable, interpretable, computationally scalable, and statistically reliable.

  • Identifiable deep generative models and causal representation learning: I study identifiability, latent graph discovery, and causal representation learning in nonlinear probabilistic graphical models with latent structures.
  • High-dimensional inference for latent structure: The high dimensionality and the latent structure pose double statistical challenges. I develop spectral, tensor, and likelihood-based methods for latent class, mixed-membership, and nonlinear low-rank representation problems, with finite-sample theory and uncertainty quantification.
  • Latent variable models for psychometrics, heterogeneous populations, and AI evaluation: Principled latent trait models for educational, psychological, biomedical, and language-model data, including cognitive diagnosis, item response theory, and psychometric frameworks for evaluating large language models (LLMs).

Here is my Curriculum Vitae (CV).


Recent News

04/2026 Big congratulations to Seunghyun Lee on successfully defending his PhD dissertation and becoming Dr. Lee!
04/2026 Our paper Generalized Grade-of-Membership Estimation for High-dimensional Locally Dependent Data is accepted by Journal of the American Statistical Association.
04/2026 New preprint is available: Scalable Variational Inference for Probabilistic Boolean Matrix Factorization with Unknown Latent Dimension.
04/2026 Congratulations to Chengzhu Huang on receiving the 2026 IMS Hannan Graduate Student Travel Award for the work Minimax-Optimal Spectral Clustering with Covariance Projection for High-Dimensional Anisotropic Mixtures!
03/2026 New preprint is on arXiv: Discrete Causal Representation Learning.
03/2026 New preprint is on arXiv: Scalable Text-Embedding-informed Cognitive Diagnosis of Large Language Models.
03/2026 Our paper Adaptive Transfer Clustering: A Unified Framework is accepted by Journal of the American Statistical Association.
02/2026 Our paper Spectral Clustering with Likelihood Refinement for High-dimensional Latent Class Recovery is accepted by Psychometrika.
01/2026 Congratulations to both Zhiyu Xu and Wenjin Zhang for winning the 2026 ASA Student Paper Award from the Statistical Learning and Data Science Section! They are each invited to give a talk in an award session at JSM 2026 in Boston: Zhiyu’s paper is on Latency-Response Theory Model for Evaluating LLMs, and Wenjin’s paper is on Discrete Causal Representation Learning.
12/2025 New preprint Latency-Response Theory Model: Evaluating LLMs via Response Accuracy and Chain-of-Thought Length is on arXiv.
12/2025 My paper Constructive Q-Matrix Identifiability via Novel Tensor Unfolding is accepted by Psychometrika.
11/2025 Our paper Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers is accepted by Journal of the American Statistical Association.
09/2025 Our paper Learning from Similar Linear Representations is published in Journal of Machine Learning Research.
04/2025 Big congratulations to Ling Chen on successfully defending her PhD dissertation and becoming Dr. Chen!
01/2025 Big congratulations to Zhongyuan Lyu, who just accepted the position of Lecturer in Business Analytics (equivalent to US tenure-track Assistant Professor) at the University of Sydney’s Business School!
01/2025 Congratulations to Seunghyun Lee for winning the 2025 American Statistical Association (ASA) Student Paper Award from the Statistical Learning and Data Science Section for our work Deep Discrete Encoders!
01/2025 Our paper Degree-heterogeneous Latent Class Analysis for High-dimensional Discrete Data is accepted by Journal of the American Statistical Association (JASA).