Yuqi Gu
Email: yuqi.gu@columbia.edu.
Address: Room 928 SSW, 1255 Amsterdam Avenue, New York, NY 10027
I am an Assistant Professor in the Department of Statistics at Columbia University. I am also a member of the Data Science Institute.
Before joining Columbia in 2021, I spent a year as a postdoc at Duke University, mentored by David B. Dunson.
In 2020 I received a Ph.D. in Statistics from the University of Michigan, advised by Gongjun Xu.
In 2015 I received a B.S. in Mathematics from Tsinghua University.
My first name can be pronounced as /ju:-tʃi:/. My name in Chinese is 顾雨琦.
My research develops statistical theory and methodology for uncovering \textbf{latent structure} in complex data. A unifying theme is to make latent structure and representation learning identifiable, interpretable, computationally scalable, and statistically reliable.
- Identifiable deep generative models and causal representation learning: I study identifiability, latent graph discovery, and causal representation learning in nonlinear probabilistic graphical models with latent structures.
- High-dimensional inference for latent structure: The high dimensionality and the latent structure pose double statistical challenges. I develop spectral, tensor, and likelihood-based methods for latent class, mixed-membership, and nonlinear low-rank representation problems, with finite-sample theory and uncertainty quantification.
- Latent variable models for psychometrics, heterogeneous populations, and AI evaluation: Principled latent trait models for educational, psychological, biomedical, and language-model data, including cognitive diagnosis, item response theory, and psychometric frameworks for evaluating large language models (LLMs).
Recent News
| 04/2026 | Big congratulations to Seunghyun Lee on successfully defending his PhD dissertation and becoming Dr. Lee! |
|---|---|
| 04/2026 | Our paper Generalized Grade-of-Membership Estimation for High-dimensional Locally Dependent Data is accepted by Journal of the American Statistical Association. |
| 04/2026 | New preprint is available: Scalable Variational Inference for Probabilistic Boolean Matrix Factorization with Unknown Latent Dimension. |
| 04/2026 | Congratulations to Chengzhu Huang on receiving the 2026 IMS Hannan Graduate Student Travel Award for the work Minimax-Optimal Spectral Clustering with Covariance Projection for High-Dimensional Anisotropic Mixtures! |
| 03/2026 | New preprint is on arXiv: Discrete Causal Representation Learning. |
| 03/2026 | New preprint is on arXiv: Scalable Text-Embedding-informed Cognitive Diagnosis of Large Language Models. |
| 03/2026 | Our paper Adaptive Transfer Clustering: A Unified Framework is accepted by Journal of the American Statistical Association. |
| 02/2026 | Our paper Spectral Clustering with Likelihood Refinement for High-dimensional Latent Class Recovery is accepted by Psychometrika. |
| 01/2026 | Congratulations to both Zhiyu Xu and Wenjin Zhang for winning the 2026 ASA Student Paper Award from the Statistical Learning and Data Science Section! They are each invited to give a talk in an award session at JSM 2026 in Boston: Zhiyu’s paper is on Latency-Response Theory Model for Evaluating LLMs, and Wenjin’s paper is on Discrete Causal Representation Learning. |
| 12/2025 | New preprint Latency-Response Theory Model: Evaluating LLMs via Response Accuracy and Chain-of-Thought Length is on arXiv. |
| 12/2025 | My paper Constructive Q-Matrix Identifiability via Novel Tensor Unfolding is accepted by Psychometrika. |
| 11/2025 | Our paper Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers is accepted by Journal of the American Statistical Association. |
| 09/2025 | Our paper Learning from Similar Linear Representations is published in Journal of Machine Learning Research. |
| 04/2025 | Big congratulations to Ling Chen on successfully defending her PhD dissertation and becoming Dr. Chen! |
| 01/2025 | Big congratulations to Zhongyuan Lyu, who just accepted the position of Lecturer in Business Analytics (equivalent to US tenure-track Assistant Professor) at the University of Sydney’s Business School! |
| 01/2025 | Congratulations to Seunghyun Lee for winning the 2025 American Statistical Association (ASA) Student Paper Award from the Statistical Learning and Data Science Section for our work Deep Discrete Encoders! |
| 01/2025 | Our paper Degree-heterogeneous Latent Class Analysis for High-dimensional Discrete Data is accepted by Journal of the American Statistical Association (JASA). |