Statistics Seminar Series

Choose which semester to display:

Schedule for Fall 2024

Seminars are on Mondays
Time: 4:10 pm - 5:00 pm

Location: Room 903 SSW, 1255 Amsterdam Avenue

 

 

 

 

9/9/2024

 

 

Caleb Miles (Columbia University

Title: Optimal Tests of the Composite Null Hypothesis Arising in Mediation Analysis

Abstract: The indirect effect of an exposure on an outcome through an intermediate variable can be identified by a product of regression coefficients under certain causal and regression modeling assumptions. In this context, the null hypothesis of no indirect effect is a composite null hypothesis, as the null holds if either regression coefficient is zero. A consequence is that traditional hypothesis tests are severely underpowered near the origin (i.e., when both coefficients are small with respect to standard errors). We propose hypothesis tests that (i) preserve level alpha type 1 error, (ii) meaningfully improve power when both true underlying effects are small relative to sample size, and (iii) preserve power when at least one is not. One approach gives a closed-form test that is minimax optimal with respect to local power over the alternative parameter space. Another uses sparse linear programming to produce an approximately optimal test for a Bayes risk criterion. We also propose adaptations to large-scale hypothesis testing as well as modifications that yield improved interpretability. We provide an R package that implements the minimax optimal test.

Bio: Dr. Caleb Miles works on developing semiparametric methods for causal inference and applying them to problems in public health. His applied work is largely in HIV/AIDS, psychiatry, anesthesiology, and drug abuse. His methodological research interests include causal inference, its intersection with machine learning, mediation analysis, transportability/generalizability, and measurement error.

 

 

9/16/2024

Alexander Aue ( University of Californa, Davis)

Title:  Testing general linear hypotheses in a high-dimensional regression model with spiked covariance

Abstract:  This talk is on the problem of testing linear hypotheses in a multivariate regression model with a high-dimensional response and spiked noise covariance. The proposed family of tests consists of test statistics based on weighted sums of projections of the data onto the estimated latent factor directions, with the weights acting as regularization parameters. We establish asymptotic normality of the test statistics under the null hypothesis. We also establish the power characteristics of the tests and propose a data-driven choice of the regularization parameters under a family of local alternatives. The performance of the proposed tests is evaluated through a simulation study. Finally, the proposed tests are applied to the Human Connectome Project data to test for the presence of associations between volumetric measurements of human brain and behavioral variables. The talk is based on joint work with Haoran Li, Debashis Paul and Jie Peng.

Bio:   Alexander Aue is a professor in the Department of Statistics at the University of California, Davis. He holds a BS degree in Mathematics from Philipps University Marburg and a PhD degree in Applied Mathematics from the University of Cologne, both in Germany. His primary research expertise is in time series, in particular for functional and high–dimensional models. Aue was elected a Fellow of the American Statistical Association in 2016 and of the Institute of Mathematical Statistics in 2018. He is also the 2016 recipient of the UC Davis Chancellor's Award for Excellence in Mentoring Undergraduate Research. Aue is a co-editor of the Journal of Time Series Analysis and serves as an associate editor for a number of leading statistics journals, including the Annals of Statistics. He is the Program Director of the interdisciplinary major in Data Science housed in the Department of Statistics at UC Davis.

 

9/23/2024

Pragya Sur (Harvard University)

Title: Spectrum-Aware Debiasing: A Modern Inference Framework with Applications to Principal Components Regression

Abstract: Debiasing methodologies have emerged as a powerful tool for statistical inference in high dimensions.

Since its original introduction, the methodology witnessed a major advancement with the introduction of degrees-

of-freedom debiasing in Bellec and Zhang (2019). While overcoming limitations of initial debiasing approaches, this

updated method suffered a limitation—it relied on sub-Gaussian tails and independent, identically distributed

samples. In this talk, we propose a novel debiasing formula that breaks this barrier by exploiting the spectrum of the

sample covariance matrix. Our formula applies to a broader class of designs known as right rotationally invariant

designs, which include some heavy-tailed distributions, as well as certain dependent data settings. Our correction

term differs significantly from prior work but recovers the Gaussian-based formula as a special case. Notably, our

approach does not require estimating the high-dimensional population covariance matrix yet can account for

dependence among features and samples. We demonstrate the utility of our method for several statistical inference

problems. As a by-product, our work also introduces the first debiased principal component regression estimator with

formal guarantees in high dimensions. This is based on joint work with Yufan Li.

 

 

9/30/2024

Yihong Wu (Yale University)

Title: Recent advances in Empirical Bayes: statistical and optimization perspective

Abstract: Introduced by Robbins in the 1950s, Empirical Bayes is a powerful approach and popular framework for large-scale inference that aims at learning and adapting to latent structure in data, by finding data-driven estimators to compete with the Bayesian oracle that knows the true prior. This talk surveys some recent theoretical, methodological, and algorithmic advances in empirical Bayes, in both classical sequence models and extensions where latent variables and data interact through more complex designs. A central theme of this talk is the nonparametric maximum likelihood estimator of Kiefer and Wolfowitz. Along the way, I will introduce various open problems in the theory and practice of empirical Bayes. Time permitting, I will discuss applications of empirical Bayes techniques to score matching and diffusion models.

This talk is based on joint works with Zhou Fan, Leying Guan, Andre Wibisono, and Kaylee Yang at Yale, Soham Jana at Notre Dame, Yury Polyanskiy and Anzo Teh at MIT, and Yandi Shen at CMU: https://arxiv.org/abs/2008.08244https://arxiv.org/abs/2109.03943https://arxiv.org/abs/2209.01328https://arxiv.org/abs/2211.12692https://arxiv.org/abs/2312.12708https://arxiv.org/abs/2402.07747

Bio: Yihong Wu is James A. Attwood Professor and Department Chair of Statistics and Data Science at Yale University. He received his B.E. degree from Tsinghua University in 2006 and Ph.D. degree from Princeton University in 2011. He was a postdoctoral fellow with the Statistics Department in The Wharton School at the University of Pennsylvania from 2011 to 2012 and an assistant professor in the Department of ECE at the University of Illinois at Urbana-Champaign from 2013 to 2015. His research interests are in the theoretical and algorithmic aspects of high-dimensional statistics, information theory, and optimization. He was elected an IMS fellow in 2023 and was a recipient of the NSF CAREER award in 2017 and the Sloan Research Fellowship in Mathematics in 2018. 

 

10/7/2024

Nicolas Garcia-Trillos (UW Madison)

Title: Minimax rates for the learning of spectra of differential operators from data.

Abstract: The field of graph-based learning is one that is closely connected to manifold learning. It explores the following question: given a collection of data points $x_1 , \dots, x_n$ and a similarity graph over it, how can one use this graph to learn relevant geometric features from the dataset and in turn learn about the distribution that generated it? The question becomes a geometric or analytical problem when one assumes that the sampling distribution is supported on an unknown low dimensional manifold, as it is postulated by the manifold hypothesis. 

In this talk, I will discuss that, despite the multiple questions and answers that have been explored in the area of graph-based learning, there are several fundamental questions in statistical theory that have been largely unexplored, all of which are essential for manifold learning. Examples of these questions include: 1) What is the best possible estimator (potentially not graph-based), from a sample efficiency perspective, for learning features of unknown manifolds from observed data? 2) What is the asymptotic efficiency of popular graph-based estimators used in unsupervised learning? I will focus on the first type of question in the context of the task of learning the spectra of elliptic differential operators from data and will present new results that can be interpreted as a first step in bridging the gap between the mathematical analysis of graph-based learning and the analysis of fundamental statistical questions like the ones mentioned above.  Throughout the talk, I will highlight the connection between the spectrum estimation and density estimation problems, and through this connection I will motivate a series of open mathematical questions related to operator learning and generative models using contemporary machine learning tools. 

This talk is based on very recent work with my PhD student Chenghui Li (UW-Madison) and with Raghavendra Venkatraman (Utah).

Bio: Nicolás García Trillos is an associate professor at the department of statistics of University of Wisconsin Madison. He does work at the intersection of applied analysis, applied probability, machine learning, and statistics. Some of his current interests include adversarial machine learning, operator learning, and optimal transport in statistics.

 

10/14/2024

Yao Xie (Georgia Institute of Technology)

Title:   TBA

Abstract:   TBA

Bio:  TBA

 

10/17/2024

Peng Ding (Berkeley

Title:  TBA

Abstract: TBA

Bio: TBA

 

10/21/2024

Johannes Schmidt-Hieber (University of Twente)

Title: TBA

Abstract: TBA

Bio: TBA

 

 

10/28/2024

Jeremias Knoblauch (University College London)

Title: TBA

Abstract: TBA

Bio: TBA

 

11/4/2024

HOLIDAY
11/11/2024

Adel Javanmard (University of Southern California)

Title: TBA

Abstract: TBA

Bio: TBA

11/18/2024

Tapabrata Maiti (Michigan State University)

Title: TBA

Abstract: TBA

Bio: TBA

11/25/2024

Omiros Papaspilopoulous (Bocconi)

Title: TBA

Abstract: TBA

Bio: TBA

12/2/2024 TBA
12/9/2024 TBA