Skip directly to content

Minimize RSR Award Detail

Research Spending & Results

Award Detail

Awardee:CARNEGIE MELLON UNIVERSITY
PD/PI:
  • Sivaraman Balakrishnan
  • sbalakri@andrew.cmu.edu
Co-PD(s)/co-PI(s):
  • Larry A Wasserman
  • Matey Neykov
Award Date:06/14/2021
Estimated Total Award Amount: $ 250,000
Funds Obligated to Date: $ 250,000
  • FY 2021=$250,000
Start Date:07/01/2021
End Date:06/30/2024
Transaction Type:Grant
Agency:NSF
Awarding Agency Code:4900
Funding Agency Code:4900
CFDA Number:47.049
Primary Program Source:040100 NSF RESEARCH & RELATED ACTIVIT
Award Title or Description:Foundations of High-Dimensional and Nonparametric Hypothesis Testing
Federal Award ID Number:2113684
DUNS ID:052184116
Parent DUNS ID:052184116
Program:STATISTICS
Program Officer:
  • Pena Edsel
  • (703) 292-8080
  • epena@nsf.gov

Awardee Location

Street:5000 Forbes Avenue
City:PITTSBURGH
State:PA
ZIP:15213-3815
County:Pittsburgh
Country:US
Awardee Cong. District:18

Primary Place of Performance

Organization Name:Carnegie-Mellon University
Street:5000 Forbes Avenue
City:Pittsburgh
State:PA
ZIP:15213-3890
County:Pittsburgh
Country:US
Cong. District:18

Abstract at Time of Award

Statistical inferential tools are the main export from the discipline of statistics to the empirical sciences, serving as the primary lens through which natural scientists interpret observations and quantify the uncertainty of their conclusions. However, in the analysis of modern large datasets the most common inferential tools available to us are fraught with pitfalls, often requiring various technical conditions to be checked before their valid application. This in turn has led to misuse of the inferential tools and subsequent misinterpretation of results. This research project will aim to address this issue by developing and analyzing new user-friendly methodologies for statistical inference in complex settings. The methods we develop will be broadly applicable to a wide variety of challenging inferential problems in the physical and biological sciences, will eliminate the need to verify technical conditions, and will ultimately be robust in their application. The principal and co-principal investigators will be involved in advising and mentoring graduate students, in curricular and course development, and in integrating the project with a research group on Statistical Methods in the Physical Sciences (STAMPS). This project will advance our understanding of high-dimensional and non-parametric inference along three frontiers. Firstly, we aim to develop statistical inferential tools for irregular models, which are valid under weak conditions. Our particular focus will be on mixture models, and on methods which use sample-splitting to avoid strong regularity conditions. Secondly, we will show that our methods achieve these strong guarantees at a surprisingly small statistical price. To rigorously quantify the statistical price paid for avoiding strong regularity conditions we will use minimax theory. However, standard minimax theory, in many cases, does not adequately capture the difficulty of statistical inference since the difficulty of inference can vary significantly across the parameter space. A more refined theory -- called local minimax theory -- leads to a more accurate picture, and we will study our methods via this lens. Finally, we will address the problem of conditional independence (CI) testing. Despite its central role in regression diagnostics, and in the study of probabilistic graphical models, the task of CI testing and its intrinsic difficulty is poorly understood. We will address two fundamental aspects of CI testing, by studying methods to appropriately calibrate CI tests, and by developing and analyzing powerful new CI tests. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

For specific questions or comments about this information including the NSF Project Outcomes Report, contact us.