Skip directly to content

Minimize RSR Award Detail

Research Spending & Results

Award Detail

Awardee:REGENTS OF THE UNIVERSITY OF MICHIGAN
Doing Business As Name:Regents of the University of Michigan - Ann Arbor
PD/PI:
  • Snigdha Panigrahi
  • (530) 304-4252
  • psnigdha@umich.edu
Award Date:06/14/2021
Estimated Total Award Amount: $ 150,000
Funds Obligated to Date: $ 47,514
  • FY 2021=$47,514
Start Date:08/01/2021
End Date:07/31/2024
Transaction Type:Grant
Agency:NSF
Awarding Agency Code:4900
Funding Agency Code:4900
CFDA Number:47.049
Primary Program Source:040100 NSF RESEARCH & RELATED ACTIVIT
Award Title or Description:Reusing Data Efficiently for Iterative and Integrative Inference
Federal Award ID Number:2113342
DUNS ID:073133571
Parent DUNS ID:073133571
Program:STATISTICS
Program Officer:
  • Yong Zeng
  • (703) 292-7902
  • yzeng@nsf.gov

Awardee Location

Street:3003 South State St. Room 1062
City:Ann Arbor
State:MI
ZIP:48109-1274
County:Ann Arbor
Country:US
Awardee Cong. District:12

Primary Place of Performance

Organization Name:Regents of the University of Michigan - Ann Arbor
Street:3003 South State St. Room 1062
City:Ann Arbor
State:MI
ZIP:48109-1274
County:Ann Arbor
Country:US
Cong. District:12

Abstract at Time of Award

Drawing knowledge and reproducible results from complex data drives a broad range of scientific disciplines. From a statistical viewpoint, model selection and inference are the two fundamental tasks, the latter often pursued only after models are chosen through data-driven procedures. Naively using the same data for both tasks creates complicated correlations between the selected models and their inferential properties, which inevitably affects the reproducibility of findings from these models. The investigator develops methods for reusing data from selection to compensate for these correlations while not squandering away information from the full data. Finding immediate use in biomedical problems, observational studies in the behavioral sciences, and engineering applications, the methods will aid discoveries even when analyses rely on scarce samples. This research has a broader outreach component in creating opportunities for interdisciplinary engagement, training statisticians, and contributing to a new graduate curriculum. The project is geared towards efficient and reproducible inference through a reuse of data from the model selection steps. Combining ideas from convex optimization, probability theory, and statistical learning, the project seeks solutions for two main thrusts. In the first thrust, the investigator develops methods to integrate fresh samples available at a later point in time with information from selection. This workflow is realized in modern applications such as online streaming of data, which demand iterative inference on the fly. In the second thrust, the investigator explores integrative inference by combining selected models from different batches or splits or sources of data. Aggregating inference from multiple sources through a reuse of samples will have the potential for new discoveries that any single dataset may fail to report due to a lack of power. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

For specific questions or comments about this information including the NSF Project Outcomes Report, contact us.