Skip directly to content

Minimize RSR Award Detail

Research Spending & Results

Award Detail

Awardee:IOWA STATE UNIVERSITY OF SCIENCE AND TECHNOLOGY
Doing Business As Name:Iowa State University
PD/PI:
  • Hridesh Rajan
  • (515) 294-6168
  • hridesh@iastate.edu
Co-PD(s)/co-PI(s):
  • Daniel S Nettleton
  • Eric Weber
  • Pavan Aduri
  • Chinmay Hegde
Award Date:09/12/2019
Estimated Total Award Amount: $ 1,499,995
Funds Obligated to Date: $ 499,999
  • FY 2019=$499,999
Start Date:10/01/2019
End Date:09/30/2022
Transaction Type:Grant
Agency:NSF
Awarding Agency Code:4900
Funding Agency Code:4900
CFDA Number:47.070
Primary Program Source:040100 NSF RESEARCH & RELATED ACTIVIT
Award Title or Description:HDR TRIPODS: D4 (Dependable Data-Driven Discovery) Institute
Federal Award ID Number:1934884
DUNS ID:005309844
Parent DUNS ID:005309844
Program:HDR-Harnessing the Data Revolu
Program Officer:
  • Tracy Kimbrel
  • (703) 292-7924
  • tkimbrel@nsf.gov

Awardee Location

Street:1138 Pearson
City:AMES
State:IA
ZIP:50011-2207
County:Ames
Country:US
Awardee Cong. District:04

Primary Place of Performance

Organization Name:Iowa State University
Street:1138 Pearson
City:Ames
State:IA
ZIP:50011-2207
County:Ames
Country:US
Cong. District:04

Abstract at Time of Award

Data-driven discoveries are permeating critical fabrics of society. Unreliable discoveries lead to decisions that can have far-reaching and catastrophic consequences on society, defense, and the individual. Thus, the dependability of data-science lifecycles that produce discoveries and decisions is a critical issue that requires a new holistic view and formal foundations. This project will establish the Dependable Data Driven Discovery (D4) Institute at Iowa State University that will advance foundational research on ensuring that data-driven discoveries are of high quality. The activities of the D4 Institute will have a transformative impact on the dependability of data-science lifecycles. First, the problem definition itself will have a significant impact by helping future innovations beyond academia. While the notion of dependability is well-studied in the computer-systems literature, challenges in data science push the boundary of existing knowledge into the unknown. This institute's work will define D4, and increase data science's benefit to society by providing a transformative theory of D4. The second impact will come from the process of shared vocabulary development facilitated by this institute, and its result that would encourage experts across TRIPODS disciplines and domain experts to collaborate on common goals and challenges. Third, the institute will set research directions for D4 by providing funding for foundational research, which will have a separate set of impacts. Fourth, the institute will facilitate transdisciplinary training of a diverse cadre of data scientists through activities such as the Midwest Big Data Summer School and the D4 workshop. The project will advance the theoretical foundations of data science by fostering foundational research to enable understanding of the risks to the dependability of data-science lifecycles, to formalize the rigorous mathematical basis of the measures of dependability for data science lifecycles, and to identify mechanisms to create dependable data-science lifecycles. The project defines a risk to be a cause that can lead to failures in data-driven discovery, and the processes that plan for, acquire, manage, analyze, and infer from data collectively as the data-science lifecycle. For instance, an inference procedure that is significantly expensive can deliver late information to a human operator facing a deadline (complexity as a risk); if the data-science lifecycle provides a recommendation without an uncertainty measure for the recommendation, a human operator has no means to determine whether to trust the recommendation (uncertainty as a risk). Compared to recent works that have focused on fairness, accountability, and trustworthiness issues for machine learning algorithms, this project will take a holistic perspective and consider the entire data-science lifecycle. In phase I of the project the investigators will focus on four measures: complexity, resource constraints, uncertainty, and data freshness. In developing a framework to study these measures, this work will prepare the investigators to scale up their activities to other measures in phase II as well as to address larger portions of the data-science lifecycle. The study of each measure brings about foundational challenges that will require expertise from multiple TRIPODS disciplines to address. This project is jointly funded by HDR TRIPODS and the Established Program to Stimulate Competitive Research (EPSCoR). This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

For specific questions or comments about this information including the NSF Project Outcomes Report, contact us.