Skip directly to content

Minimize RSR Award Detail

Research Spending & Results

Award Detail

  • Mehraveh Salehi
  • (203) 535-8674
Award Date:07/23/2021
Estimated Total Award Amount: $ 256,000
Funds Obligated to Date: $ 256,000
  • FY 2021=$256,000
Start Date:08/01/2021
End Date:07/31/2022
Transaction Type:Grant
Awarding Agency Code:4900
Funding Agency Code:4900
CFDA Number:47.041
Primary Program Source:040100 NSF RESEARCH & RELATED ACTIVIT
Award Title or Description:SBIR Phase I: Technology Translation of a Universal Summarization System
Federal Award ID Number:2052394
DUNS ID:081258109
Program:SBIR Phase I
Program Officer:
  • Peter Atherton
  • (703) 292-8772

Awardee Location

Street:1826 N 57TH ST
Awardee Cong. District:

Primary Place of Performance

Street:1826 N. 57th St
Cong. District:07

Abstract at Time of Award

The broader impact of this Small Business Innovation Research (SBIR) Phase I project is to make machine-learning (ML) and artificial intelligence (AI) less costly, less biased, more accurate, more scalable, and easier to use through the process of commercializing “universal data summarization.” This summarization process works on any kind of data and significantly reduces dataset size without loss of information. Cost reductions include computational, core memory, storage, labor, and energy, all while reducing AI's environmental impact to provide "green AI." Universal summarization can also be used to measure and remove bias in data used to train AI/ML systems. Biases, caused by concepts in the data that are vastly over-represented while others are under-represented, will be reduced since for a small summary to be representative, it must be diverse and inclusive. In addition, accuracy will be increased by improving human analytics. Many data science tasks involve analysis by humans who must examine data to discover insight. These are arduous, expensive, time-consuming, and error-prone tasks made worse by human alert and decision fatigue caused by redundancy and repetitiveness. By reducing size and eliminating redundancy, human fatigue is reduced, human accuracy and efficiency are increased, and annotation costs are mitigated. This Small Business Innovation Research (SBIR) Phase I project will develop and commercialize the ability to simply and affordably perform universal summarization of massive datasets. A summarization is a process that selects from a dataset a small subset of data items – the few selected summary items accurately represent the information contained in the many remaining unselected items. The innovation is called “calibrated submodular summarization,” a technology that involves quickly, cost-effectively, and accurately measuring information in subsets of data, and then algorithmically selecting small subsets that have mathematical information content guarantees. This technology strips away redundancy, leaving behind an efficient representation of the core information in the dataset. The proposed activities will create a commercial service that can summarize massive amounts of data (of any kind) quickly, easily, and without requiring user expertise in machine learning, data science, or submodularity. This will greatly reduce costs, time-to-market, environmental impact and data bias for any data-rich industry. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

For specific questions or comments about this information including the NSF Project Outcomes Report, contact us.