Skip directly to content

Minimize RSR Award Detail

Research Spending & Results

Award Detail

Awardee:NEVADA SYSTEM OF HIGHER EDUCATION
Doing Business As Name:Board of Regents, NSHE, obo University of Nevada, Reno
PD/PI:
  • Feng Yan
  • (775) 784-4040
  • fyan@unr.edu
Award Date:05/04/2021
Estimated Total Award Amount: $ 517,459
Funds Obligated to Date: $ 201,110
  • FY 2021=$201,110
Start Date:05/01/2021
End Date:04/30/2026
Transaction Type:Grant
Agency:NSF
Awarding Agency Code:4900
Funding Agency Code:4900
CFDA Number:47.070
Primary Program Source:040100 NSF RESEARCH & RELATED ACTIVIT
Award Title or Description:CAREER: Automated and Efficient Machine Learning as a Service
Federal Award ID Number:2048044
DUNS ID:146515460
Parent DUNS ID:067808063
Program:CSR-Computer Systems Research
Program Officer:
  • Alexander Jones
  • (703) 292-8950
  • alejones@nsf.gov

Awardee Location

Street:1664 North Virginia Street
City:Reno
State:NV
ZIP:89557-0001
County:Reno
Country:US
Awardee Cong. District:02

Primary Place of Performance

Organization Name:Board of Regents, NSHE obo University of Nevada, Reno
Street:1664 North Virginia Street
City:Reno
State:NV
ZIP:89557-0001
County:Reno
Country:US
Cong. District:02

Abstract at Time of Award

Machine-Learning-as-a-Service (MLaaS) is an emerging computing paradigm that provides optimized execution of machine learning tasks, such as model design, model training, and model serving, on cloud infrastructure. Explosive growth in model complexity and data size along with the surging demands of MLaaS is already resulting in substantial increases in computational resource and energy requirements. Unfortunately, existing MLaaS systems have poor resource management and limited support for user specified performance and cost requirements, exacerbating waste in computing resources and energy. This project aims to utilize the unique features of MLaaS to design efficient, automated, and user-centric MLaaS systems. This approach will significantly reduce resource waste and shorten the model design cycles through a variety of novel optimization approaches and by eliminating candidate models that fail to meet model serving latency and target accuracy. To support complete MLaaS workflow, this project will also develop MLaaS model serving methodologies that can meet service level latency requirements with minimum resource consumption using intelligent autoscaling. This project has the potential to tremendously reduce the resource and energy consumptions as well as the carbon footprint associated with the fast-growing societal demands in machine learning and cloud computing. Important insights and technologies will be produced targeting resource management and energy saving of the next-generation machine learning systems and cloud infrastructure. The findings of this project will also contribute to related fields of parallel and distributed systems, performance evaluation and optimization, and green computing. This project will carry out substantial integrated education activities including new course and online education development, integration of industry feedback in education. Additionally, the work will impact undergraduate and graduate students by training them in the art of system optimization combined with the latest machine learning domain knowledge while combining outreach and engagement of students from underrepresented groups and especially women. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

For specific questions or comments about this information including the NSF Project Outcomes Report, contact us.