Research Spending & Results

Award Detail

Doing Business As Name:Rutgers University New Brunswick
  • Matthew S Weber
  • (732) 932-7500
Award Date:02/02/2016
Estimated Total Award Amount: $ 29,883
Funds Obligated to Date: $ 29,883
  • FY 2016=$29,883
Start Date:02/01/2016
End Date:01/31/2017
Transaction Type:Grant
Awarding Agency Code:4900
Funding Agency Code:4900
CFDA Number:47.070
Primary Program Source:040100 NSF RESEARCH & RELATED ACTIVIT
Award Title or Description:2016 Web Archiving Data Workshops: Travel Support for U.S. Students
Federal Award ID Number:1624067
DUNS ID:001912864
Parent DUNS ID:001912864
Program Officer:
  • Maria Zemankova
  • (703) 292-8930

Awardee Location

Street:33 Knightsbridge Road
Awardee Cong. District:06

Primary Place of Performance

Organization Name:Rutgers University New Brunswick
Street:3 Rutgers Plaza
City:New Brunswick
County:New Brunswick
Cong. District:06

Abstract at Time of Award

In recent years, significant focus has turned to archival data sources in the Big Data and computational social science research communities. Archived Web data is unprecedented in scope and scale, creating major challenges for scholars. The 2016 Web Archiving workshop series provides a premier venue for engaging with Web archives, and for learning the skills necessary to utilize archived Web data for knowledge discovery. This award will support travel of Ph.D. students at U.S. universities who have submitted research proposals that have been accepted as part of the workshop series. Participants in this workshop series will gain critical skills necessary for developing cutting-edge research agendas. Training provided to Ph.D. students will help to prepare the next generation of Big Data scholars. The workshop series will consist of two events; the first workshop will be held at the University of Toronto from March 3 - 5, 2016, and the second workshop will be held at the Library of Congress in Washington DC from June 15 - 17. The workshop co-chairs are Matthew Weber (Rutgers University), Jimmy Lin (University of Waterloo) and Ian Milligan (University of Waterloo). The workshop series builds on prior research agendas established across disciplines for working with archived Web data. Specifically, this work responds to ongoing calls in computational social science for improved access to archived Web data. Workshop attendees will further be asked to engage with a variety of topics of societal importance. The products of the workshop focus on integrating scholarship in computer science with the social sciences and demonstrating the potential of interdisciplinary Big Data scholarship. For further information see the workshop homepage (

Project Outcomes Report


This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Project Outcomes: Travel Support for 2016 Web Archiving Data Workshops

Matthew S. Weber (Rutgers University)

1. Introduction

Archival data are an important source for the Big Data and computational social science research communities. It is thus critically important that training opportunities continue to be provided for young scholars in order to continue the development of this research agenda. Moreover, new skills are needed to advance the potential of archive-based research and Big Data computation.

The 2016 Archives Unleashed Workshop Series provided a major forum for developing new tools and research questions based on research conducted using archived Web data and Big Data analysis techniques. The first workshop in the series took place in Toronto, Canada, from March 3 – 5, 2016. The second workshop in the series took place at the Library of Congress from June 14 – 15, 2016. The first workshop had 37 graduate student participants and 8 faculty participants. The second workshop had 40 graduate student participants and 6 faculty participants.


2. Travel Support

The NSF grant funded 10 students to attend the first workshop and 15 students to attend the second workshop. Seven of the students were female and 18 were male. Special consideration was given to women and under-represented minorities. In addition, organizers in both locations worked with local planners to ensure that accommodations were made for those attending the event with young children; due to the small size of the event, organizers addressed such needs on a case-by-case basis.


2.1 Conference Organizers

The conference co-chairs were Dr. Matthew Weber of Rutgers University, Dr. Jimmy Lin of the University of Waterloo, and Dr. Ian Milligan of the University of Waterloo. Dr. Weber specializes in organizational change in relation to the use of new information and communication technologies. Dr. Lin specializes in creating tools to help users make sense of large amounts of data, with a focus on large-scale distributed algorithms and infrastructure for data analytics. Dr. Milligan is a digital historian who specializes in exploring how historians can fruitfully use web archives and other large digital repositories.

2.2 Archives Unleashed, University of Toronto, Toronto, Canada

The Archives Unleashed workshop series began with the first Archives Unleashed hackathon and workshop at the University of Toronto’s John P. Robarts Research Library on March 3-5, 2016.

The full program from the first event is available at:

Code and final projects from the first event are available here:

2.3 Archives Unleashed, Library of Congress, Washington DC

The Archives Unleashed workshop series continued with the second Archives Unleashed hackathon and workshop at the Library of Congress, Washington DC, from June 15 - 17. The final day also included presentations on ongoing state-of-the-art research utilizing Internet Archives.

Content from the second event is permanently available at

Final projects from the second event are available here:

3. Broader Impact

The growing importance of archived Web data as a record of society on the Web, paired with the challenges of access to such large-scale data sources, means that new tools and new programs of research are needed. The workshops supported by this funding helped to educate and support the next generation of scholars who will tackle these critical issues. An emphasis on recruiting women and other underrepresented minorities helped to broaden participation and enhance the overall impact of these workshops. Moreover, this work is situated within the domain of computational social science; by engaging social scientists and computer scientists in the same space, this work helps to improve the standing of US-based scholars within the global computational landscape. Additionally, tools developed during the workshop series are publicly available via the GitHub open source repository.



Appendix I: List of Supported Students


Archives Unleashed 1.0: Toronto, Canada, 2016


Alexander Nwala, Old Dominion University

Allison Hegel, University of California, Los Angeles

Jonathan Armoza, New York University

Kelsey Utne, Cornell University

Kyle Parry, University of Rochester

Mat Kelly, Old Dominion University

Ruqin Ren, University of Southern California

Sawood Alam, Old Dominion University

Teis Moller Kristensen, Rutgers University

Yu Xu, University of Southern California


Archives Unleashed 2.0: Washington D.C., 2016


Alexander Nwala, Old Dominion University

Allie Kosterich, Rutgers University 

Allison Hegel, University of California, Los Angeles

Emily Gade, University of Washington

Erin E. McCabe, Pratt Institute

Jaimie Murdock, Indiana University

John Andrew Berlin, Old Dominion University

Keesha Burke Henderson, Morehouse College

Luis Meneses, Texas A&M University

Mat Kelly, Old Dominion University

Mohamed Aturban, Old Dominion University

Sawood Alam, Old Dominion University

Shawn M. Jones, Old Dominion University

Shawn Walker, University of Washington 

Teis Kristensen, Rutgers University


Last Modified: 07/13/2017
Modified by: Matthew S Weber
