Government Information Data Rescue

Links to trusted repositories that have rescued U.S. government data

Where Can I Archive Existing Information?

Websites

Datasets and Reports

Data Rescue Activist Tools

These data activist organizations focus on rescuing and preserving data:

  • ArchiveBox has archived datasets from data.gov, CIBP, USCIS, NOAA, NASA, and NSIDC
  • The Archive Team at the Internet Archive has a project archiving datasets from the U.S. Government
  • GitHub: Awesome Datahoarding provides lists of tools for web harvesting
  • GovDiff shows side-by-side comparisons of government website changes
  • MIT Libraries: Data Management Checklist provides a checklist for curating data rescue efforts
  • r/DataHoarder is a subreddit of data preservation activists
  • Safeguarding Research Discourse Group is a preservation group hosted outside the U.S.
  • Tracking Government Information is a preservation group of four academic librarians
  • WebRecorder has archived more than 8 TB of government sites