Data.gov Archive
The Library Innovation Lab at Harvard Law School has launched a new archive
of data.gov. This 16 TB repository contains over 311,000 datasets from 2024 and
2025 and is updated daily. The initiative aims to preserve public datasets for
research, policymaking, and public use. In addition, the project has released
open-source software to help others replicate the archive and build similar
repositories.