Archive ChangeLog#

Below is a time-indexed list of notable events and changes to archival policies in the Software Heritage Archive. Each of them may have (had) an impact on how content is archived, and may explain apparent statistical anomalies or other changes in archival behavior over time. They are collected in this document for historical reasons.

2023#

2022#

2021#

  • 2021-12-11: Completed first archival of the current and historical Ubuntu releases. Regular crawling of those repositories enabled (tracking: T2400)

  • 2021-11-22: Made the package loaders write release objects instead of revisions (tracking: T3638)

  • 2021-11-17: Completed first archival of the Opam Coq repository. Regular crawling of those repositories enabled (tracking: T3717)

  • 2021-10-14: Completed archival of Bitbucket Mercurial repositories (tracking: T3338)

  • 2021-09-25: Completed first archival of the Opam repository. Regular crawling for those repositories enabled (tracking: T3424)

  • 2021-09-23: Completed first archival of the Logilab Heptapod instance. Regular crawling for those repositories enabled (tracking: T3597)

  • 2021-09-23: Completed first archival of the Heptapod instance. Regular crawling for those repositories enabled (tracking: T3600)

  • 2021-09-22: Completed first archival of the FOSS Heptapod instance. This is the first forge with mostly Mercurial origins. Regular crawling for those repositories enabled (tracking: T3581)

  • 2021-09-22: Disabled insertion of Git objects with non-canonical representations in the SWH data model (tracking: T399)

  • 2021-08-03: Completed first archival of SourceForge Mercurial repositories; regular crawling for those repositories enabled (tracking: T3374)

  • 2021-07-22: Completed first archival of SourceForge Git and Subversion repositories; regular crawling for those repositories enabled (tracking: T3374)

2020#

  • 2020-10-06 - 2020-11-23: source code crawlers were paused to avoid an out-of-disk condition caused by an unexpected delay in the arrival of new storage hardware. Push archival (both deposit and Save Code Now) remained in operation. (tracking: T2656)

  • 2020-09-15: completed first archival of GNU Guix System and added it to regular crawling (tracking: T2594)

  • 2020-06-11: completed integration with the IPOL journal, allowing paper authors to explicitly deposit source code to the archive (announcement)

  • 2020-05-25: completed first archival of NixOS and added it to regular crawling (tracking: T2411)

2019#

2018#

  • 2018-10-10: completed first archival of PyPI packages and added PyPI as a regularly crawled package repository (announcement)

  • 2018-09-25: completed integration with HAL, allowing paper authors to explicitly deposit source code to the archive (announcement)

  • 2018-08-31: completed first archival of public GitLab repositories from gitlab.com and added it as a regularly crawled forge (tracking: T1111)

  • 2018-03-21: completed archival of Google Code Mercurial repositories (tracking: T682)

  • 2018-02-20: completed archival of Debian packages and added Debian as a regularly crawled distribution (announcement)

2017#

  • 2017-10-02: completed archival of Google Code Subversion repositories (tracking: T617)

  • 2017-06-06: completed archival of Google Code Git repositories (tracking: T673)

2016#

  • 2016-04-04: completed archival of Gitorious (tracking: T312)

2015#

  • 2015-11-06: archived all GNU source code releases from ftp.gnu.org (tracking: T90)

  • 2015-07-28: started archiving public GitHub repositories