Archive ChangeLog

Below you can find a time-indexed list of notable events and changes to archival policies in the Software Heritage Archive. Each of them might have (had) an impact on how content is archived and explain apparent statistical anomalies or other changes in archival behavior over time. They are collected in this document for historical reasons.

2021

  • 2021-11-17: Completed first archival of the Opam Coq repository. Regular cralwing of those repositories enabled (tracking: T3717)

  • 2021-10-14: Completed archival of Bitbucket Mercurial repositories (tracking: T3338)

  • 2021-09-25 Completed first archival of the Opam repository. Regular crawling for those repositories enabled (tracking: T3424)

  • 2021-09-23 Completed first archival of the Logilab Heptapod instance. Regular crawling for those repositories enabled (tracking: T3597)

  • 2021-09-23 Completed first archival of the Heptapod instance. Regular crawling for those repositories enabled (tracking: T3600)

  • 2021-09-22 Completed first archival of the FOSS Heptapod instance. This is the first forge with mostly mercurial origins. Regular crawling for those repositories enabled (tracking: T3581)

  • 2021-08-03 Completed first archival of SourceForge Mercurial repositories; regular crawling for those repositories enabled (tracking: T3374)

  • 2021-07-22 Completed first archival of SourceForge Git and Subversion repositories; regular crawling for those repositories enabled (tracking: T3374)

2020

  • 2020-10-06 - 2020-11-23: source code crawlers have been paused to avoid an out of disk condition, due to an unexpected delay in the arrival of new storage hardware. Push archival (both deposit and save code now) remained in operation. (tracking: T2656)

  • 2020-09-15: completed first archival of, and added to regular crawling GNU Guix System (tracking: T2594)

  • 2020-06-11: completed integration with the IPOL journal, allowing paper authors to explicitly deposit source code to the archive (announcement)

  • 2020-05-25: completed first archival of, and added to regular crawling NixOS (tracking: T2411)

2019

2018

  • 2018-10-10: completed first archival of PyPI packages and added PyPI as a regularly crawled package repository (announcement)

  • 2018-09-25: completed integration with HAL, allowing paper authors to explicitly deposit source code to the archive (announcement)

  • 2018-08-31: completed first archival of public GitLab repositories from gitlab.com and added it as a regularly crawled forge (tracking: T1111)

  • 2018-03-21: completed archival of Google Code Mercurial repositories. (tracking: T682)

  • 2018-02-20: completed archival of Debian packages and added Debian as a regularly crawled distribution (announcement)

2017

  • 2017-10-02: completed archival of Google Code Subversion repositories (tracking: T617)

  • 2017-06-06: completed archival of Google Code Git repositories (tracking: T673)

2016

  • 2016-04-04: completed archival of the Gitorious (tracking: T312)

2015

  • 2015-11-06: archived all GNU source code releases from ftp.gnu.org (tracking: T90)

  • 2015-07-28: started archiving public GitHub repositories