Archive ChangeLog#
Below you can find a time-indexed list of notable events and changes to archival policies in the Software Heritage Archive. Each of them might have (had) an impact on how content is archived and explain apparent statistical anomalies or other changes in archival behavior over time. They are collected in this document for historical reasons.
2023#
2023-04-18 Completed first archival of annas-software.org gitea forge. Regular crawling of their repositories enabled (tracking: #4855)
2023-04-18 Completed first archival of Internet Systems Consortium’s gitlab forge. Regular crawling of their repositories enabled (tracking: #4854)
2023-04-13 Completed first archival of dev.sanctum.geek.nz cgit forge. Regular crawling of their repositories enabled (tracking: #4852)
2023-04-13 Completed first archival of trueelena.org cgit forge. Regular crawling of their repositories enabled (tracking: #4851)
2023-04-12 Completed first archival of Epita infra gitlab forge. Regular crawling of their repositories enabled (tracking: #4845)
2023-04-11 Completed first archival of INRAE MathNum department gitlab forge. Regular crawling of their repositories enabled (tracking: #4842)
2023-04-11 Completed first archival of Montpellier Bioinformatics Biodiversity platform gitlab forge. Regular crawling of their repositories enabled (tracking: #4843)
2023-04-07 Completed first archival of Garbaye gitea forge. Regular crawling of their repositories enabled (tracking: #4841)
2023-04-07 Completed first archival of Alaryso’s personal projects gitea forge. Regular crawling of their repositories enabled (tracking: #4833)
2023-04-05 Completed first archival of Replicant git repositories (split in 7 forges). Regular crawling of their repositories enabled (tracking: #4685)
2023-04-03 Completed first archival of Software Heritage gitlab forge. Regular crawling of their repositories enabled (tracking: #4683)
2023-03-30 Completed first archival of CodeAurora, The Global Gathering for Mobile Open Source. Regular crawling of its repositories enabled (tracking: #4813)
2023-02-13 Completed first archival of AFPY (Association Francophone Python) git repositories. Regular crawling of its repositories enabled (tracking: #4674)
2023-01-05 Completed first archival of the University of Stuttgart gitlab forge. Regular crawling of their repositories enabled (tracking: #4712)
2023-01-05 Completed first archival of FFDN (Fédération des Fournisseurs d’Accès Internet Associatifs). Regular crawling of their repositories enabled (tracking: #4687)
2023-01-03 Completed first archival of DeuxFleurs’s gitea forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: #4686)
2023-01-03 Completed first archival of Minetest Land’s gitea forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: #4684)
2022#
2022-12-14 Completed first archival of Université Gustave Eiffel git repositories, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: #4675)
2022-11-13 Completed first archival of Jey Hess’s git repositories, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: #4666)
2022-11-13 Completed first archival of Gitlab forge of Université de Lille, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: #4668)
2022-11-13 Completed first archival of OpenWork Ltd’s and Adrian Cochrane’s projects, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: #4667)
2022-11-04 Deployed improvements to the Git loader that result in a 3-5x increase in the throughput of Git repository archival. As a result of this the crawling frequency of Git repositories will also increase. (tracking: D8808)
2022-11-03 Completed first archival of the Gentoo forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: #4648)
2022-09-27 Completed first archival of the Gitgud forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: T4498)
2022-09-27 Completed first archival of the Madhouse project forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: T4500)
2022-09-27 Completed first archival of the OpenGeoSys forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: T4499)
2022-09-27 Completed first archival of the Spork forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: T4501)
2022-09-27 Completed first archival of Alex Schroeder’s forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: T4502)
2022-09-27 Completed first archival of the RWTH Aachen University secondary forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: T4504)
2022-09-27 Completed first archival of the RWTH Aachen University forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: T4503)
2022-09-21 Completed first archival of Case Duckworth’s forge, as requested via Add forge now. Regular crawling of its repositories enabled (tracking: T4505)
2021#
2021-12-11 Completed first archival of the current and historical Ubuntu releases. Regular crawling of those repositories enabled (tracking: T2400)
2021-11-22 Made the package loaders write release objects instead of revisions (tracking: T3638)
2021-11-17: Completed first archival of the Opam Coq repository. Regular crawling of those repositories enabled (tracking: T3717)
2021-10-14: Completed archival of Bitbucket Mercurial repositories (tracking: T3338)
2021-09-25 Completed first archival of the Opam repository. Regular crawling for those repositories enabled (tracking: T3424)
2021-09-23 Completed first archival of the Logilab Heptapod instance. Regular crawling for those repositories enabled (tracking: T3597)
2021-09-23 Completed first archival of the Heptapod instance. Regular crawling for those repositories enabled (tracking: T3600)
2021-09-22 Completed first archival of the FOSS Heptapod instance. This is the first forge with mostly mercurial origins. Regular crawling for those repositories enabled (tracking: T3581)
2021-09-22 Disabled insertion of Git objects with non-canonical representations in the SWH data model (tracking: T399)
2021-08-03 Completed first archival of SourceForge Mercurial repositories; regular crawling for those repositories enabled (tracking: T3374)
2021-07-22 Completed first archival of SourceForge Git and Subversion repositories; regular crawling for those repositories enabled (tracking: T3374)
2020#
2020-10-06 - 2020-11-23: source code crawlers have been paused to avoid an out of disk condition, due to an unexpected delay in the arrival of new storage hardware. Push archival (both deposit and save code now) remained in operation. (tracking: T2656)
2020-09-15: completed first archival of, and added to regular crawling GNU Guix System (tracking: T2594)
2020-06-11: completed integration with the IPOL journal, allowing paper authors to explicitly deposit source code to the archive (announcement)
2020-05-25: completed first archival of, and added to regular crawling NixOS (tracking: T2411)
2019#
2019-09-10: completed first archival of Bitbucket Git repositories and added Bitbucket as a regularly crawled forge (tracking: T592)
2019-06-30: completed first archival of, and added to regular crawling, several GitLab instances: 0xacab.org, framagit.org, gite.lirmm.fr, gitlab.common-lisp.net, gitlab.freedesktop.org, gitlab.gnome.org, gitlab.inria.fr, salsa.debian.org
2019-06-12: completed first archival of CRAN packages and added CRAN as a regularly crawled package repository (tracking: T1709)
2019-06-11: completed a full archival of GNU source code releases from ftp.gnu.org, and added it to regular crawling (tracking: T1722)
2019-05-27: completed a full archival of NPM packages and added it as a regularly crawled package repository (tracking: T1378)
2019-01-10: enabled the save code now service, allowing users to explicitly request archival of a specific source code repository (announcement)
2018#
2018-10-10: completed first archival of PyPI packages and added PyPI as a regularly crawled package repository (announcement)
2018-09-25: completed integration with HAL, allowing paper authors to explicitly deposit source code to the archive (announcement)
2018-08-31: completed first archival of public GitLab repositories from gitlab.com and added it as a regularly crawled forge (tracking: T1111)
2018-03-21: completed archival of Google Code Mercurial repositories. (tracking: T682)
2018-02-20: completed archival of Debian packages and added Debian as a regularly crawled distribution (announcement)
2017#
2017-10-02: completed archival of Google Code Subversion repositories (tracking: T617)
2017-06-06: completed archival of Google Code Git repositories (tracking: T673)
2016#
2015#
2015-11-06: archived all GNU source code releases from ftp.gnu.org (tracking: T90)
2015-07-28: started archiving public GitHub repositories