Getting started#
The Vault is a service in charge of reconstructing parts of the archive
as self-contained bundles, that can then be imported locally, for
instance in a Git repository. This is basically where you can do a
git clone
of a repository stored in Software Heritage.
The Vault is asynchronous : you first need to do a request to prepare the bundle you need, and then a second request to fetch the bundle once the Vault has finished to reconstitute the bundle.
Example: retrieving a directory#
First, ask the Vault to prepare your bundle:
curl -X POST https://archive.softwareheritage.org/api/1/vault/flat/:swhid/
where :swhid
is a SoftWare Heritage persistent IDentifiers (SWHIDs). This initial request and all
subsequent requests to this endpoint will return some JSON data containing
information about the progress of bundle creation:
{
"id": 42,
"fetch_url": "/api/1/vault/flat/:swhid/raw/",
"swhid": ":swhid",
"progress_message": "Creating tarball...",
"status": "pending"
}
Once the status is done
, you can fetch the bundle at the address
given in the fetch_url
field.
curl -o bundle.tar.gz https://archive.softwareheritage.org/api/1/vault/flat/:swhid/raw
tar xaf bundle.tar.gz
E-mail notifications#
You can also ask to be notified by e-mail once the bundle you requested is
ready, by giving an email
POST parameter:
curl -X POST -d 'email=example@example.com' \
https://archive.softwareheritage.org/api/1/vault/directory/:dir_id/
API reference#
For a more exhaustive overview of the Vault API, see the Vault API Reference.