There is not one but at least 4 different object stores directly managed by the Software Heritage group:
Rocquencourt replica archive
The Main archive¶
Uffizi Located in Rocquencourt
Banco Located in Rocquencourt, in a different building than the main one
The Azure archive uses an Azure Block Storage backend, implemented in the swh.objstorage_backends.azure.AzureCloudObjStorage Python class.
Internally, that class uses the block_blob_service Azure API.
- The AWS archive is stored in the softwareheritage Amazon S3 bucket, in the US-East
(N. Virginia) region. That bucket is public.
It is being continuously populated by the content_replayer program.
Softwareheritage Python programs access it using a libcloud backend.
- A Python program which reads new objects from Kafka and then copies them from the
object storages on Banco and Uffizi.