swh.scrubber.journal_checker module#

Reads all objects in a swh-storage instance and recomputes their checksums.

swh.scrubber.journal_checker.get_datastore(journal_cfg) Datastore[source]#
class swh.scrubber.journal_checker.JournalChecker(db: ScrubberDb, config_id: int, journal_client_config: Dict[str, Any])[source]#

Bases: BaseChecker

Reads a chunk of a swh-storage database, recomputes checksums, and reports errors in a separate database.

run() None[source]#

Runs a journal client with the given configuration.

This method does not return, unless the client is configured with on_eof parameter equals to EofBehavior.STOP (stop in YAML).

process_kafka_messages(all_messages: Dict[str, List[bytes]])[source]#