Archival and Retrieval Flow
Archival is split into two subflows: *Create datablocks* and *Move datablocks to LTS*.
An archival job can contain multiple datasets; for simplicity, only the single-dataset case is depicted here.
Nomenclature
| Identifier | Description |
|---|---|
| Flow | Sequence of tasks and subflows |
| Subflow | Flow triggered by a parent flow |
| Job | SciCat term corresponding to an archival flow; may contain multiple datasets as parameters (see the example below) |
| Archive Flow | Subflow that runs archiving for one dataset |
| User Error | SciCat term for errors caused by the user, e.g. the dataset is incomplete or cannot be found |
| System Error | SciCat term for an unrecoverable (non-transient) error in the system |
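For illustration, a job as referenced in the table above might look like the following minimal sketch. The field names follow the notes in the diagrams below; the shape of the dataset entries is an assumption, as the exact schema is defined by SciCat:

```json
{
  "id": "id",
  "type": "archive",
  "datasetlist": [
    { "pid": "dataset-1", "files": [] }
  ]
}
```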
Critical Sections
A sequence of tasks that needs to succeed atomically (all succeed or all are rolled back) is depicted as a critical section below. Such a sequence has specific error handling associated with it in order to roll back all the tasks in that sequence.
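Conceptually (a minimal sketch, not the actual implementation), a critical section pairs each task with a rollback action; if any task fails, the rollbacks of all completed tasks run in reverse order:

```python
from typing import Callable, List, Tuple

Step = Tuple[Callable[[], None], Callable[[], None]]  # (action, rollback)

def run_critical_section(steps: List[Step]) -> None:
    """Run steps in order; if any fails, roll back all completed steps."""
    completed: List[Callable[[], None]] = []
    try:
        for action, rollback in steps:
            action()
            completed.append(rollback)
    except Exception:
        # Roll back in reverse order so later side effects are undone first.
        for rollback in reversed(completed):
            rollback()
        raise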
Retries
All tasks interacting with external services (i.e. SciCat) implement retries. Since these retries happen at the HTTP request level rather than at the task level, they are not depicted below.
Reading from the LTS implements retries at the task level and is therefore depicted below.
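To illustrate the difference (a sketch assuming a `requests`-based HTTP client; the actual client may differ), HTTP-level retries are attached to the session so that individual SciCat tasks contain no retry logic, while the LTS read retries explicitly at the task level:

```python
import time

import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

# HTTP-request-level retries: every SciCat call on this session retries
# transparently, so no retry logic appears in the tasks themselves.
session = requests.Session()
session.mount(
    "https://",
    HTTPAdapter(max_retries=Retry(total=5, backoff_factor=1.0,
                                  status_forcelist=[502, 503, 504])),
)

def read_file_from_lts(path: str) -> bytes:
    """Placeholder for the actual LTS read; assumed to raise OSError on failure."""
    with open(path, "rb") as f:
        return f.read()

def read_from_lts_with_retries(path: str, attempts: int = 3, delay_s: float = 60.0) -> bytes:
    """Task-level retries for LTS reads, as depicted in the diagrams below."""
    for attempt in range(1, attempts + 1):
        try:
            return read_file_from_lts(path)
        except OSError:
            if attempt == attempts:
                raise
            time.sleep(delay_s)
```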
Sequence Diagrams
Archival
The diagram below shows the different steps in the archiving flow. Each step executed by the Worker
is implemented as its own task (except steps executed during error handling).
```mermaid
sequenceDiagram
    autonumber
    participant AS as Archival Service
    participant L as Landing Zone
    participant W as Worker
    participant ST as Staging
    participant S as SciCat
    participant LTS as LTS
    Note over AS, S: Subflow: Create datablocks
    S --) AS: Archive: POST /api/v1/jobs
    Note right of AS: Job {"id": "id", "type": "archive", "datasetlist": [], ... } as defined in SciCat
    AS --) W: Create archival flow for each dataset
    AS -->> S: Respond to SciCat
    Note left of S: {"uuid": "<flow uuid>", "name": "<flow name>"}
    W --) S: PATCH /api/v4/Jobs/{JobId}
    Note left of S: {"jobStatusMessage": "inProgress"}
    critical Job for datasetlist
        Note over L, LTS: Subflow: Move dataset to LTS
        critical Subflow for {DatasetId}
            W --) S: PATCH /api/v4/Datasets/{DatasetId}
            Note left of S: {"datasetlifecycle": {"archiveStatusMessage": "started"}}
            W -->> S: GET /api/v4/Datasets/{DatasetId}/origdatablocks
            W -->> L: Create datablocks
            L -->> ST: Move datablocks to staging
            W -->> S: Register datablocks: POST /api/v4/Datasets/{DatasetId}/datablocks
            W -->> L: Cleanup dataset files, datablocks
            Note right of L: Dataset files only get cleaned up when everything succeeds
        option Failure: user error
            W --) S: Report error: PATCH /api/v4/Datasets/{DatasetId}
            Note left of S: {"datasetlifecycle": {"archiveStatusMessage": "missingFilesError", "archivable": false, "retrievable": false}}
            W -->> ST: Cleanup datablocks
            W -->> L: Cleanup datablocks
            Note right of L: No cleanup of dataset files
        option Failure: system error
            W --) S: Report error: PATCH /api/v4/Datasets/{DatasetId}
            Note left of S: {"datasetlifecycle": {"archiveStatusMessage": "scheduleArchiveJobFailed", "archivable": false, "retrievable": false}}
            W -->> ST: Cleanup datablocks
            W -->> L: Cleanup datablocks
            Note right of L: No cleanup of dataset files
        end
    option Failure: user error
        W --) S: Report error: PATCH /api/v4/Jobs/{JobId}
        Note left of S: {"jobStatusMessage": "finishedWithDatasetErrors"}
    option Failure: system error
        W --) S: Report error: PATCH /api/v4/Jobs/{JobId}
        Note left of S: {"jobStatusMessage": "finishedUnsuccessful"}
    end
    Note over W, LTS: Subflow: Move datablocks to LTS
    loop for each datablock
        critical
            ST -->> LTS: Move datablock to LTS
            loop Retry
                ST -->> LTS: Verify checksum of datablock in LTS
            end
        option Moving datablock to LTS failed
            W -->> LTS: Cleanup LTS folder
            W -->> S: Report error
            Note left of S: {"jobStatusMessage": "scheduleArchiveJobFailed"}
        end
        critical
            W -->> LTS: Verify datablock in LTS
            W -->> L: Cleanup dataset files, datablock
        option Datablock validation failed
            W -->> S: Report error
            Note left of S: {"jobStatusMessage": "scheduleArchiveJobFailed"}
            W -->> LTS: Cleanup datablocks
        end
    end
    W -->> S: PATCH /api/v4/Datasets/{DatasetId}
    Note left of S: {"datasetlifecycle": {"archivable": false, "retrievable": true, "archiveStatusMessage": "datasetOnArchiveDisk"}}
    W -->> S: PATCH /api/v4/Jobs/{JobId}
    Note left of S: {"jobStatusMessage": "finishedSuccessful"}
```
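The status updates in the diagram above all reduce to PATCH requests against the SciCat API. A minimal sketch of the final lifecycle update (the `datasetOnArchiveDisk` step), assuming a base URL and bearer-token authentication that are placeholders here:

```python
import requests

SCICAT_URL = "https://scicat.example.org"  # placeholder base URL
TOKEN = "..."  # placeholder SciCat access token

def update_dataset_lifecycle(dataset_id: str, lifecycle: dict) -> None:
    """PATCH /api/v4/Datasets/{DatasetId} with a partial datasetlifecycle."""
    resp = requests.patch(
        f"{SCICAT_URL}/api/v4/Datasets/{dataset_id}",
        json={"datasetlifecycle": lifecycle},
        headers={"Authorization": f"Bearer {TOKEN}"},
        timeout=30,
    )
    resp.raise_for_status()

# Final update once all datablocks are verified in the LTS:
update_dataset_lifecycle(
    "dataset-1",
    {
        "archivable": False,
        "retrievable": True,
        "archiveStatusMessage": "datasetOnArchiveDisk",
    },
)
```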
Retrieval
The diagram below shows the different steps in the retrieval flow. Each step executed by the Worker is implemented as its own task (except steps executed during error handling).
```mermaid
sequenceDiagram
    autonumber
    participant AS as Archival Service
    participant J as Worker
    participant ST as Staging
    participant S as SciCat
    participant LTS as LTS
    S --) AS: Retrieve: POST /api/v1/jobs
    Note right of AS: Job {"id": "id", "type": "retrieve", "datasetlist": [], ... } as defined in SciCat
    AS --) J: Create retrieval flow
    AS -->> S: Respond to SciCat
    Note left of S: {"uuid": "<flow uuid>", "name": "<flow name>"}
    J -->> S: PATCH /api/v4/Jobs/{JobId}
    Note left of S: {"jobStatusMessage": "inProgress"}
    J -->> S: PATCH /api/v4/Datasets/{DatasetId}
    Note left of S: {"datasetlifecycle": {"retrieveStatusMessage": "started"}}
    critical
        LTS -->> ST: Download datablocks from LTS
    option Retrieval failure
        J --) S: Report error: PATCH /api/v4/Datasets/{DatasetId}
        Note left of S: {"retrieveStatusMessage": "datasetRetrievalFailed",<br> "retrieveReturnMessage": "retrievalFailed"}
        J --) S: Report error: PATCH /api/v4/Jobs/{JobId}
        Note left of S: {"jobStatusMessage": "finishedWithDatasetErrors",<br> "jobResultObject": [JobResultEntry]}
        J -->> ST: Cleanup files
    end
    critical
        J -->> ST: Validate datablocks
    option Validation failure
        J -->> S: Report error: PATCH /api/v4/Datasets/{DatasetId}
        Note left of S: {"retrieveStatusMessage": "datasetRetrievalFailed",<br> "retrieveReturnMessage": "retrievalFailed"}
        J -->> ST: Cleanup files
    end
    J -->> S: PATCH /api/v4/Jobs/{JobId}
    Note left of S: {"jobStatusMessage": "finishedSuccessful"}
    J -->> S: PATCH /api/v4/Datasets/{DatasetId}
    Note left of S: {"datasetlifecycle": {"retrieveStatusMessage": "datasetRetrieved",<br> "retrievable": true}}
```
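The `Validate datablocks` step above amounts to comparing checksums of the downloaded archives against expected values. A minimal sketch, assuming SHA-256 and that the expected checksums are available as a name-to-checksum mapping (both are assumptions):

```python
import hashlib
from pathlib import Path

def file_checksum(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream the file through SHA-256 to avoid loading it into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def validate_datablocks(expected: dict[str, str], staging_dir: Path) -> list[str]:
    """Return the names of datablocks whose checksum does not match."""
    return [
        name
        for name, checksum in expected.items()
        if file_checksum(staging_dir / name) != checksum
    ]
```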
Note: `updatedBy` and `updatedAt` are omitted for brevity, but they must also be set for every update of the job status and the datasetlifecycle.
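For example, a job status update would then carry these fields as well (values are placeholders):

```json
{
  "jobStatusMessage": "finishedSuccessful",
  "updatedBy": "archiver-service",
  "updatedAt": "2024-01-01T12:00:00Z"
}
```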
JobResult
The result of a retrieval job contains a list of `JobResultEntry` objects, each of which contains a URL from which end users can download the datablocks.
JobResultEntry
```json
{
  "datasetId": "<datasetId>",
  "name": "str",
  "size": "str",
  "archiveId": "str",
  "url": "str"
}
```
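As a sketch of how such entries could be assembled on the worker side (field names follow the schema above; the shape of the datablock input and the URL construction are assumptions):

```python
from dataclasses import dataclass, asdict

@dataclass
class JobResultEntry:
    datasetId: str
    name: str
    size: str
    archiveId: str
    url: str

def build_job_result(dataset_id: str, datablocks: list, base_url: str) -> list:
    """One entry per retrieved datablock, with a download URL for end users."""
    return [
        asdict(
            JobResultEntry(
                datasetId=dataset_id,
                name=db["name"],
                size=str(db["size"]),
                archiveId=db["archiveId"],
                url=f"{base_url}/{db['name']}",  # assumed URL scheme
            )
        )
        for db in datablocks
    ]
```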