ESA EO Data Preservation System

Figures & Tables

Figure 1

EO Data Preservation System.

Figure 2

Master Archive service locations and duties in Europe.

Figure 3

Master Archive infrastructure and high-level data flow between archive centers.

Figure 4

Checks performed during the data ingestion process.

Table 1

Checks performed by the Master Archive during the data ingestion process.

| # | Centre | Operation | Verification |
|---|--------|-----------|--------------|
| 1 | Main | Ingestion | The list of files is compared to any delivered inventory. |
| 1’ | Main | Ingestion | The structure and the content (repository, datasets, products, files) are compared with the delivery spreadsheet delivered by ESA. |
| 2 | Main | Ingestion | The total number of products is compared to the delivery information. |
| 3 | Main | Ingestion | The actual file MD5 checksum is compared to the value extracted from the product metadata (manifest file or attached checksum file). |
| 3’ | Main | Ingestion | If data is included in a container (zip, tgz, …), the integrity of the container is verified (i.e. container content can be accessed). |
| 4 | Main | Ingestion | After the generation of the zip container, the files are extracted into a scratch directory and checked against the content of the checksum.txt file. This operation is logged for future use by the global verification of the dataset ingestion. |
| 5 | Main | Ingestion | The MD5 hash codes computed by Quantum from the products are queried from the StorNext database and compared to the DMPC MD5 hash codes before products are set to “TAPE” status. |
| 6 | Both | Ingestion | LTO-7 drives apply an automatic verify-after-write technology to immediately check the data as it is being written. |
| 7 | Both | Backend | Both Quantum libraries include EDLM. The Extended Data Life Management feature ensures that tapes are trouble-free (based on tape scan and tape memory analysis). Tape scanning and analysis are performed following predefined policies (max. 4 tapes per day, i.e. 7% of a 10 PByte archive per month using 2 LTO-7 EDLM drives). Suspect tapes are automatically copied to new tapes. |
| 8 | Main | Validation | After the ingestion process, where data has been verified at product level, a global ingestion verification is performed using the DPMC database information, the initial media inventory and the ingestion process log files. |
| 9 | Backup | Ingestion | The inventory of the tapes sent to the backup centre is retrieved and compared with the record of the process that copies the products from temporary tapes to ANTF tapes via disk cache. |
| 10 | Backup | Ingestion | The zip container integrity check is used to verify that the transferred products have not been corrupted. |
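
As an illustration of checks 1, 3 and 3’ in Table 1, the following is a minimal Python sketch and not the Master Archive implementation. It assumes the inventory is available as a list of file names, that the expected MD5 values have already been extracted from the product metadata into a dictionary, and that the container is a zip file. All function and parameter names (`compare_inventory`, `verify_checksums`, `container_is_intact`, `root`, `expected`) are hypothetical.

```python
import hashlib
import zipfile
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_inventory(delivered, inventory):
    """Check 1 (sketch): compare delivered file names with the delivered inventory."""
    delivered, inventory = set(delivered), set(inventory)
    return {
        "missing": sorted(inventory - delivered),      # listed but not delivered
        "unexpected": sorted(delivered - inventory),   # delivered but not listed
    }


def verify_checksums(root, expected):
    """Check 3 (sketch): return the names that are absent or whose MD5 differs.

    `expected` maps a file name (relative to `root`) to its expected hex MD5,
    assumed here to have been parsed from a manifest or checksum file.
    """
    failures = []
    for name, expected_md5 in expected.items():
        path = Path(root) / name
        if not path.is_file() or md5_of_file(path) != expected_md5.lower():
            failures.append(name)
    return failures


def container_is_intact(path):
    """Check 3' (sketch): True if every zip member can be read and passes its CRC."""
    try:
        with zipfile.ZipFile(path) as archive:
            # testzip() returns the first corrupt member name, or None if all are fine.
            return archive.testzip() is None
    except (zipfile.BadZipFile, OSError):
        return False
```

In this sketch each function returns its findings instead of raising, so that the results of the individual checks could be logged and later combined, in the spirit of the global ingestion verification of check 8.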
Figure 5

EODAS Service Web Portal.

Figure 6

EODAS overall historical data ingestion status.

Figure 7

EODAS historical data ingestion detail.

Figure 8

EODAS live mission ingestion status.

Figure 9

Front-End Data Circulation.

Figure 10

CBA Volumes, January 2019.

Figure 11

CBA Next Generation Archive.

Figure 12

Data Information System.

Figure 13

DIS Data Provenance.

Figure 14

Archiving Policy.

Figure 15

Data Monitoring.

Figure 16

Data Holdings by Availability.

Language: English
Submitted on: Jan 26, 2019
Accepted on: Jan 7, 2020
Published on: May 7, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2020 Mirko Albani, Michel Douzal, Domenico Castrovillari, Paolo Boezi, Daniele Iozzino, Iolanda Maggio, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.