Have a personal or library account? Click to login
Global Community Guidelines for Documenting, Sharing, and Reusing Quality Information of Individual Digital Datasets Cover

Global Community Guidelines for Documenting, Sharing, and Reusing Quality Information of Individual Digital Datasets

Open Access
|Mar 2022

Figures & Tables

dsj-21-1423-g1.png
Figure 1

Brief description of four quality aspects (i.e., science, product, stewardship and service) throughout a dataset lifecycle, three key stages and a few quality attributes associated with each quality aspect (e.g., define, develop, and validate stages for the science quality aspect). The quality aspects and associated stages are based on Ramapriyan et al. (2017) with the following changes, based on feedback from the ESIP community and the International FAIR Dataset Quality Information (DQI) Community Guidelines Working Group: i) ‘Assess’ replaced by ‘Evaluate’ in the Product aspect; ii) ‘Deliver’ replaced by ‘Release’ in the Product aspect; and iii) ‘Maintain’ replaced by ‘Document’ in the Stewardship aspect. Additionally, completeness of metadata is moved from the Product to Stewardship aspect. Creator: Ge Peng; Contributors to conceptualization: Lesley Wyborn and Robert R. Downs.

dsj-21-1423-g2.png
Figure 2

Schematic diagram of timelines of the initiation, planning, development, community review, and first baseline of the guidelines document. The guidelines document will be updated in the future to improve its coverage in diverse disciplines. ESIP IQC: Information Quality Cluster of the Earth Science Information Partners. BSC EQC: Barcelona Supercomputing Center (BSC) Evaluation and Quality Control (EQC) team.

dsj-21-1423-g3.png
Figure 3

Flowchart outlining different phases of the guidelines development process, including the initiation, planning, development, community review and engagement, and baseline of the guidelines.

dsj-21-1423-g4.png
Figure 4

A schematic diagram of a basic workflow with relevant elements for curating and disseminating dataset quality information. Creator: Carlo Lacagnina. Contributor: Ge Peng.

dsj-21-1423-g5.png
Figure 5

Diagram mapping the guidelines to the FAIR Guiding Principles as defined in Wilkinson et al. (2016).5 Solid lines represent direct mapping while the dashed lines represent indirect or weak mapping that are either inferred or may not always hold. {F, A, I, R}n denotes the nth element of the findable, accessible, interoperable, and reusable principles, respectively. Based on Table 1 in Peng et al. (2021b), with additional weak mappings represented by the dashed lines. Creator: Ge Peng. Contributor: Anette Ganske.

Table 1

Examples of dataset quality assessment models and their compliance with Guideline 2.

ASSESSMENT MODELSCIENTIFIC DATA STEWARDSHIP MATURITY MATRIX (PENG ET AL. 2015)STEWARDSHIP MATURITY MATRIX FOR CLIMATE DATA (PENG ET AL. 2019b)FAIR DATA MATURITY MODEL (RDA FAIR DATA MATURITY MODEL WORKING GROUP 2020)METADATA QUALITY FRAMEWORK (BUGBEE ET AL. 2021)DATA QUALITY ANALYSES AND QUALITY CONTROL FRAMEWORK (WOO & GOURCUFF 2021)
Quality Entity (i.e., attribute, aspect, or dimension)StewardshipStewardshipFAIRnessMetadataData
2.1 – Publicly AvailableYesYesYesYesYes
2.1 – PIDDOIDOIDOIDOIDOI
2.2 – IndexedData Science JournalFigshareZenodoData Science JournalIntegrated Marine Observing System Catalog
2.3 – Retrievable
Using Free, Open, Standard-Based Protocol

Yes

Yes

Yes

Yes

Yes
Table 2

Examples of representing quality entities, assessment models and assessment results in machine-readable quality metadata and their compliance with Guideline 3.

QUALITY METADATA FRAMEWORKNOAA ONESTOP DSMM QUALITY METADATA (PENG ET AL. 2019A)ATMODAT MATURITY INDICATOR (HEYDEBRECK ET AL. 2020)METADATAFROMGEODATA (WAGNER ET AL. 2021)
Quality EntityStewardshipAny Quality EntityData and Metadata
3.1 – Semantically and Structurally ConsistentYesYesYes
3.1 – Metadata Framework/SchemaInternationalDomainDomain
3.2 – Quality Entity DescriptionYesYesYes
3.3 – Assessment Method/Structure DescriptionYesYesPartly (contains evaluation of quality description and not description of quality assessment)
3.4 – Assessment Results DescriptionYesYesYes
3.5 – Versioning and the History of the AssessmentsYesVersioningCreation & Last Update Dates
Table 3

Examples of human-readable dataset quality assessment reports and their compliance with Guideline 4.

QUALITY REPORTLEMIEUX ET AL. (2017)HÖCK ET AL. (2020)COWLEY (2021)
Quality EntityStewardshipDataData
4.1 – Follow TemplateYesYesYes
4 – Quality Entity DescriptionYesYesYes
4 – Assessment Method DescriptionYesYesYes
4 – Assessment Results DescriptionYesYesYes
4.2 – LicenseYesYesYes
4.2 – Assessment HistoryYesYesYes
4.3 – Linked Report PIDYesNoYes
Table 4

Examples of disseminating assessment results online and their compliance with Guideline 5.

ONLINE PORTALJPSS DATA PRODUCT ALGORITHM MATURITY PORTAL6C3S CLIMATE DATA STORE DATASET QUALITY ASSESSMENT PORTAL7ROLLINGDECK TO REPOSITORY (R2R) QA DASHBOARD8
Quality EntityAlgorithmTechnical and Scientific QualitySensor
5 – Report information in an organized wayYesYesYes
5.1 – Dataset DescriptionMinimalYesMinimal
5.2 – Assessed Quality Entity DescriptionYesYesYes
5.3 – Evaluation Method and Review Process DescriptionYesYesYes
5.4 – Description of How to Understand and Use DescriptionSomeSomeMinimal
Language: English
Submitted on: Jan 3, 2022
Accepted on: Feb 28, 2022
Published on: Mar 31, 2022
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2022 Ge Peng, Carlo Lacagnina, Robert R. Downs, Anette Ganske, Hampapuram K. Ramapriyan, Ivana Ivánová, Lesley Wyborn, Dave Jones, Lucy Bastin, Chung-lin Shie, David F. Moroni, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.