Harvestable Metadata Services Development: Analysis of Use Cases from the World Data System

Open Access | Jul 2023

Figures & Tables

Figure 1

The metadata harvesting process. Standardized metadata is harvested from repository catalogues, then processed by an aggregation service. The service disseminates the metadata records through a search and discovery portal and/or serves them to further aggregation services for distribution.
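The harvest, aggregate, and disseminate steps described in this caption can be sketched in a few lines of Python. This is a minimal illustration, not the working group's implementation: it assumes the repository exposes Dublin Core records in an OAI-PMH `ListRecords` response (one of several protocols the use cases report), and the sample response, the repository names `repo-a` and `repo-b`, and the helper names `harvest`, `aggregate`, and `search` are all hypothetical. No network request is made; the XML is embedded so the example runs offline.

```python
import xml.etree.ElementTree as ET

# Standard namespaces for OAI-PMH 2.0 responses carrying oai_dc (Dublin Core) metadata.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical ListRecords response; a real harvester would fetch this over HTTP.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:repo-a.example:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Hypothetical geomagnetic index dataset</dc:title>
          <dc:subject>Geomagnetism</dc:subject>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header><identifier>oai:repo-a.example:2</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Hypothetical gridded population dataset</dc:title>
          <dc:subject>Geography</dc:subject>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""


def harvest(xml_text, source):
    """Step 1: extract standardized records from one repository's response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        identifier = rec.findtext("oai:header/oai:identifier", namespaces=NS)
        dc = rec.find("oai:metadata/oai_dc:dc", NS)
        if identifier is None or dc is None:
            continue  # skip records without an identifier or metadata payload
        records.append({
            "id": identifier,
            "source": source,
            "title": dc.findtext("dc:title", default="", namespaces=NS),
            "subjects": [e.text for e in dc.findall("dc:subject", NS) if e.text],
        })
    return records


def aggregate(batches):
    """Step 2: merge harvested batches, keeping the first copy of each identifier."""
    index = {}
    for batch in batches:
        for rec in batch:
            index.setdefault(rec["id"], rec)
    return index


def search(index, term):
    """Step 3: a toy discovery query over titles and subjects."""
    term = term.lower()
    return sorted(
        rec["id"]
        for rec in index.values()
        if term in rec["title"].lower()
        or any(term in subject.lower() for subject in rec["subjects"])
    )


if __name__ == "__main__":
    index = aggregate([harvest(SAMPLE_RESPONSE, "repo-a")])
    print(search(index, "geomagnet"))
```

Deduplicating on the record identifier is one simple design choice; an aggregator that receives the same dataset from several upstream services might instead match on a persistent identifier such as a DOI.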

Table 1

HMetS-WG participants, with WDS membership type and host institutions.

| WDS MEMBER | TYPE | HOST INSTITUTION(S) |
| --- | --- | --- |
| Centre de Données Astronomiques de Strasbourg (CDS) | Regular | Strasbourg Astronomical Observatory (ObAS); University of Strasbourg; French National Centre for Scientific Research (CNRS) |
| Global Change Research Data Publishing and Repository (GCdataPR) | Regular | Institute of Geographical Sciences and Natural Resources Research (IGSNRR), Chinese Academy of Sciences (CAS); Geographical Society of China |
| International Real-time Magnetic Observatory Network (INTERMAGNET) | Network | Multiple institutions (worldwide) |
| International Service of Geomagnetic Indices (ISGI) | Regular | School and Observatory of Earth Sciences (EOST); University of Strasbourg; French National Centre for Scientific Research (CNRS) |
| International GNSS Service (IGS) | Network | Multiple institutions |
| National Space Science Data Center (NSSDC) | Regular | National Space Science Center (NSSC), Chinese Academy of Sciences (CAS) |
| Socioeconomic Data and Applications Center (SEDAC) | Regular | Center for International Earth Science Information Network (CIESIN), Columbia University; Earth Observing System Data and Information System (EOSDIS), National Aeronautics and Space Administration (NASA) |
| World Data Center for Geomagnetism (Edinburgh) | Regular | British Geological Survey (BGS) |
| World Data Centre for Renewable Resources and Environment (WDC-RRE) | Regular | IGSNRR; CAS |

Table 2

Subject areas represented by repositories and target user groups. Subject areas were provided to WDS-ITO by the repositories.

| REPOSITORY | SUBJECT AREAS | USER GROUPS |
| --- | --- | --- |
| GCdataPR | Agriculture, Area studies, Earth sciences, Economics, Environmental studies, forestry, Geo-ecosystems, Geography, and History | Global change students, researchers, policy makers and society in China and worldwide |
| IGS | Earth sciences, Geodesy, GNSS, GPS, Precise positioning, Navigation, Timing, and Space sciences | Mainly IGS staff, project and working group participants. More broadly: worldwide users of modern mapping, orientation and navigation systems, enterprises, non-profits, institutions and government actors |
| INTERMAGNET | Earth sciences, Geomagnetism, Space sciences | Scientific community, geomagnetism community, members of IAGA, commercial users |
| ISGI | Solar-Terrestrial physics, Space weather-Space Climate, Space sciences, Earth sciences, Geomagnetism | Academia (including behavioral biology), members of IAGA communities, private and public sectors (military, telecommunications, satellite operators) |
| NSSDC | Astronomy, Computer sciences, Planetary science, Space physics, Space sciences, Space weather | Typical users are Chinese and international researchers in subject areas |
| SEDAC | Agriculture, Architecture and design, Anthropology, Area studies, Business, Chemistry, Climate science, Computer sciences, Cultural and ethnic studies, Earth sciences, Economics, Engineering, Environmental science, Environmental and forestry studies, Geography, Health sciences, Information system science, Political science, Sociology, Statistics, Sustainability science, Systems science, Transportation | User community interested in studying human interactions in the environment |
| WDC-RRE | Earth sciences, Ecology, Environmental studies and forestry, Geography, Geoinformatics, Natural resources | Mainly academic researchers and students, also scientific staff and technicians, general public, government agencies, policy makers, and international organizations |

Figure 2

Flow-chart diagram of a typical harvestable metadata services implementation (Payne, Urquidi Diaz & Li 2021). This diagram gives a schematic representation of the steps involved in creating a harvestable metadata service. The HMetS-WG used these steps to scaffold the group’s initial work.

Table 3

Use Case Infrastructures: Summary of Features.

| REPOSITORY | REPOSITORY PLATFORM & CATALOGUE | METADATA STANDARDS | METADATA SERVICE PROTOCOLS | KNOWN AGGREGATORS |
| --- | --- | --- | --- | --- |
| GCdataPR | Custom GCdataPR 2.0 | DCI, DataCite | OpenSearch | CrossRef, China-GEOSS, CNKI, DCI, CSTR, ScienceEngine |
| IGS | Catalogue via NASA CMR • Developing new discovery platform | DIF 10, ECHO 10, ISO 19115-2:2009 (MENDS and SMAP dialects), UMM-C | CMR CSW, CMR public APIs, OpenSearch | via NASA’s CMR |
| INTERMAGNET | Custom repository, with some datasets on GFZ Potsdam data repository | Via INTERMAGNET: IAGA2002, CDF; Via GFZ: GeoJSON, DataCite, ISO 19115 | Via homepage: HTTP, FTP; Via GFZ: request to DataCite’s API | DataCite, FIDGEO |
| ISGI | Custom • Public access metadata service | IAGA2002 • CERIF, DataCite, and/or DCAT based profiles and/or crosswalks | Via homepage: HTTPS; request to DataCite’s API | |
| NSSDC | Custom | NSSDC Core Metadata Specification, SPASE • DataCite, Data model compatible with NSSDC | OpenSearch, OGC-CSW (via WDS China), Data search platform • OAI-PMH | National Science and Technology Data Sharing Network of China, Scientific Data Center, CAS |
| SEDAC | Vital Digital Asset Mgt. System (Fedora) • Migrating to Drupal 8 | FGDC CSDGM, ISO 19115, DataCite | IDN OGC CSW, NASA CMR CSW, CMR public APIs, OpenSearch | DataCite, GEOSS (via EOSDIS/CMR) |
| WDC-RRE | Custom: Debian OS, OSS NGINX, PostgreSQL, TorCMS | Dublin Core, ISO 19115, custom Data Identification and Metadata Standards • Revision planned | OpenSearch, OGC-CSW 3.0.0, OAI-PMH 2.0, SRU 1.1 • Geonetwork | WDS-China, CNKI |

Figure 3

This bar chart compares the mechanisms for metadata exposure (aggregation, discovery, etc.) reported by the HMetS-WG repositories with those reported by the WDS repositories in a 2019 member survey (Payne & Urquidi Diaz 2020: 11, 15). Since some repositories reported serving their metadata via third-party services, these services have also been included (e.g. DataCite, EOSDIS). *Includes schema.org.

Language: English
Submitted on: Mar 9, 2023
Accepted on: Mar 13, 2023
Published on: Jul 5, 2023
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2023 Robert R. Downs, Alicia Urquidi Díaz, Qi Xu, Juanle Wang, Aude Chambodut, Chuang Liu, Simon Flower, Karen Payne, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.