Developing Metrics for NASA Earth Science Interdisciplinary Data Products and Services

Open Access | Feb 2022

Figures & Tables

Table 1

Four major types of metrics collected at GES DISC.

TYPE OF METRICS | METRICS
Key Metrics | The operational distribution metrics recording overall user data/service access and download activities for the following three major groupings:
  1. Number of Distinct or Registered Users

  2. Number of Distributed Data Files

  3. Size of Distributed Data Volume

in four categories: Country (e.g., United States, Canada), Protocol (e.g., HTTPS, OPeNDAP), Project (e.g., TRMM/GPM, MERRA-2) and Domain (e.g., ‘.edu’, ‘.gov’).
Bugzilla Ticket Metrics | Collecting and retrieving significant and useful information from user questions or feedback in User Assistance Tickets:
  1. User Background:

    1. Who they are: Researcher; Professor/Graduate Students; Industry; etc.

    2. Where they come from: USA; Africa; Asia; Australia; Europe; the Middle East; etc.

  2. Number of User Assistance Tickets: Monthly; Seasonal; Yearly distributions (based on routine daily collections)

  3. Application/Study: Hydrology; Atmospheric Chemistry; Oceanography; etc.

  4. Portal: Giovanni; MERRA-2; TRMM/GPM; etc.

  5. Data Variable: Air Temperature; Wind Fields; Precipitation; Aerosol; etc.

Giovanni Publication Metrics | Gleaning significant and useful information from journal publications by Giovanni users with regard to:
  1. Applied Variable: Atmos. Aerosol; Precipitation; Air Temperature; etc.

  2. Product Source: TRMM/GPM; MODIS; MERRA-2; etc.

  3. Studied Subject: Hurricane; Aerosol/Dust; Rain/Water Vapor; etc.

  4. Studied Temporal Period: Long-term; Mid-term; Short-term

  5. Studied Spatial Domain: Global; Regional; Local

  6. Studied Region: Continents; Oceans; Countries; Lakes; etc.

  7. Journal Origins: America; Europe; Asia; Middle East; International/Open Access; etc.

Website Metrics (via Google Analytics) | Collecting useful information on user website access with Google Analytics:
  1. Dataset Keyword search: “rainfall”; “TRMM”; “merra-2”; “trmm”; “GPM”; etc.

  2. Information Keyword search: “precipitation”; “trmm”; “rainfall”; “merra-2”; “Giovanni measurements”; etc.

  3. Content Type search: “Data Collection”; etc.

  4. Traffic and/or Referral Sources: Direct Access; Google; etc.

  5. Datasets subsetted/downloaded directly from search results page: “trmm_3b42_v7”; “m2t1nxslv_v5.12.4”; “m2i3npasm_v5.12.4”; etc.

  6. Most Sorted Columns: “begin date”; “time res.”; “end date”

  7. Most Browsed Categories: “subject”; “measurement”; “source”; “project”; “spatial resolution”; “temporal resolution”

  8. Most searched Content Type: “data collections”; “data documentation”; “image gallery”; “how-to’s”; “tools”; “faqs”
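
The Key Metrics row of Table 1 pairs three measures (distinct users, distributed files, distributed volume) with four grouping categories (Country, Protocol, Project, Domain). A minimal Python sketch of that kind of aggregation might look like the following. The record layout, the field names (`user`, `bytes`, and so on), and the sample values are hypothetical illustrations, not the actual GES DISC log format or collection code.

```python
from collections import defaultdict

# Hypothetical distribution records: one entry per distributed file.
# Field names and values are assumptions made for this illustration.
records = [
    {"user": "u1", "country": "United States", "protocol": "HTTPS",
     "project": "MERRA-2", "domain": ".edu", "bytes": 1000},
    {"user": "u1", "country": "United States", "protocol": "OPeNDAP",
     "project": "TRMM/GPM", "domain": ".edu", "bytes": 2500},
    {"user": "u2", "country": "Canada", "protocol": "HTTPS",
     "project": "MERRA-2", "domain": ".gov", "bytes": 4000},
]


def key_metrics(records, category):
    """Summarize users, files, and volume for each value of one category."""
    users = defaultdict(set)   # distinct users seen per category value
    files = defaultdict(int)   # number of distributed files
    volume = defaultdict(int)  # distributed data volume in bytes
    for rec in records:
        key = rec[category]
        users[key].add(rec["user"])
        files[key] += 1
        volume[key] += rec["bytes"]
    return {
        key: {"users": len(users[key]), "files": files[key], "bytes": volume[key]}
        for key in users
    }


if __name__ == "__main__":
    for category in ("country", "protocol", "project", "domain"):
        print(category, key_metrics(records, category))
```

Counting users with a set per category value keeps a user who downloads many files from being counted more than once, which is the distinction between "distinct users" and "distributed files" in the table.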

Figure 1

The homepage of GES DISC with search capabilities for datasets, tools, documentation, alerts, data releases, news, FAQs, publications and more.

Figure 2

An example of the dataset landing page for the popular NASA Integrated Multi-satellitE Retrievals for GPM (IMERG) monthly dataset. A one-stop ‘shop’ design allowing easy access to data and dataset related information.

Figure 3

A schematic of four “correlated” metrics at GES DISC. More details with examples are shown in Figures 4, 5, 6 and 7.

Figure 4

The schematic of the collection workflow (top), and the yearly, i.e., FY2010 – FY2019, distributions of distinct user/IP (middle), data file (bottom left), and data volume (bottom right).

Figure 5

Bugzilla metrics – user assistance tickets. Top: a schematic of the collection workflow. Bottom: Monthly (2013-2018, left) and yearly (201301-201909, right) ticket distributions presented in two different perspectives.

Figure 6

Giovanni publication metrics. Top: a schematic of the collection workflow. Middle: Monthly publication distributions of diverse disciplines (left) and the respective distributions of individual disciplines (right) for FY2019. Bottom: Yearly publication distributions for Y2004-Y2019* [*projected to Dec 2019].

Figure 7

Standard “out-of-the-box” metrics. Top: a schematic of the collection workflow. Bottom: a workflow to generate GES DISC website custom metrics reports.

Table 2

Required fields of the EMS collection metadata and its search terms.

FIELD NAME | DESCRIPTION | MAX LENGTH | EXAMPLE
product | This is a product identifier or the short name of the dataset… | 80 | AIRIBRAD
metaDataLongName | Identification of the long name associated with the collection or granule. | 1024 | AIRS/Aqua infrared geolocated radiances
productLevel | NASA data processing levels (i.e., 0, 1, 1A, 1B, 2, 3, 4). | 10 | 1B
discipline | Designates the scientific area of application (i.e., Ocean, Atmosphere, Land, Cryosphere, Volcanic, Solar, Raw data, Radiance). | 500 | Atmosphere
mission | An operation to provide scientific measurements with space-based and/or ground-based measurement systems (i.e., platforms, satellites, field experiments, aerial measurements, etc.). For a multi-mission product, list all missions separated by a semi-colon (;). The primary mission should be listed first. Each mission should have one or more instruments associated with it. If there are multiple missions and multiple instruments, then the relationships between the missions and instruments should be defined. | 80 | Aqua
instrument | A collection of one or more sensor instruments that provide scientific measurements. For a multi-instrument product from one mission, all instruments are listed and separated by a comma (,). If the product (e.g., a combined product) involves multiple missions and multiple instruments, the instruments from each mission are separated by a semi-colon (;). The order of instruments should follow the same sequence as the mission field. If not applicable, enter “N/A”. (NOTE: the number of missions entered must pair evenly with the number of instrument groups delimited by “;”, i.e., if two missions are entered as “mission1;mission2”, then at least two instruments are required: “instrument1;instrument2”, “N/A;N/A”, or “instrument1a,instrument1b;instrument2a,instrument2b”, etc.) | 80 | AIRS
processingCenter | Data center where this product was generated. | 80 | GESDISC
archiveCenter | Data center where the data product is archived. This value is usually ‘GESDISC’. | 50 | GESDISC
eosFlag | Flag to indicate whether the data product is an EOS (NASA EOS 2021) or Non-EOS product. Values: E for EOS and N for Non-EOS. | 1 | E
productFlag | Flag denoting the type of product. Values: 1 = Data Product, 2 = Instrument Ancillary, 3 = System/Spacecraft and 4 = External. For a non-ECS product, use the value 1. | 1 | 1
publishFlag | Flag to indicate whether the product and its associated granules should be published to EMS. This value is usually ‘Y’. | 1 | Y
searchTerm | File name, directory, path, ESDT, Data Provider internal product IDs or other information that uniquely identifies a data product as it appears in an EMS Data file. The searchTerm should not include URL query strings and associated name-value pairs. searchTerms can include full strings or substrings. Values within this field are always treated as regular expressions (e.g., ‘.+MOD1[1-9].+’). Therefore, reserved grep/egrep characters should only be used when they are needed. By default, the product Shortname is used. Let OPS staff know if any specific pattern needs to be added to a product. | 200 | AIRIBRAD
dataSource | Assigns the data source (e.g., the system, subsystem, file, table or other identifying information) where the logs/flat files/metadata are generated (e.g., airs, aura, disc, reason, urs). Currently, GES DISC has identified five data source groups; each group and its associated hosts are listed below. | 50 | airs

Data source groups and associated hosts:
  • ‘airs’ – airscal1u, airscal2u, airspar1u, airspro2u, airspro3u, airspro5u, airsraw1u, airsraw2u, airsraw3u, rep2u, rep1

  • ‘aura’ – acdisc, aurapar1u, aurapar2u, auraraw1u, goldsfs1u, goldsmr1, goldsmr2, goldsmr3, rep5u

  • ‘reason’ – reason, neespi, atrain, agdisc, hydro1

  • ‘disc’ – disc1, disc2, disc3, tads1u, gdata1, gdata2, rep3, rep4

  • ‘urs’ – discnrt1, discnrt2

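
The field rules in Table 2 (maximum lengths, the pairing of missions with instrument groups, and the treatment of searchTerm as a regular expression) can be checked mechanically. The sketch below is one possible reading of those rules in Python, not the EMS implementation. The function names, the strict equality check on the number of groups, and the log path in the usage lines are assumptions made for illustration; the sample record uses the example values from the table.

```python
import re

# Maximum lengths as listed in Table 2.
MAX_LENGTH = {
    "product": 80, "metaDataLongName": 1024, "productLevel": 10,
    "discipline": 500, "mission": 80, "instrument": 80,
    "processingCenter": 80, "archiveCenter": 50, "eosFlag": 1,
    "productFlag": 1, "publishFlag": 1, "searchTerm": 200, "dataSource": 50,
}


def validate_record(record):
    """Return a list of problems found in one collection metadata record."""
    problems = []
    for field, limit in MAX_LENGTH.items():
        value = record.get(field)
        if value is None:
            problems.append(f"missing required field: {field}")
        elif len(value) > limit:
            problems.append(f"{field} exceeds {limit} characters")
    # Missions are separated by ';'. Instruments are grouped per mission by
    # ';' and separated by ',' within a group. This sketch assumes the two
    # fields must contain the same number of ';'-delimited groups.
    missions = record.get("mission", "").split(";")
    instrument_groups = record.get("instrument", "").split(";")
    if len(missions) != len(instrument_groups):
        problems.append("mission and instrument groups do not pair")
    return problems


def matches_search_term(record, log_entry):
    """Treat searchTerm as a regular expression, as Table 2 specifies."""
    return re.search(record["searchTerm"], log_entry) is not None


# Sample record built from the EXAMPLE column of Table 2.
example = {
    "product": "AIRIBRAD",
    "metaDataLongName": "AIRS/Aqua infrared geolocated radiances",
    "productLevel": "1B", "discipline": "Atmosphere",
    "mission": "Aqua", "instrument": "AIRS",
    "processingCenter": "GESDISC", "archiveCenter": "GESDISC",
    "eosFlag": "E", "productFlag": "1", "publishFlag": "Y",
    "searchTerm": "AIRIBRAD", "dataSource": "airs",
}

if __name__ == "__main__":
    print(validate_record(example))
    # The path below is a made-up log entry, not a real archive location.
    print(matches_search_term(example, "/hypothetical/archive/AIRIBRAD.005/granule.hdf"))
```

Because searchTerm is compiled as a regular expression, a plain short name such as AIRIBRAD matches as a substring, while characters such as `.` or `+` take their regex meaning. This is why the table advises using reserved characters only when needed.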
Language: English
Submitted on: Aug 27, 2021
Accepted on: Jan 24, 2022
Published on: Feb 11, 2022
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2022 Zhong Liu, Chung-Lin Shie, Anthony J. Ritrivi, Guang-Dih Lei, Gary T. Alcott, Mary Greene, James Acker, Jennifer C. Wei, David J. Meyer, Angela Li, Atheer F. Al-Jazrawi, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.