Have a personal or library account? Click to login
Earth Science Data Analytics: Definitions, Techniques and Skills Cover

Earth Science Data Analytics: Definitions, Techniques and Skills

Open Access
|Feb 2017

Figures & Tables

Table 1

Skills of a Data Scientist (Udacity).

Skills of a Data Scientist (Udacity)
Basic ToolsData Munging
Basic StatisticsData Visualization & Communication
Machine LearningSoftware Engineering
Multivariable Calculus and Linear AlgebraThinking Like a Data Scientist
Table 2

Technical skills and tools of a Data Scientist (Master’s in Data Science).

Math (e.g. linear algebra, calculus and probability)
Statistics (e.g. hypothesis testing and summary statistics)
Machine learning tools and techniques (e.g. k-nearest neighbors, random forests, ensemble methods, etc.)
Software engineering skills (e.g. distributed computing, algorithms and data structures)
Data mining
Data cleaning and munging
Data visualization (e.g. ggplot and d3.js) and reporting techniques
Unstructured data techniques
R and/or SAS languages
SQL databases and database querying languages
Python (most common), C/C++ Java, Perl
Big data platforms like Hadoop, Hive & Pig
Cloud tools like Amazon S3
Table 3

Sampling of science research techniques being used.

Science Research Technologies (Sampling)
In Atmospheric ResearchIn Hydrology Research
Correlation Analysis; Bias CorrelationSpectral AnalysisLinear Regression
Regression Analysis; Bivariant RegressionTemporal Trending; Trend AnalysisMonte Carlo
Decision TreeSpatial InterpolationDarcy Equation
Machine LearningRevised Averaging SchemePoisson Regression
Data MiningForward Modeling; Inverse ModelingMulti-variate time series analysis
Data FusionRadiative Transfer ModelBUDYKO formula
Computational ToolsBaysian Synthesis InversionSmoothing (Gaussian)
Constrained Variational AnalysisTemporal StabilityFiltering (Destriping)
Model SimulationsGaussian DistributionMESH Model
RatiosExponential Differentiation
Time Series Analysis
Table 4

Earth Science data analytics goals.

To calibrate data
To validate data (note it does not have to be via data intercomparison)
To assess data quality
To perform coarse data preparation (e.g. subsetting data, mining data, transforming data, recovering data)
To intercompare datasets (i.e. any data intercomparison; Could be used to better define validation/quality)
To tease out information from data
To glean knowledge from data and information
To forecast/predict/model phenomena (i.e. Special kind of conclusion)
To derive conclusions (i.e. that do not easily fall into another type)
To derive new analytics tools
Table 5

Earth science data analytics techniques (sampling).

Data PreparationData ReductionData Analysis
Bias CorrectionAggregationAnomaly Detection
Coordinate TransformationAnomaly DetectionBayesian Techniques
Data EngineeringCluster AnalysisBivariant Regression
Data MiningData EngineeringClassification
Data MungingData FusionCorrelation/Regression Analysis
Database ManagementFactor AnalysisFactor Analysis
Exponential DifferentiationFilteringFourier Analysis
FilteringNeural NetworksGaussian Distribution
Format ConversionOutlier RemovalGraphics Analysis
ImputationRatiosImputation
Normalization/TransformationRevised Averaging SchemeLinear/Non-linear Regression
Outlier RemovalRule LearningMachine Learning/Decision Tree
RatiosTime SeriesMathematics/Calculus
Rule LearningVisualizationModeling
Sensitivity AnalysisMonte Carlo Method
SmoothingMulti-variate Time Series
Spatial InterpolationNormalization
Time SeriesPattern Recognition
VisualizationPrincipal Component Analysis
Revised Averaging Scheme
Rule Learning
Signal Processing
Spectral Analysis
Statistics
Temporal Trend Analysis
Time Series
Visualization
Table 6

Earth science data analytics skills (sampling).

Ability to integrate data across multiple domains
Support domain scientists with data & computational knowledge
Communicate across domains
Knowledge of data cycle
Software engineering
Software programming
Data Engineering
Decision science
Language: English
Submitted on: Oct 27, 2016
Accepted on: Jan 16, 2017
Published on: Feb 24, 2017
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2017 Steve Kempler, Tiffany Mathews, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.