Robust Machine Learning Algorithmic Rules for Detecting Air Pollution in the Lower Parts of the Atmosphere

Open Access | Sep 2025

Figures & Tables

Table 1

Layers of our atmosphere.

| LAYER | DESCRIPTION & RELEVANCE TO HUMANITY |
| --- | --- |
| Troposphere | Closest to our habitat, stretching up to 10 km above the earth. Its temperature decreases with altitude (approx. 6.5 °C per kilometre) (Omrani et al., 2022). |
| Stratosphere | Contains the majority of atmospheric ozone, which absorbs ultraviolet radiation and protects us from potential health risks. Its temperatures are highest in summer and lowest in winter (Xu et al., 2023). |
| Mesosphere | The temperature decreases with vertical height above the ground (Laštovička, 2023). |
| Thermosphere/Ionosphere | Absorbs energetic ultraviolet and X-ray radiation from the sun, so its temperature increases with vertical height. Temperatures also vary between night and day as well as between seasons. The layer reflects and absorbs radio waves, allowing global radio wave transmission (Goncharenko et al., 2021). |
| Exosphere | Contains mainly oxygen and hydrogen atoms, which rarely collide and instead follow “ballistic” trajectories under the influence of gravity (Janches et al., 2021). |
| Magnetosphere | The outer region surrounding the earth, where charged particles spiral along the magnetic field lines, with the earth behaving like a huge magnet (Lu et al., 2022). |

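The tropospheric lapse rate quoted in Table 1 (approx. 6.5 °C per kilometre) lends itself to a quick worked estimate. The sketch below is illustrative only: the function name and the 15 °C surface reference temperature are assumptions and do not come from the article.

```python
LAPSE_RATE_C_PER_KM = 6.5  # approximate value quoted in Table 1


def troposphere_temperature(altitude_km, surface_temp_c=15.0):
    """Estimate air temperature (°C) at an altitude inside the troposphere.

    surface_temp_c is an assumed reference value, not taken from the article.
    """
    if not 0 <= altitude_km <= 10:
        raise ValueError("linear approximation is only applied up to 10 km here")
    return surface_temp_c - LAPSE_RATE_C_PER_KM * altitude_km


print(troposphere_temperature(2.0))  # 15 - 6.5 * 2 = 2.0
```

The 10 km ceiling mirrors the troposphere depth given in Table 1; above it a linear decrease should not be assumed.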
Table 2

Selected key air pollutants.

| DATA ITEM | DESCRIPTION | DIMENSION & COMPLETENESS |
| --- | --- | --- |
| PM10 (FSPMC) | Fine Suspended Particulates (FSP) | 2952 × 27 samples; 3.17% missing |
| NO2 | Nitrogen Dioxide | 2952 × 28 samples; 2.18% missing |
| NOx = NO + NO2 | Nitrogen Oxides (in Hong Kong) | 2952 × 19 samples; 2.67% missing |
| O3 | Ozone | 2952 × 114 samples; 4.39% missing |
| PM2.5 (RSPMC) | Respirable Suspended Particulates (RSP) | 2952 × 27 samples; 7.9% missing |
| Time | Daily and hourly recording | From 00:00hrs to 23:00hrs of 1–31 January 2023, 1–30 April 2023, 1–31 July 2023 and 1–31 October 2023 (inclusive) |
| DayTimes | Discretised time periods of day | Night: 00:00-06:00hrs; Morning: 06:00-11:00hrs; Day: 12:00-18:00hrs; Evening: 19:00-23:00hrs |
| Period | Monthly weather periods | January, April, July and October |

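The DayTimes discretisation in Table 2 can be sketched as a simple mapping from the hour of a reading to its category. This is a minimal illustration, not the authors' code: the function name is hypothetical, and assigning the shared 06:00 boundary to Morning is an assumption.

```python
def day_time(hour):
    """Map an hour of day (0-23) to a DayTimes category from Table 2.

    Assumption: the shared 06:00 boundary is assigned to Morning.
    """
    if not 0 <= hour <= 23:
        raise ValueError("hour must be in 0..23")
    if hour < 6:
        return "Night"
    if hour < 12:
        return "Morning"
    if hour < 19:
        return "Day"
    return "Evening"


# Count the hourly slots that fall in each category over one day.
counts = {}
for h in range(24):
    counts[day_time(h)] = counts.get(day_time(h), 0) + 1
print(counts)  # {'Night': 6, 'Morning': 6, 'Day': 7, 'Evening': 5}
```

Under this assumption every hourly reading belongs to exactly one category, so the four categories partition the 24 hourly slots.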
Figure 1

Data sources in southern China including Hong Kong and Macau.

Figure 2

Hourly averaged concentrations for all sampled pollutants.

Table 3

Daily and monthly averages.

| START HOUR | END HOUR | HOURS RANGE | CATEGORIES OF DAY |
| --- | --- | --- | --- |
| 00:00hrs, 1 Jan 2023 | 23:00hrs, 31 Jan 2023 | 1st to 744th hour | Night, Morning, Day, Evening |
| 00:00hrs, 1 Apr 2023 | 23:00hrs, 30 Apr 2023 | 745th to 1464th hour | Night, Morning, Day, Evening |
| 00:00hrs, 1 Jul 2023 | 23:00hrs, 31 Jul 2023 | 1465th to 2208th hour | Night, Morning, Day, Evening |
| 00:00hrs, 1 Oct 2023 | 23:00hrs, 31 Oct 2023 | 2209th to 2952nd hour | Night, Morning, Day, Evening |

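The cumulative hour ranges in Table 3 follow directly from the number of days in each sampled month. A small sketch of that arithmetic is given below; the function name is an assumption made for illustration.

```python
import calendar


def hour_ranges(year, months):
    """Return (month name, first hour, last hour) using 1-based cumulative hours."""
    ranges, start = [], 1
    for month in months:
        days = calendar.monthrange(year, month)[1]  # number of days in the month
        end = start + days * 24 - 1
        ranges.append((calendar.month_name[month], start, end))
        start = end + 1
    return ranges


for name, first, last in hour_ranges(2023, [1, 4, 7, 10]):
    print(f"{name}: hours {first} to {last}")
```

For the four months sampled in 2023 this gives 744 + 720 + 744 + 744 = 2952 hourly records, matching the row count reported in Table 2.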
Table 4

Daily and monthly average concentrations of each pollutant.

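| AVERAGE TIME PERIODS | PM10 (μg/m³) | NO2 (ppb) | NOx = NO + NO2 (ppb) | O3 (ppb) | PM2.5 (μg/m³) |
| --- | --- | --- | --- | --- | --- |
| Day | 20.08 | 15.77 | 38.00 | 39.51 | 35.52 |
| Evening | 21.51 | 15.99 | 35.85 | 30.29 | 38.41 |
| Morning | 19.96 | 13.61 | 34.79 | 23.70 | 34.68 |
| Night | 19.60 | 11.05 | 22.37 | 25.47 | 35.35 |
| January | 28.36 | 15.30 | 39.38 | 26.06 | 45.01 |
| April | 19.51 | 15.93 | 33.47 | 34.17 | 41.77 |
| July | 9.05 | 11.69 | 35.56 | 29.44 | 23.05 |
| October | 20.14 | 14.49 | 29.79 | 35.79 | 33.84 |
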
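Period averages of the kind shown in Table 4 can be sketched as a grouped mean over hourly readings. This section does not state how the missing readings reported in Table 2 were treated, so skipping them is an assumption here; the function name and the sample values are made up for illustration.

```python
from collections import defaultdict


def period_means(records):
    """Average readings per period, skipping missing values (None)."""
    totals = defaultdict(lambda: [0.0, 0])  # period -> [running sum, count]
    for period, value in records:
        if value is None:
            continue  # assumed handling of missing readings
        totals[period][0] += value
        totals[period][1] += 1
    return {period: s / n for period, (s, n) in totals.items()}


# Made-up readings for illustration only; these are not the article's data.
sample = [("Day", 20.0), ("Day", None), ("Day", 22.0), ("Night", 18.0), ("Night", 19.0)]
print(period_means(sample))  # {'Day': 21.0, 'Night': 18.5}
```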
Figure 3

Fine suspended particulates monthly and daily averages and variations.

Figure 4

Pollution levels across the year 2023.

Figure 5

Density distribution of pollutants across day time periods.

Figure 6

Pollution data points on a multidimensional scaling.

Table 5

Centroids of the selected clusters formed.

| POLLUTANT | 2 CLUSTER CENTRES | 3 CLUSTER CENTRES | 4 CLUSTER CENTRES |
| --- | --- | --- | --- |
| Fine Suspended Particulates (PM10) | 12.11, 25.85 | 11.79, 21.44, 25.55 | 11.53, 20.38, 20.48, 28.54 |
| Nitrogen Dioxide (NO2) | 11.79, 16.70 | 10.93, 20.61, 15.07 | 10.88, 20.26, 17.99, 13.56 |
| Ozone (O3) | 22.12, 39.53 | 22.49, 26.54, 41.50 | 22.51, 23.37, 55.29, 30.69 |
| Nitrogen Oxides (NOx = NO + NO2) | 29.77, 34.85 | 25.51, 58.79, 28.43 | 25.44, 60.33, 35.99, 25.34 |
| Respirable Suspended Particulates (PM2.5) | 25.15, 48.14 | 24.85, 37.94, 48.59 | 24.42, 36.08, 40.39, 53.02 |

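Cluster centres such as those in Table 5 are typically the per-variable means of the points assigned to each cluster. The clustering method is not stated in this section, so the sketch below assumes a k-means style procedure (Lloyd's algorithm); the function name, starting centres and data points are made up for illustration.

```python
def kmeans(points, centres, iterations=20):
    """Lloyd's algorithm: assign points to the nearest centre, then recompute centres."""
    for _ in range(iterations):
        clusters = [[] for _ in centres]
        for p in points:
            dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centres]
            clusters[dists.index(min(dists))].append(p)
        new_centres = []
        for members, old in zip(clusters, centres):
            if members:
                # The centre is the coordinate-wise mean of the cluster members.
                new_centres.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centres.append(old)  # keep the centre of an empty cluster
        if new_centres == centres:
            break  # assignments have stabilised
        centres = new_centres
    return centres


# Made-up (PM10, PM2.5) pairs for illustration only; not the article's data.
data = [(10.0, 24.0), (12.0, 26.0), (14.0, 25.0), (24.0, 47.0), (26.0, 49.0), (28.0, 48.0)]
print(kmeans(data, centres=[data[0], data[3]]))  # [(12.0, 25.0), (26.0, 48.0)]
```

Running the same procedure with two, three and four starting centres would yield one set of centroids per choice, which is the layout Table 5 uses.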

Figure 7

Actual and estimated PM10 (Left) and PM2.5 (Right).

Figure 8

Correlations among paired pollutant variables.

Table 6

Component loadings–contribution of each pollutant in each component.

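LOADINGS

| POLLUTANT | COMPONENT 1 | COMPONENT 2 | COMPONENT 3 | COMPONENT 4 | COMPONENT 5 |
| --- | --- | --- | --- | --- | --- |
| PM10 | 0.399 | | 0.334 | 0.822 | 0.221 |
| Nitrogen Dioxide (NO2) | 0.162 | -0.200 | | 0.203 | -0.944 |
| Ozone (O3) | 0.536 | 0.199 | -0.816 | | |
| Nitrogen Oxides (NOx) | 0.258 | -0.935 | | | 0.225 |
| PM2.5 | 0.679 | 0.203 | 0.469 | -0.524 | |
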
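Component loadings of the kind reported in Table 6 are the eigenvectors of the covariance or correlation matrix of the pollutant variables. The sketch below assumes the correlation matrix and uses synthetic data; neither choice is confirmed by this section, and the function name is hypothetical.

```python
import numpy as np


def pca_loadings(X):
    """Return eigenvalues and loadings (eigenvectors of the correlation matrix)."""
    X = np.asarray(X, dtype=float)
    corr = np.corrcoef(X, rowvar=False)      # variables are in columns
    eigvals, eigvecs = np.linalg.eigh(corr)  # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]        # largest variance first
    return eigvals[order], eigvecs[:, order]


rng = np.random.default_rng(0)
base = rng.normal(size=(200, 1))
# Synthetic data: five noisy columns sharing one common signal.
X = base + 0.5 * rng.normal(size=(200, 5))
eigvals, loadings = pca_loadings(X)
print(np.round(loadings, 3))
```

Each column of the loadings matrix has unit length, so the squared loadings within a component sum to 1. Statistical packages often leave small loadings blank when printing, which would explain empty cells in a loadings table.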
Figure 9

PM10 associations with daily and annual periods.

Table 7

Row points vs Principal Dimension 1.

| | HIGH | LOW | MEDIUM |
| --- | --- | --- | --- |
| Day | 484 | 4 | 373 |
| Evening | 352 | 3 | 260 |
| Morning | 369 | 20 | 349 |
| Night | 351 | 17 | 370 |

Table 8

Columns vs Principal Dimension 1.

| | HIGH | LOW | MEDIUM |
| --- | --- | --- | --- |
| January | 587 | 0 | 157 |
| April | 618 | 0 | 102 |
| July | 58 | 4 | 682 |
| October | 293 | 40 | 411 |

Table 9

Row points vs PD 1 for Day Times.

| | DIMENSION 1 | DIMENSION 2 |
| --- | --- | --- |
| Day | 26.50596 | 3.576712 |
| Evening | 23.52921 | 3.377988 |
| Morning | 23.04545 | 49.011043 |
| Night | 26.91938 | 44.034257 |

Table 10

Columns vs PD 1 for Annual Periods.

| | DIMENSION 1 | DIMENSION 2 |
| --- | --- | --- |
| April | 26.778387 | 2.672733 |
| January | 17.240575 | 4.196079 |
| July | 51.795246 | 22.524998 |
| October | 4.185793 | 70.606190 |

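The values in Tables 9 and 10 sum to 100 within each dimension, which is consistent with percentage contributions of row or column points in a correspondence analysis. That reading is an assumption, as is the function name below. The sketch computes row contributions from the Day Times counts read from Table 7 and is not expected to reproduce Table 9, since the exact input table used by the authors is not given here.

```python
import numpy as np


def ca_row_contributions(table):
    """Percentage contribution of each row to each correspondence analysis dimension."""
    N = np.asarray(table, dtype=float)
    P = N / N.sum()                       # correspondence matrix
    r, c = P.sum(axis=1), P.sum(axis=0)   # row and column masses
    # Standardised residuals from the independence model.
    S = (P - np.outer(r, c)) / np.sqrt(np.outer(r, c))
    U, sv, Vt = np.linalg.svd(S, full_matrices=False)
    k = min(N.shape) - 1                  # number of non-trivial dimensions
    return 100 * U[:, :k] ** 2            # each column sums to 100


# Rows: Day, Evening, Morning, Night; columns: HIGH, LOW, MEDIUM (Table 7).
counts = [[484, 4, 373], [352, 3, 260], [369, 20, 349], [351, 17, 370]]
contrib = ca_row_contributions(counts)
print(np.round(contrib, 3))
```

The contribution of a row to a dimension equals its mass times its squared principal coordinate divided by the dimension's inertia, which simplifies to the squared entry of the left singular vector used above.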
Figure 10

PM10 and PM2.5 associations from Stations 1 and 8.

Language: English
Submitted on: Nov 29, 2024
Accepted on: Aug 28, 2025
Published on: Sep 24, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Kassim Mwitondi, Hugo Wai Leung Mak, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.