Have a personal or library account? Click to login
Evaluating machine learning models for air quality error mapping in Kraków, Poland Cover

Evaluating machine learning models for air quality error mapping in Kraków, Poland

Open Access
|Jan 2026

Abstract

Accurate air quality prediction is essential for sustainable urban development. This study evaluates the performance of machine learning models, including DLinear and XGBoost, in comparison with the traditional Autoregressive Integrated Moving Average (ARIMA) method for predicting fine particulate matter (PM2.5) concentrations in Kraków, Poland. A dense network of low-cost sensors was used to generate high-resolution spatial and temporal data. Prediction errors were analysed using the Getis-Ord Gi* spatial statistics method during both extreme pollution events and low pollution days. The results indicate that DLinear achieved the lowest root mean square error (RMSE = 3.8 µg/m3), followed by XGBoost (RMSE = 6.7 µg/m3) and ARIMA (RMSE = 9.2 µg/m3). The spatial distribution of errors highlights the influence of environmental factors, such as humidity and proximity to water bodies, on model accuracy. These findings show the limitations of current prediction models and emphasize the need for spatially adaptive approaches to improve pollution.

DOI: https://doi.org/10.2478/mgrsd-2025-0026 | Journal eISSN: 2084-6118 | Journal ISSN: 0867-6046
Language: English
Submitted on: Nov 4, 2024
|
Accepted on: Apr 10, 2025
|
Published on: Jan 14, 2026
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2026 Mateusz Zaręba, Szymon Cogiel, Elżbieta Węglińska, Tomasz Danek, published by Faculty of Geography and Regional Studies, University of Warsaw
This work is licensed under the Creative Commons Attribution 4.0 License.

AHEAD OF PRINT