Skip to main content
Have a personal or library account? Click to login
Probabilistic assessment of annual maximum precipitation in Almaty, Kazakhstan Cover

Probabilistic assessment of annual maximum precipitation in Almaty, Kazakhstan

Open Access
|Mar 2026

Full Article

Introduction

Extreme hydrometeorological events have become a growing global concern due to their direct socio-economic and environmental impacts. Numerous studies have shown that intense rainfall events and flash floods are increasing in both frequency and severity due to climate change. These phenomena not only damage infrastructure and property but also pose significant risks to human safety, particularly in rapidly urbanizing areas with insufficient drainage systems (D’iya et al., 2014).

Globally, urban flooding has become one of the most critical consequences of climate-induced hydrological extremes. For example, severe rainfall events in Europe, such as the 2021 floods in Germany, Belgium, and Poland, resulted in extensive infrastructural damage and loss of life (Kreienkamp et al., 2021; Pietras & Pyrc, 2025). Similar patterns are observed in South Asia, where countries like India, Pakistan, and Bangladesh experience increasingly frequent monsoon-driven floods (Latif et al., 2017; Muhammad Iskandar et al., 2025). In China and Japan, rapid urbanization combined with changing rainfall regimes has intensified flood hazards, prompting the development of advanced stormwater management systems and statistical models for rainfall prediction (Jayawardane et al., 2024; Ling et al., 2025). These international examples highlight that the challenge of managing extreme precipitation and urban flooding is not unique to a single region but represents a widespread, global issue requiring localized yet methodologically comparable approaches.

In Kazakhstan, Almaty city has recently witnessed a marked increase in the frequency and intensity of extreme precipitation events. Several episodes of intense rainfall and sudden surface runoff have caused localized flooding, especially in low-lying districts such as Almalinskiy and Bostandyk. The city’s mountainous surroundings, coupled with rapid urban expansion and the loss of permeable green spaces, accelerate surface runoff and exacerbate the overload of the existing drainage infrastructure (BES Media, 2025). Furthermore, the absence of a comprehensive stormwater management system across many parts of the city leaves Almaty particularly vulnerable to the adverse effects of hydrological extremes.

These challenges are not unique to Almaty city. Similar situations have been observed in other urban centers of Kazakhstan, such as Astana city, where episodes of heavy rainfall and rapid snowmelt have overwhelmed drainage systems, resulting in urban flooding (Nakispekova, 2024). Almaty city has typical characteristics of the Central Asia urbanization processes and, in the authors’ opinion, can explain similar processes in this region, where rapidly developing cities are dominant. Almaty city faces the dual challenge of adapting to intensifying climate extremes while managing the consequences of rapid urbanization. Traditional stormwater infrastructure and planning approaches, based on outdated climatic assumptions, are increasingly inadequate for modern conditions (Dong et al., 2020). This underscores the urgent need for more adaptive and data-driven methods in urban flood risk assessment and management.

International experience demonstrates that statistical modeling of extreme rainfall using probability distribution functions has become a widely accepted approach to understanding and predicting hydrological extremes. The generalized extreme value (GEV) distribution remains the most commonly applied model in Europe, China, and Australia for estimating design rainfall and return periods of floods due to its flexibility in capturing both short-term and long-term extremes (Coles, 2001; Katz et al., 2002; Gentilucci et al., 2023).

Recent studies have shown that lognormal and gamma distributions continue to be preferred in regions where rainfall intensity exhibits moderate variability and observational records are limited, owing to their computational simplicity and stable fitting performance (Stedinger et al., 1993; Cho et al., 2004; Montes--Pajuelo et al., 2024). However, each model presents specific limitations: for instance, the GEV may overestimate rare events when sample sizes are small, while lognormal and gamma distributions tend to underestimate the upper tail of extreme rainfall (Rima et al., 2025).

Therefore, the combined application and intercomparison of multiple statistical models provide a more robust and reliable basis for assessing precipitation extremes under non-stationary climate conditions, as emphasized in recent hydrological analyses (Gentilucci et al., 2023; Rima et al., 2025).

Between 2015 and 2025, Almaty city experienced several instances of localized flooding and mudflows, primarily triggered by heavy rainfall, rapid snowmelt, and natural runoff from the surrounding mountainous terrain. Due to its geographic location at the foothills of the Trans-Ili Alatau (a subrange of the Tian Shan mountain belt), Almaty is particularly vulnerable to sudden water surges from upstream rivers and glacial sources. Each spring and summer, rising temperatures accelerate glacial melt, leading to swollen rivers and, occasionally, the formation and bursting of moraine--dammed glacial lakes (Kyrgyzbay et al., 2023; Choudhary & Kumar, 2025).

Although none of these incidents matched the scale of the 2015 glacial lake outburst flood, they collectively posed a significant challenge to the urban infrastructure. The city’s peripheral districts – such as Alatau, Turksib, and Bostandyk – were the most frequently affected, often facing obstructed roads, flooded basements, and temporary evacuations (Duskayev et al., 2023; Mirlas et al., 2024). Rapid urban expansion has increased vulnerability: changes to the hydrographic network due to infrastructural development, land use patterns, and inadequate drainage have amplified local flood risks. Human activity, particularly the construction of new neighborhoods and modification of watercourses, has further complicated natural water flow, leading to more severe and recurrent urban flood events (Duskayev et al., 2023).

Despite ongoing efforts by the city and national authorities to improve early warning systems and build protective infrastructures such as mudflow barriers and improved river channels, these events highlight the central role of sustainable urban planning and adaptation to climate change. The continued increase in average temperatures and more frequent episodes of intense precipitation intensify the risk of flash floods and mudflows in Almaty’s metropolitan area (Duskayev et al., 2023; Kyrgyzbay et al., 2023; Choudhary & Kumar, 2025).

This study provides a preliminary analysis of rainfall data collected from meteorological stations in Almaty city prior to their application in urban flood modeling.

The aim of this study is to analyze trends and spatial variability of extreme precipitation and runoff potential in Almaty using ground-based meteorological data (2000–2023) through statistical analysis.

Material and methods
Study area

Almaty city is located at the foothills of the Zailiyskiy Alatau Mountains (also known as the Trans-Ili Alatau), in southeastern Kazakhstan. It serves as the administrative center of Almaty city and lies at elevations ranging from approx. 700 m to 900 m a.s.l. The city includes both lowland urbanized areas and mountainous catchments, which form the headwaters of several small rivers, such as Ulken Almaty, Kishi Almaty, Esentai, Aksay, and Kargaly (Kyrgyzbay et al., 2023).

Over the past century, Almaty has undergone rapid urban expansion, with the city’s built-up area increasing from approx. 3 km2 in the 19th century to over 700 km2 in recent decades. This urban growth has resulted in significant transformations of the natural hydrographic network, including the channelization of streams, construction of stormwater infrastructure, and creation of artificial drainage pathways (Choudhary & Kumar, 2025). These changes have substantially altered the hydrological behavior of the area, increasing the likelihood of urban flooding. The area of Almaty city is now approx. 682 km2.

The city is highly susceptible to natural hazards, including heavy rainfall, flash floods, and debris flows. These risks are exacerbated by the combination of steep mountainous topography, convective precipitation events, and reduced infiltration capacity in urbanized zones (Mirlas et al., 2024). Historical flood events, including the catastrophic mudflows of 1921 and the urban floods in 2015, have caused significant damage to infrastructure and property.

Furthermore, recent groundwater modeling studies in the northern parts of the city, particularly in the Akbulak Micro District, have identified recurring groundwater flooding events caused by poor drainage and excessive surface runoff. These issues are compounded by the obstruction of natural subsurface flows and the degradation of the historical Karasu stream system (Mirlas et al., 2024).

The combination of climate variability, increasing rainfall intensity, land-use change, and hydrological modification has created a complex and dynamic urban flood regime. Therefore, Almaty represents a critical case study for assessing the impacts of climate-induced extreme rainfall and evaluating probabilistic models for sustainable flood risk management.

Data

The rainfall data used in this study were obtained from the official website of Kazhydromet (Republic State Enterprise Kazhydromet, n.d.) covering a continuous 23–24-year period from 2000 to 2023 (from 2001 to 2023). The dataset used in the analysis in that paper consists of the total annual precipitation sums recorded at five meteorological stations within Almaty city: Almaty, Shymbulak, Mynzhylyk, Bolshoe Almatinskoe Ozero (BAL), and Kemen. The names and geographical coordinates of these stations, together with key descriptive statistics, are presented in Table 1. The 24-year data series (2000–2023) can be considered relatively short for long-term climatological analysis, when the normal analytical period for hydrometeorological characteristics should not be shorter than 30 years (World Meteorological Organization [WMO], 1989). However, this period was selected based on the availability of continuous, high-quality digital data provided by the Republican State Enterprise “Kazhydromet” Kazakhstan’s official hydrometeorological service. Earlier data (prior to 2000) are mainly stored in paper archives and contain numerous gaps and inconsistencies, which reduce their homogeneity and reliability. Therefore, the 2000–2023 dataset obtained from the official Kazhydromet portal was considered optimal for ensuring completeness, calibration comparability, and statistical reliability.

TABLE 1.

Geographic characteristics of meteorological stations and descriptive statistics of annual precipitation sum data series observed in the period of 2000–2023

Geographical coordinateMeteorological station

AlmatyShymbulakMynzhylkyBALKemen
Latitude (N)43°24′43°12′43°08′43°05′43°18′
Longitude (E)76°93′77°08′77°07′76°98′76°96′

Descriptive statistics of annual precipitation sum

Minimum [mm]489687668636615
Maximum [mm]1,0101,4801,2401,3201,360
Mean (M) [mm]678968875866904
Standard deviation (SD) [mm]135196154165179
Coefficient of variation (CV)0.1980.2020.1760.1900.198
Skewness (γ1)0.9951.340.7391.120.632
Kurtosis coefficient (γ2)3.244.192.873.602.88
Data length [year]2000–2023 (24 years)2001–2023 (23 years)2000–2023 (24 years)2000–2023 (24 years)2000–2023 (24 years)

Source: own work.

To evaluate the variability of precipitation, several statistical indicators were calculated. The standard deviation (SD), which represents the dispersion of precipitation around the mean, ranged from 135 mm at Almaty station to 196 mm at Shymbulak, indicating moderate interannual variability. The coefficient of variation (CV), computed as the ratio of SD to the mean (M), varied between 0.176 and 0.202, corresponding to moderate variability (10–20%), which suggests relative stability of precipitation across the studied period.

To further describe the statistical characteristics of rainfall, skewness and kurtosis were analyzed. Positive skewness values ranging from 0.63 to 1.34 indicate a right-skewed distribution, meaning that most rainfall events are moderate, but rare extreme events occasionally occur. Kurtosis coefficients (γ2) between 2.87 and 4.19 show that certain stations (e.g., Shymbulak) exhibit more pronounced peaks and heavier tails compared to a normal distribution (kurtosis = 3), which is typical for regions exposed to intense orographic precipitation. All descriptive parameters were calculated using the formulas presented in Wilks (2011).

To verify the reliability and temporal consistency of the precipitation dataset, three non-parametric statistical tests were conducted for all five meteorological stations (Table 2). The Wald–Wolfowitz runs test (U) was used to evaluate the independence of the data series, ensuring that precipitation values are not serially correlated. The Kruskal–Wallis test (K) was applied to assess stationarity, confirming that there are no systematic changes or trends within the time series. The Pettitt test (W) was employed to check homogeneity, detecting possible abrupt shifts or inhomogeneities in the data over time.

TABLE 2.

Statistical characteristics of annual precipitation sum data series observed in the period of 2000–2023 by stations

Statistical test valueMeteorological station

AlmatyShymbulakMynzhylkyBALKemen
Independence|U| = 1.5|U| = 0.83|U| = 0.38|U| = 0.24|U| = 1.23
Stationarity|K| = 1.76|K| = 1.32|K| = 0.74|K| = 0.86|K| = 0.82
Homogeneity|W| = 1.01|W| = 0.22|W| = 1.01|W| = 0.72|W| = 0.43

Source: own work.

The calculated test statistics fall within the following ranges:

  • independence (U): from 1.30 to 1.83,

  • stationarity (K): from 0.74 to 1.76,

  • homogeneity (W): from 0.22 to 1.01.

All values are below the critical threshold of ±1.96 at a 95% confidence level, indicating that none of the null hypotheses were rejected. Therefore, the precipitation series for all stations can be considered independent, stationary, and homogeneous, providing a reliable basis for further hydrological and probabilistic modeling.

The magnitude of the test statistics varies among stations, reflecting spatial differences in precipitation behavior and local influences. The Almaty station exhibits higher |U| and |K| values (1.5 and 1.76), indicating a slight upward trend likely associated with urbanization and impervious surfaces that affect local convection and moisture dynamics. In contrast, the mountain stations Shymbulak, Mynzhylky, BAL, and Kemen exhibit lower values of |U| and |K| (< 1), confirming stable and homogeneous precipitation patterns typical of natural high-altitude environments with minimal anthropogenic influence. The homogeneity test (|W| < 1.1) for all stations confirms the absence of structural breaks in the data. Overall, the precipitation records are statistically consistent, although the elevated values at Almaty suggest emerging urban–climate interactions influencing rainfall variability.

The topographic complexity of the research area requires the inclusion of meteorological data collected across the entire area. As shown in Figure 1, the network of meteorological stations covers both the urbanized areas and the surrounding mountainous part of Almaty city. Located in the southeastern part of Kazakhstan, within the Almaty Region, the city lies at the foothills of the Zailiyskiy Alatau mountains. This spatial distribution enables a more accurate representation of precipitation variability and supports the identification of potential flash-flood risk zones in Almaty city.

FIGURE 1.

Location map of the meteorological stations in Almaty city, with a general map of Kazakhstan

Source: own work.

As illustrated in Figure 2, the spatial distribution of annual precipitation across Almaty city demonstrates a clear north-south gradient. Precipitation amounts increase significantly from the northern plains toward the foothills and mountainous zones of the Zailiyskiy Alatau. The highest annual totals, exceeding 900 mm, are observed at high-altitude stations such as Shymbulak and Mynzhylyk, while the lowest values, below 700 mm, occur in the northern and central lowland areas. This orographic influence results in substantial spatial variability of rainfall, which plays a crucial role in shaping surface runoff and the occurrence of flash floods in the city.

FIGURE 2.

Map of annual precipitation distribution in Almaty city

Source: own work.

Given the high spatial and seasonal variability of climatic parameters, it is crucial to determine the probabilistic distributions of atmospheric precipitation. This enables:

  • Assessment of the risk of extreme weather events (e.g., rare but high-impact rainfall).

  • Hydrological modeling of stormwater runoff based on the statistical characteristics of precipitation.

  • Development of adaptation scenarios and sustainable stormwater management measures under climate variability.

The application of probabilistic approaches, such as the five tested probability distributions, allows for a shift from descriptive analysis to quantitative risk assessment of pluvial flooding.

In recent decades, Almaty city has regularly faced challenges related to stormwater flooding, caused by a combination of intense precipitation events, rapid urbanization, and insufficient capacity of the existing drainage infrastructure. This problem is particularly acute during spring and summer, when short-duration but high-intensity rainfall generates substantial surface runoff that exceeds the capacity of current stormwater systems.

According to the Almaty Department for Emergency Situations, more than 50 incidents of localized flooding affecting streets, residential basements, and public infrastructure were recorded between 2010 and 2022. One of the most significant events occurred in July 2015, when over 50 mm of rainfall fell within a few hours, flooding central areas of the city. According to municipal authorities, total material damage exceeded 1.2 billion tenge, including damage to vehicles, underground utilities, and commercial facilities. One can see one of the flooding outcomes in the following link (BES Media, 2025).

Methodology

This section presents the methodology used to identify the most suitable probability distribution function for estimating design values of annual maximum precipitation. Such a model can be applied by engineers and water resource specialists for a priori assessments in frequency analysis.

During the first stage, five different probability distribution functions (PDFs) were tested on the series of annual maximum precipitation values to determine which best fit the data. These were selected due to their widespread use in contemporary hydrometeorological studies (Kundzewicz et al., 2013; Dong et al., 2020). The annual maximum precipitation series, obtained from various meteorological stations, were fitted using the HYFRAN software package (Duskayev et al., 2023), a practical tool for evaluating probability distributions. The list of applied distribution functions found in the software is presented in Table 3.

TABLE 3.

Probability density functions, equations, and descriptions

DistributionProbability formulaTypical useAdvantagesLimitations
Gumbel fx1βexpxμβexpxμβ \[ f\left( x \right)\frac{1} {\beta }\exp \left[ { - \frac{{x - \mu }} {\beta } - \exp \left( { - \frac{{x - \mu }} {\beta }} \right)} \right] \] Maxima (annual peak precipitation)Simple, widely used in hydrologyMay underestimate tail behavior (underpredict extremes)
Generalized extreme value (GEV) fx=1σ1+ξxμσ1ξ1exp1+ξxμσ1ξ \[ f\left( x \right) = \frac{1} {\sigma }\left( {1 + \xi \frac{{x - \mu }} {\sigma }} \right)^{ - \frac{1} {\xi } - 1} \exp \left[ { - \left( {1 + \xi \left( {\frac{{x - \mu }} {\sigma }} \right)^{ - \frac{1} {\xi }} } \right.} \right] \] Flexible for maxima with different tail behaviorAccounts for skewness and heavy tailsMore complex parameter estimation
Log--Pearson Type III logX~PearsonTypeIII \[ \log \left( X \right)\~\textit{Pearson}\,\textit{Type}\,III \] Flood and rainfall extremesRecommended by the U.S. Water Resources CouncilSensitive to sample size and outliers
Pearson Type III fx=βαxx0α1eβxx0Γα \[ f\left( x \right) = \frac{{\beta ^\alpha \left( {x - x_0 } \right)^{\alpha - 1} e^{ - \beta \left( {x - x_0 } \right)} }} {{\Gamma \left( \alpha \right)}} \ , where x > x0General hydrologic useGood fit for varied data with skewnessAssumes continuous positive data
Lognormal fx=1xσ2πexplnxμ22σ2,x>0 $f\left( x \right) = \frac{1} {{x\sigma \sqrt {2\pi } }}\exp \left( { - \frac{{\left( {\ln x - \mu } \right)^2 }} {{2\sigma ^2 }}} \right),x > 0$] Precipitation intensitiesSuitable for moderately skewed dataSensitive to zero and near-zero values
Normal fx=12πσ2expxμ22σ2 $f\left( x \right) = \frac{1} {{\sqrt {2\pi \sigma ^2 } }}\exp \left( { - \frac{{\left( {x - \mu } \right)^2 }} {{2\sigma ^2 }}} \right)$ Not recommended for extremesEasy to interpretPoor fit for skewed and extreme event data

Source: own work.

To fit the parameters of each PDF, maximum likelihood estimation (MLE) and method of moments (MoM) approaches were used. These methods enabled the estimation of distribution parameters based on a series of annual maximum precipitation values.

In addition, it was necessary to determine the empirical exceedance probability of the observed annual maximum precipitation values. For this purpose, the following formula was used:

F(xk)=kαn2α+1,0α0.5, \[ F\left( {x_k } \right) = \frac{{k - \alpha }} {{n - 2\alpha + 1}},0 \leqslant \alpha \leqslant 0.5, \]

where: F(xk) is empirical probability corresponding to the value xk, k is rank of the value in the ascending ordered series, n is total number of observations, α is offset parameter depending on the selected formula (cf. Table 4).

TABLE 4.

Values of parameter α for estimating the empirical exceedance probability

αCommentReference
0.4Recommended for hydrometeorological data; provides a balanced estimation for both small and large sample sizesCunnane (1978)
0.5Universal, but performs poorly at the tails (rare events). Frequently used in engineering calculations, but may overestimate probabilities at the distribution tailsHazen (1914)
0Simple and widely used, but may introduce biasWeibull (1939)
0.3Based on Soviet tradition, but often inadequate in mountainous regionsChegodayev (1955)
0.375Used in statistics; closer to the normal distributionBlom (1958)
≈0.333Sometimes applied in scientific publicationsTukey (1962)

Source: own work.

To calculate the empirical exceedance probability of annual maximum precipitation in Almaty city, the Cunnane formula (Cunnane, 1978) was selected. The choice of this formula is justified by the region’s topographic and climatic characteristics. Almaty is situated in the foothills of the Zailiyskiy Alatau, where orographic influences result in high spatial and temporal variability in precipitation. This is particularly evident in the case of intense convective rainfall events occurring during the warm season, often associated with moisture-laden mountain air and strong atmospheric instability.

Under such conditions, the Cunnane formula provides the most balanced probability estimates, especially when dealing with a limited number of observations (n ≤ 25 years), which is typical for meteorological stations in the city.

The Cunnane formula minimizes systematic errors at the distribution tails, offering a more accurate representation of both frequent and rare (extreme) events. This makes it particularly useful for comparing empirical data with theoretical distributions, such as the Gumbel, GEV, or log-Pearson Type III distributions, which are widely used in hydrological and climate-related studies (Kundzewicz et al., 2013; Dong et al., 2020). Moreover, the World Meteorological Organization recommends the use of the Cunnane formula for analyzing extreme hydrometeorological data in mountainous regions (WMO, 2009, 2017).

Thus, the use of the Cunnane formula is both scientifically and methodologically justified for analyzing extreme precipitation in the mountainous conditions of Almaty city. It provides a reliable foundation for estimating return periods and assessing the risks of pluvial flooding.

Results and discussion

In this study, the 24-year annual precipitation records (2000–2023) from five meteorological stations near Almaty city were analyzed. For hydrological planning and design purposes, the best available long-term precipitation data were used. The annual maximum precipitation values (in mm) were statistically processed to estimate design rainfall intensities for return periods of 10 (10%), 50 (2%), and 100 (1%) years using several PDFs, namely the exponential, GEV, normal, lognormal, and gamma distributions.

The goodness of fit of each distribution was evaluated using multiple criteria: the chi-square (χ2) test, Akaike information criterion (AIC), Bayesian information criterion (BIC), and posterior model probability P(Mi|x). Table 5 presents the comparison of empirical and theoretical precipitation quantiles (in mm) for each station.

Figures 3 and 4 show the results of fitting the GEV and lognormal distributions to the observed values of annual precipitation maxima at the Almaty weather station. The red line shows the theoretical model, the black crosses show the empirical data, and the blue lines show the 95% confidence interval. These two graphs are provided as illustrative examples.

TABLE 5.

Design rainfall intensities for return periods of 1%, 2% and 10% calculated by selected empirical probability functions applied to precipitation annual sum time series in meteorological stations of Almaty city

Meteorological stationDesign rainfall for return periods calculated by selected probability distribution functions [mm]
exp.GEVnormallognormalgamma
10%2%1%10%2%1%10%2%1%10%2%1%10%2%1%
Almaty9361,2501,3908511,0201,09085195599184998310308459641,010
Shymbulak1,3501,8202,0301,2101,4801,5901,2201,3701,4201,2101,4001,4701,2101,3801,440
Mynzhylky1,1601,5001,6501,0701,2601,3301,0701,1901,2301,0701,2301,2801,0701,2001,250
BAL1,1801,5701,7301,0801,3101,4101,0801,2001,2501,0701,2301,2901,0701,2101,270
Kemen1,3001,7801,9901,1401,3301,4001,1301,2701,3201,1401,3301,4001,1301,2901,350

exp. – exponential distribution, GEV – generalized extreme value distribution, normal – normal distribution, lognormal – lognormal distribution, gamma – gamma distribution.

Source: own work.

FIGURE 3.

Annual maximum precipitation distribution fitted with the generalized extreme value distribution using the maximum likelihood estimation method for the Almaty meteorological station

Source: own work.

FIGURE 4.

Annual maximum precipitation distribution fitted with the lognormal distribution using the maximum likelihood estimation method for the Almaty meteorological station

Source: own work.

The analysis of AIC, BIC, and P(Mi|x) for the Almaty meteorological station, which serves as a representative location for an urban area in this study, shows that the lognormal distribution demonstrates superior performance, yielding the highest posterior probability P(Mi|x) = 39.94 along with the lowest AIC (303.2) and BIC (305.6) values (Table 6).

TABLE 6.

Comparison of the analysis for the meteorological station based on AIC, BIC, P(Mi|x), χ2 test, and p-value criteria

Meteorological stationParameterProbability distribution type
exp.GEVnormallognormalgamma
AlmatyAIC305.7304.2306.4303.2304.0
BIC308.1307.7308.7305.6306.3
P(Mi|x)11.213.48.239.927.1
χ29.22.87.55.15.1
p0.050.410.110.270.27
ShymbulakAIC311.4307.2311.0306.9309.2
BIC313.7310.6313.3309.2310.2
P(Mi|x)4.521.15.543.425.3
χ27.12.84.71.63.4
p0.130.410.320.800.48
MynzhylkyAIC310.0312.2312.8310.8311.2
BIC312.4315.7315.2313.1313.6
P(Mi|x)37.47.19.225.820.4
χ28.64.02.84.54.5
p0.070.260.580.330.33
BALAIC315.1312.8316.0312.5313.4
BIC317.4316.3318.4314.9315.8
P(Mi|x)10.918.86.738.924.5
χ212.75.71.65.15.1
p0.010.120.790.270.27
KemenAIC326.0320.3320.2318.5318.5
BIC328.3323.9322.5320.8320.8
P(Mi|x)0.98.616.939.334.1
χ211.06.34.008.06.3
p0.020.090.400.080.17

exp. – exponential distribution, GEV – general extreme value distribution, normal – normal distribution, lognormal – lognormal distribution, gamma – gamma distribution.

Source: own work.

Although the GEV showed the best agreement with empirical data according to the χ2 test, the lognormal distribution proved more favorable when accounting for model simplicity and goodness-of-fit through information-theoretic metrics. The gamma distribution also performed relatively well, with a posterior probability of 27.1 and competitive AIC (304.0) and BIC (306.3) values (Table 6).

The variation in the best-fit distribution across different stations and criteria underscores the complexity of hydrological modeling in a region with complex climatic conditions, such as those in Almaty city. The discrepancy between the χ2 test (favoring the GEV) and information criteria/Bayesian probabilities (favoring lognormal/gamma) highlights the importance of using multiple goodness-of-fit measures for robust model selection. The GEV, while showing good agreement with empirical data via the χ2 test, might be penalized by AIC and BIC if its additional parameter does not significantly improve the fit relative to its complexity.

Estimating design rainfall with different return periods (e.g., 1%, 2%, 5%, and 10%) is crucial for infrastructure planning and for mitigating risks from extreme precipitation events, such as flash floods and urban inundations, which are intensified by ongoing urbanization and climate variability in Almaty. The selection of the most appropriate PDF directly impacts the accuracy of these design values.

Similar approaches have been widely applied in other countries, where various probability distributions have been tested to model extreme rainfall events. For instance, the GEV is the most commonly used model in Europe, China, and Australia for estimating design rainfall and flood return periods due to its flexibility in capturing both short-term and long-term extremes (Coles, 2001; Burnham & Anderson, 2002; Katz et al., 2002; Gentilucci et al., 2023). In contrast, lognormal and gamma distributions are frequently applied in regions with moderate rainfall variability and limited data availability, such as parts of South Asia and Africa, owing to their computational simplicity and stable fitting performance (Stedinger et al., 1993; Cho et al., 2004; Montes-Pajuelo et al., 2024). However, these models differ in their ability to represent the upper tail of extreme precipitation: the GEV tends to perform better for rare events, whereas the lognormal and gamma distributions may underestimate extremes. Therefore, comparing multiple statistical distributions, as presented in this study, provides a more robust and comprehensive understanding of precipitation extremes under non-stationary climate conditions.

Conclusions

This study presents a comprehensive probabilistic assessment of extreme precipitation in the city of Almaty, based on 24 years of ground-based meteorological observations (2000–2023) from five stations: Almaty, Shymbulak, Mynzhylky, BAL, and Kemen. The analysis used annual maximum precipitation values and applied five theoretical probability distribution functions – exp., GEV, normal, lognormal, and gamma – to model extreme rainfall behavior.

All rainfall series were confirmed to be independent, stationary, and homogeneous, as verified by statistical tests (Wald–Wolfowitz, Kruskal–Wallis, and Pettitt). The results of the MLE showed that the GEV and lognormal distributions provided the best fit across most stations.

Quantitatively, the maximum annual precipitation during the study period ranged from 615 mm (Kemen) to 1,010 mm (Shymbulak), while the mean annual precipitation varied between 678 mm (Almaty) and 866 mm (BAL). The CV ranged from 0.17 to 0.20, indicating moderate interannual variability.

Based on the best-fitting probability models, the estimated rainfall quantiles showed a clear increase with longer return periods. On average across all stations, the 10-year event (10% exceedance probability) corresponded to rainfall amounts of approximately 950–1,050 mm, the 50-year event (2% exceedance) to around 1,100–1,200 mm, and the 100-year event (1% exceedance) to 1,200–1,300 mm. These values represent the expected magnitudes of extreme precipitation for the respective return periods and can serve as reference thresholds for hydrological design and risk assessment in Almaty city. The lognormal distribution demonstrated the lowest AIC and BIC values and the highest posterior probability P(Mi|x) > 0.35 for the Almaty and Shymbulak meteorological stations, while the GEV was slightly better for BAL and Kemen meteorological stations. The gamma model also performed reasonably well (AIC differences < 5), suggesting its potential suitability for areas with complex mountain–urban topography.

These quantitative results confirm that extreme rainfall events have intensified in Almaty city, especially during the warm season (May–September). The observed upward trend in the magnitude of extreme precipitation highlights growing risks of urban flash floods and surface runoff overload.

From a practical standpoint, the derived rainfall quantiles can be directly applied to the design of stormwater drainage networks, flood risk assessments, and climate-resilient infrastructure planning in Almaty. For instance, a 100-year design rainfall of approx. 1,250 mm can serve as a baseline parameter for the dimensioning of retention basins and flood control structures.

DOI: https://doi.org/10.22630/srees.10847 | Journal eISSN: 2543-7496 | Journal ISSN: 1732-9353
Language: English
Page range: 20 - 39
Submitted on: Oct 3, 2025
Accepted on: Dec 30, 2025
Published on: Mar 31, 2026
In partnership with: Paradigm Publishing Services

© 2026 Yerlan Mukhanbet, Jarosław Chormański, Dana Tungatar, Mariusz Paweł Barszcz, Ainura Aldiyarova, published by Warsaw University of Life Sciences - SGGW Press
This work is licensed under the Creative Commons Attribution-NonCommercial 4.0 License.