Speech Processing Using Dynamic Micro-Block Optimization Based on Deep Learning

Jiajun Hao; Chaoyang Geng

doi:10.2478/ijanmc-2025-0035


International Journal of Advanced Network, Monitoring and Controls

Volume 10 (2025): Issue 4 (December 2025)


Open Access


Figures & Tables

Overview of the E2E Speech Recognition Framework with DMBO

Gender-based distribution of dataset samples

Accent-based distribution of dataset samples

Training loss (left) and LER (right) for gender-based strategies

Training loss (left) and LER (right) for accent-based strategies

Training and test loss and LER of strategies based on accent

Model	Test LER
Standard	14.62%
Homogeneous gender	6.71%
Heterogeneous gender	13.13%
Homogeneous accent	13.58%
Heterogeneous accent	5.51%
Attention-LSTM [21]	9.77%
CNN-LSTM [22]	14.05%
BLSTM [23]	12.9%
BiLSTM-E [24]	8.07%
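
To make the comparison in the table easier to read, the test LER of each strategy can be expressed as a relative reduction against the Standard baseline. The sketch below is a reader-side calculation, not part of the published method: the values are copied from the table above, and the helper name `relative_reduction` and the choice of relative reduction as a summary are assumptions made for illustration.

```python
# Test LER values (percent) copied from the table above.
test_ler = {
    "Standard": 14.62,
    "Homogeneous gender": 6.71,
    "Heterogeneous gender": 13.13,
    "Homogeneous accent": 13.58,
    "Heterogeneous accent": 5.51,
}


def relative_reduction(baseline: float, value: float) -> float:
    """Relative LER reduction versus the baseline, in percent (hypothetical helper)."""
    return 100.0 * (baseline - value) / baseline


baseline = test_ler["Standard"]

# Relative reduction of every strategy except the baseline itself.
reductions = {
    name: relative_reduction(baseline, ler)
    for name, ler in test_ler.items()
    if name != "Standard"
}

# Print the strategies from largest to smallest reduction.
for name, red in sorted(reductions.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {red:.1f}% relative reduction")
```

Under these assumptions, heterogeneous accent and homogeneous gender grouping give the largest reductions (roughly 62% and 54%), while the other two strategies stay close to the baseline.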

Loss and LER for accent-based strategies during training and testing

Strategy	Test Loss	Test LER	Train Loss	Train LER
Standard	25.69	14.62%	12.72	11.14%
Homogeneous accent	22.26	13.58%	11.64	10.04%
Heterogeneous accent	12.08	5.51%	5.70	4.42%

Loss and LER for gender-based strategies during training and testing

Strategy	Test Loss	Test LER	Train Loss	Train LER
Standard	25.69	14.62%	12.72	11.14%
Homogeneous gender	13.53	6.71%	6.15	5.65%
Heterogeneous gender	24.15	13.12%	11.08	9.21%


DOI: https://doi.org/10.2478/ijanmc-2025-0035 | Journal eISSN: 2470-8038


Language: English

Page range: 46 - 58

Published on: Dec 31, 2025

Published by: Xi’an Technological University

In partnership with: Paradigm Publishing Services

Publication frequency: 4 issues per year

Keywords: Dynamic Optimization

Related subjects: Computer sciences, other

© 2025 Jiajun Hao, Chaoyang Geng, published by Xi’an Technological University
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
