Have a personal or library account? Click to login
Development of a Database and Models for Children’s Speech in the Slovak Language for Speech-oriented Applications Cover

Development of a Database and Models for Children’s Speech in the Slovak Language for Speech-oriented Applications

Open Access
|Nov 2025

Abstract

Children’s speech differs significantly from adult speech due to physiological and cognitive developmental factors. Key differences include higher pitch, a shorter vocal tract, greater formant frequencies, slower speaking rates, and greater variability in pronunciation and articulation. These differences result in acoustic mismatches between children’s and adult speech, making traditional automatic speech recognition models trained on adult speech less effective for children. Additionally, linguistic differences, such as limited vocabulary and evolving grammar, further contribute to this challenge. This paper focuses on the creation of a children’s speech database for the low-resource Slovak language. This database has been used to train acoustic models for the automatic recognition of spontaneous children’s speech in Slovak. In this research, we compared three different approaches to speech recognition, with self-supervised learning achieving results comparable to similar studies in this area, despite using relatively small amounts of training data.

DOI: https://doi.org/10.2478/jazcas-2025-0020 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 223 - 233
Published on: Nov 27, 2025
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2025 Ján Staš, Stanislav Ondáš, Matúš Pleva, Matej Horváth, Richard Ševc, Patrik Michalanský, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.