Identification of Spontaneous Spoken Texts in Slovak
Open Access | Dec 2019

Abstract

We propose a text classification method for building a language model for automatic recognition of spontaneous speech. Transcripts from our departmental speech database served as the spontaneous spoken texts. Using supervised machine learning, we trained multiple classification models (including neural networks) that distinguished these transcripts from written texts with high accuracy. We then verified the accuracy of the trained models on a database of texts containing direct speech extracted from newspaper articles.

DOI: https://doi.org/10.2478/jazcas-2019-0076 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 481–490
Published on: Dec 21, 2019
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2019 Róbert Sabo, Peter Krammer, Ján Mojžiš, Marcel Kvassay, published by Slovak Academy of Sciences, Ľudovít Štúr Institute of Linguistics
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.