Differences in Spoken Language Processing in General Corpora (ORAL, ORTOFON) and in a Specialized Corpus (DIALEKT) and their Reflection in the Mapka Application

Open Access | Dec 2023

Abstract

ORAL and ORTOFON, general corpora of spoken Czech, capture authentic and prototypical informal spoken language. DIALEKT, a specialized corpus, represents the traditional regional dialects of Czech. Because the corpora differ in their goals and in the nature of the language data they capture, they required different data collection methods. The differences concern not only the choice of speakers but the whole communicative situation. Samples selected from these three corpora are included in the Mapka application and reflect the distinct character of each corpus. The ORAL and ORTOFON samples show general spoken language in various informal situations and capture a wide range of speakers. The DIALEKT samples represent traditional regional dialects spoken by selected types of speakers in the semiformal setting of a guided interview.

DOI: https://doi.org/10.2478/jazcas-2023-0038 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 204–213
Published on: Dec 25, 2023
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2023 Martina Waclawičová, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.