Relevant Criteria for Selection of Spoken Data: Theory Meets Practice

Open Access | Dec 2019

Abstract

The present paper seeks to review relevant criteria used in classifying speech events (SEs) from the perspective of spoken corpus design. The primary goal is to survey the landscape of possible types of spoken language, so as to assess in which directions the coverage of spoken Czech offered by the corpora of the Czech National Corpus can be expanded in the future. We approach the problem from both theoretical and practical points of view, examining both what the theoretical literature has to say and the approaches implemented in practice by existing spoken corpora of various languages. We then synthesize the obtained information into a pragmatically motivated set of SE classification criteria which does not aspire to be universal or definitive but aims to serve as a useful guiding principle and conceptual framework for understanding and promoting SE diversity when collecting spoken data.

DOI: https://doi.org/10.2478/jazcas-2019-0062 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 324-335
Published on: Dec 21, 2019
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2019 Marie Kopřivová, Zuzana Komrsková, Petra Poukarová, David Lukeš, published by Slovak Academy of Sciences, Ľudovít Štúr Institute of Linguistics
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.