Have a personal or library account? Click to login

Issues of POS Tagging of the (Diachronic) Corpus of Czech : Preparing a Morphological Dictionary

Open Access
|Jan 2018

Abstract

Many important decisions concerning the part-of-speech categorization remain unexplained in the current practice, only reported in corpus manuals. The aim of this paper is to offer a different perspective on the problems of morphological annotation of corpora – the perspective of mapping and analyzing conceptual problems in the annotation. Focused mainly on function words in Czech, we discuss the possibilities of the POS tagging of the inherently ambiguous category of particles and we introduce criteria for distinguishing particles from interjections.

DOI: https://doi.org/10.1515/jazcas-2017-0041 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 316 - 325
Published on: Jan 24, 2018
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2018 Anna Řehořková, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.