Have a personal or library account? Click to login
Modifications of the Czech Morphological Dictionary for Consistent Corpus Annotation Cover

Modifications of the Czech Morphological Dictionary for Consistent Corpus Annotation

Open Access
|Dec 2019

Abstract

We describe systematic changes that have been made to the Czech morphological dictionary related to annotating new data within the project of Prague Dependency Treebank (PDT). We bring new solutions to several complicated morphological features that occur in Czech texts. We introduced two new parts of speech, namely foreign word and segment. We adopted new principles for morphological analysis of global and inflectional variants, homonymous lemmas, abbreviations and aggregates. The changes were initiated by the need of consistency between the data and the dictionary and of the dictionary itself.

DOI: https://doi.org/10.2478/jazcas-2019-0067 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 380 - 389
Published on: Dec 21, 2019
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2019 Jaroslava Hlaváčová, Marie Mikulová, Barbora Štěpánková, Jan Hajič, published by Slovak Academy of Sciences, Ľudovít Štúr Institute of Linguistics
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.