Have a personal or library account? Click to login
Consistency of morphological dictionary MorfFlex Cover

Abstract

Language corpora usually contain, in addition to their own texts, various types of annotations. The most common one is a morphological annotation, which consists in assigning a lemma and a morphological tag to each wordform. For morphological tagging, morphological dictionaries are traditionally used. Our paper presents a new version of the so-called “Prague” morphological dictionary MorfFlex used for tagging many Czech corpora (particularly Prague Dependency Treebanks, corpora published by the Institute of the Czech National Corpus in Prague or large Czech web corpora of the Aranea series). Three basic principles were used to update the dictionary: the Golden Rule of Morphology, the Principle of Paradigm Unity, and the Principle of Paradigm Uniqueness.

DOI: https://doi.org/10.2478/jazcas-2022-0010 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 855 - 861
Published on: Aug 17, 2022
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2022 Jaroslava Hlaváčová, Marie Mikulová, Barbora Štěpánková, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.