Have a personal or library account? Click to login
Modifications of the Czech Morphological Dictionary for Consistent Corpus Annotation Cover

Modifications of the Czech Morphological Dictionary for Consistent Corpus Annotation

Open Access
|Dec 2019

References

  1. [1] Banko, M., and Brill, E. (2001). Scaling to Very Very Large Corpora for Natural Language Disambiguation. In Proceedings of the 39th annual meeting on ACL. Association for Computational Linguistics, pages 26–33.10.3115/1073012.1073017
  2. [2] Hajič, J. (2004). Disambiguation of Rich Inflection. (Computational Morphology of Czech). Karolinum, Prague.
  3. [3] Hajič, J. (2000). Morphological Tagging: Data vs. Dictionaries. In Proceedings of the 6th Applied Natural Language Processing and the 1st NAACL Conference, Seattle, pages 94–101.
  4. [4] Hajič J., Hajičová E., Panevová J., Sgall P., Bojar O., Cinková S., Fučíková E., Mikulová M., Pajas P., Popelka J., Semecký J., Šindlerová J., Štěpánek J., Toman J., Urešová Z., and Žabokrtský Z. (2012). Announcing Prague Czech-English Dependency Treebank 2.0. In Proceedings of the 8th International Conference on LREC 2012, European Language Resources Association, İstanbul, pages 3153–3160.
  5. [5] Hajič, J., and Hlaváčová, J. (2013). MorfFlex CZ. LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University. Accessible at: http://hdl.handle.net/11858/00-097C-0000-0015-A780-9.
  6. [6] Hajič, J., Bejček, E., Bémová, A., Buráňová, E., Hajičová, E., Havelka, J., Homola, P., Kárník, J., Kettnerová, V., Klyueva, N., Kolářová, V., Kučová, L., Lopatková, M., Mikulová, M., Mírovský, J., Nedoluzhko, A., Pajas, P., Panevová, J., Poláková, L., Rysová, M., Sgall, P., Spoustová, D. J., Straňák, P., Synková, P., Ševčíková, M., Štěpánek, J., Urešová, Z., Vidová Hladká, B., Zeman, D., Zikánová, Š., and Žabokrtský, Z. 2018, Prague Dependency Treebank 3.5. LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University. Accessible at http://hdl.handle.net/11234/1-2621
  7. [7] Hlaváčová, J. (2017). Golden Rule of Morphology and Variants of Wordforms. Jazykovedný časopis / Journal of Linguistics, 68(2), pages 136–144.10.1515/jazcas-2017-0024
  8. [8] Hlaváčová, J. (2009). Formalizace systému české morfologie s ohledem na automatické zpracování českých textů. Disertační práce. Univerzita Karlova.
  9. [9] Church, K., and Mercer, R. (1993). Introduction to the special issue on computational linguistics using large corpora. Computational Linguistics, 19(1), pages 1–24.
  10. [10] Mikulová M., Mírovský J., Nedoluzhko A., Pajas P., Štěpánek J., and Hajič J. (2017). PDTSC 2.0 – Spoken Corpus with Rich Multi-layer Structural Annotation. In Lecture Notes in Computer Science, No. 20th International Conference TSD 2017, Prague, pages 129–137. Cham, Switzerland: Springer International Publishing.10.1007/978-3-319-64206-2_15
  11. [11] Nivre, J., de Marneffe, M.-C., Ginter, F., Goldberg, Y., Hajič, J., Manning, C., McDonald, R., Petrov, S., Pyysalo, S., Silveira, N., Tsarfaty, R., and Zeman, D. (2016). Universal Dependencies v1: A Multilingual Treebank Collection. In Proceedings of the 10th International Conference on LREC 2016, pages 1659–1666. Paris.
  12. [12] Petkevič, V., Hlaváčová, J., Osolsobě, K., Šimandl, J., and Svášek, M. (2019). Microsyntactic Parts of Speech in NovaMorf, a New Morphological Annotation of Czech. In Proceedings of SLOVKO 2019 (this volume).10.2478/jazcas-2019-0065
  13. [13] Straková J., Straka M., and Hajič J. (2014). Open-Source Tools for Morphology, Lemmatization, POS Tagging and Named Entity Recognition. In Proceedings of 52nd Annual Meeting of the ACL: System Demonstrations, Association for Computational Linguistics, pages 13–18. Baltimore.10.3115/v1/P14-5003
  14. [14] Zeman, D. (2018). The World of Tokens, Tags and Trees. Studies in Computational and Theoretical Linguistics, Charles University, Prague.
  15. [15] Žabokrtský Z., Ševčíková M., Straka M., Vidra J., and Limburská A. (2016). Merging Data Resources for Inflectional and Derivational Morphology in Czech. In Proceedings of the 10th International Conference on LREC 2016, pages 1307–1314, Paris, European Language Resources Association.
DOI: https://doi.org/10.2478/jazcas-2019-0067 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 380 - 389
Published on: Dec 21, 2019
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2019 Jaroslava Hlaváčová, Marie Mikulová, Barbora Štěpánková, Jan Hajič, published by Slovak Academy of Sciences, Ľudovít Štúr Institute of Linguistics
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.