Have a personal or library account? Click to login
Wikidata and LiLa for Latin: Enabling Interoperability and Access to Inflected Forms and Corpus Attestations Cover

Wikidata and LiLa for Latin: Enabling Interoperability and Access to Inflected Forms and Corpus Attestations

Open Access
|Dec 2025

References

  1. Aronoff, M. (1993). Morphology by Itself: Stems and Inflectional Classes. MIT Press.
  2. Batsuren, K., et al. (2022). UniMorph 4.0: Universal Morphology. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (pp. 840855). Marseille, France: European Language Resources Association. URL: https://aclanthology.org/2022.lrec-1.89/. Last accessed 2025/11/15.
  3. Beniamine, S. (2018). Classifications flexionnelles. Étude quantitative des structures de paradigms. Doctoral dissertation, Université Sorbonne Paris Cité-Université Paris Diderot (Paris 7).
  4. Beniamine, S., Anderson, C., Carroll, M., Guzmán Naranjo, M., Herce, B., Pellegrini, M., Round, E., Sims-Williams, H., & Tresoldi, T. (2023). Paralex: a DeAR standard for rich lexicons of inflected forms. In The Fourth International Symposium of Morphology.
  5. Bonami, O., & Beniamine, S. (2016). Joint predictiveness in inflectional paradigms. Word structure, 9(2), 156182. 10.3366/word.2016.0092
  6. Boyé, G., & Schalchli, G. (2019). Realistic data and paradigms: The paradigm cell finding problem. Morphology, 29(2), 199248. 10.1007/s11525-018-9335-1
  7. Cimiano, P., Chiarcos, C., McCrae, J. P., & Gracia, J. (2020). Linguistic linked data. Springer International Publishing. 10.1007/978-3-030-30225-2
  8. Cotterell, R., Kirov, C., Sylak-Glassman, J., Yarowsky, D., Eisner, J., & Hulden, M. (2016). The SIGMORPHON 2016 shared task—morphological reinflection. In Proceedings of the 14th SIGMORPHON workshop on computational research in phonetics, phonology, and morphology (pp. 1022). 10.18653/v1/W16-2002
  9. De Felice, I., Tamponi, L., Iurescia, F., & Passarotti, M. (2023). Linking the Corpus CLaSSES to the LiLa Knowledge Base of Interoperable Linguistic Resources for Latin. In Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023) (pp. 172178). Venice, Italy: CEUR Workshop Proceedings. URL: https://aclanthology.org/2023.clicit-1.22/. Last accessed 2025/11/15.
  10. De Paoli, A., Passarotti, M. C., Ruffolo, P., Moretti, G., & Kernerman, I. (2025). Linking the Lexicala Latin-French Dictionary to the LiLa Knowledge Base. In Proceedings of the 5th Conference on Language, Data and Knowledge (pp. 197207). URL: https://aclanthology.org/2025.ldk-1.21/. Last accessed 2025/11/15.
  11. Dezotti, L. C., Passarotti, M., & Mambrini, F. (2024). Modelling and Linking an Old Latin-Portuguese Dictionary to the LiLa Knowledge Base. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) (pp. 1153711547). URL: https://aclanthology.org/2024.lrec-main.1008/. Last accessed 2025/11/15.
  12. Dressler, W. U., Kilani-Schoch, M., Gagarina, N., Pestal, L., & Pöchtrager, M. (2008), On the Typology of Inflection Class Systems. Folia Linguistica, 40(1–2), 5174. 10.1515/flin.40.1-2.51
  13. Erxleben, F., Günther, M., Krötzsch, M., Mendez, J., & Vrandečić, D. (2014). Introducing wikidata to the linked data web. In International semantic web conference (pp. 5065). Cham: Springer International Publishing. 10.1007/978-3-319-11964-9_4
  14. Fantoli, M., Passarotti, M., Mambrini, F., Moretti, G., & Ruffolo, P. (2022). Linking the LASLA Corpus in the LiLa Knowledge Base of Interoperable Linguistic Resources for Latin. In Proceedings of the Linked Data in Linguistics Workshop@ LREC2022 (pp. 2634). URL: https://aclanthology.org/2022.ldl-1.4/. Last accessed 2025/11/15.
  15. Fradin, B., & Kerleroux, F. (2003). Troubles with lexemes. In G. Booij, J. DeCesaris, A. Ralli, & S. Scalise (Eds.), Selected papers from the third Mediterranean Morphology Meeting (pp. 177196). IULA – Universitat Pompeu Fabra.
  16. Herce, B. (2025). VeLeSpa: An inflected verbal lexicon of Peninsular Spanish and a quantitative analysis of paradigmatic predictability. Language Resources and Evaluation, 59(2), 17051718. 10.1007/s10579-024-09776-2
  17. Kirov, C., Sylak-Glassman, J., Que, R., & Yarowsky, D. (2016). Very-large scale parsing and normalization of Wiktionary morphological paradigms. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16) (pp. 31213126). URL: https://aclanthology.org/L16-1498/. Last accessed 2025/11/15.
  18. Lassila, O., & Swick, R. R. (1998) Resource Description Framework (RDF) Model and Syntax Specification. URL: https://www.w3.org/TR/1999/REC-rdf-syntax-19990222/. Last accessed 2025/11/15.
  19. Lindemann, D. (2025). Ontolex-Lemon in Wikidata and other Wikibase instances. In Proceedings of the 5th Conference on Language, Data and Knowledge: The 5th OntoLex Workshop (pp. 3545). 10.5281/zenodo.15471514
  20. Lindemann, D., Ahmadi, S., Khan, A. F., Mambrini, F., Iurescia, F., & Passarotti, M. C. (2023). When OntoLex Meets Wikibase: Remodeling Use Cases. CEUR Workshop Proceedings, 2773. URL: https://ceur-ws.org/Vol-3640/paper14.pdf. Last accessed 2025/11/15.
  21. Mambrini, F., Litta, E., Passarotti, M., & Ruffolo, P. (2021a). Linking the Lewis & short dictionary to the LiLa knowledge base of interoperable linguistic resources for Latin. In Proceedings of the eighth Italian conference on computational linguistics (CLiC-it 2021) (pp. 216222). URL: https://aclanthology.org/2021.clicit-1.34/. Last accessed 2025/11/15.
  22. Mambrini, F., Passarotti, M., Litta, E., & Moretti, G. (2021b). Interlinking Valency Frames and Wordnet Synsets in the LiLa Knowledge Base of Linguistic Resources for Latin. In Further with Knowledge Graphs (pp. 1628). IOS Press. 10.3233/SSW210032
  23. Mambrini, F., Passarotti, M., Moretti, G., & Pellegrini, M. (2022, June). The Index Thomisticus Treebank as Linked Data in the LiLa Knowledge Base. In Proceedings of the Thirteenth Language Resources and Evaluation Conference (pp. 40224029). URL: https://aclanthology.org/2022.lrec-1.428/. Last accessed 2025/11/15.
  24. McCrae, J. P., Bosque-Gil, J., Gracia, J., Buitelaar, P., & Cimiano, P. (2017). The Ontolex-Lemon model: development and applications. In Proceedings of eLex 2017 conference (pp. 1921).
  25. Nicolai, G., Chodroff, E., Mailhot, F., & Çöltekin, Ç. (2024). Proceedings of the 21st SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology. Mexico City, Mexico: Association for Computational Linguistics. URL: https://aclanthology.org/2024.sigmorphon-1/. Last accessed 2025/11/15.
  26. Passarotti, M. (2019). The Project of the Index Thomisticus Treebank. In M. Berti (Ed.), Digital Classical Philology. Ancient Greek and Latin in the Digital Revolution (pp. 299319). De Gruyter. 2019. 10.1515/9783110599572-017
  27. Passarotti, M., Mambrini, F., Franzini, G., Cecchini, F. M., Litta, E., Moretti, G., Ruffolo, P., & Sprugnoli, R. (2020). Interlinking through Lemmas. The Lexical Collection of the LiLa Knowledge Base of Linguistic Resources for Latin. Studi e Saggi Linguistici, 58(1), 177212. 10.4454/ssl.v58i1.277
  28. Pedonese, G., Cecchini, F. M., & Passarotti, M. C. (2023). Linking the Computational Historical Semantics Corpus to the LiLa Knowledge Base of Interoperable Linguistic Resources for Latin. In Proceedings of the 4th conference on language, data and knowledge (pp. 7485). URL: http://www.lrec-conf.org/proceedings/lrec2012/pdf/274_Paper.pdf. Last accessed 2025/11/15.
  29. Pellegrini, M. (2023). Paradigm Structure and Predictability in Latin Inflection. An Entropy-based Approach. Springer. 10.1007/978-3-031-24844-3
  30. Pellegrini, M., & Passarotti, M. (2018). LatInfLexi: an Inflected Lexicon of Latin Verbs. In Proceedings of the Fifth Italian Conference on Computational Linguistics (CLiC-it 2018) (pp. 325330), Turin: CEUR Workshop Proceedings. URL: https://aclanthology.org/2018.clicit-1.57/. Last accessed 2025/11/15.
  31. Pellegrini, M., Passarotti, M., Litta, E., Mambrini, F., Moretti, G., Corbetta, C., & Verdelli, M. (2022). Enhancing Derivational Information on Latin Lemmas in the LiLa Knowledge Base. A Structural and Diachronic Extension. Prague Bulletin of Mathematical Linguistics, 119, 6792. 10.14712/00326585.023
  32. Pellegrini, M., Passarotti, M., Mambrini, F., & Moretti, G. (2025). PrinParLat: a lexicon of principal parts of Latin verbs linked to the LiLa Knowledge Base. Language Resources and Evaluation. 10.1007/s10579-025-09847-y
  33. Petrov, S., Das, D., & McDonald, R. (2012). A Universal Part-of-Speech Tagset. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC’12), (pp. 20892096), Istanbul, Turkey: European Language Resources Association (ELRA). URL: http://www.lrec-conf.org/proceedings/lrec2012/pdf/274_Paper.pdf. Last accessed 2025/11/15.
  34. Sanderson, R., Ciccarese, P., & Van de Sompel, H. (2013). Designing the W3C open annotation data model. In Proceedings of the 5th Annual ACM Web Science Conference (pp. 366375). 10.1145/2464464.2464474
  35. Sprugnoli, R., Passarotti, M., Testori, M., & Moretti, G. (2021). Extending and Using a Sentiment Lexicon for Latin in a Linked Data Framework. In Workshop on Sentiment Analysis and Linguistic Linked Data (SALLD-1) (pp. 114). 10.5281/zenodo.6303163
  36. Stump, G. T., & Finkel, R. A. (2013). Morphological typology: From word to paradigm. Cambridge University Press. 10.1017/CBO9781139248860
  37. Vrandečić, D., & Krötzsch, M. (2014). Wikidata: a free collaborative knowledge base. Communications of the ACM, 57(10), 7885. 10.1145/2629489
DOI: https://doi.org/10.5334/johd.464 | Journal eISSN: 2059-481X
Language: English
Submitted on: Nov 9, 2025
|
Accepted on: Dec 8, 2025
|
Published on: Dec 29, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 David Lindemann, Matteo Pellegrini, Francesco Mambrini, Marco Passarotti, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.