Have a personal or library account? Click to login
iWEEMS: Interactive Word Embeddings for Early Modern Science Cover

iWEEMS: Interactive Word Embeddings for Early Modern Science

Open Access
|Oct 2025

References

  1. Akopyan, O., Barton, W., Baumgartner, F., Berrens, D., Kirchler, U., Korenjak, M., Luggin, J., Tautschnig, I., & Zathammer, S. (2023). Noscemus Wiki [Dataset]. Zenodo. 10.5281/ZENODO.7855322
  2. Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017). Enriching Word Vectors with Subword Information. Transactions of the Association for Computational Linguistics, 5, 135146. 10.1162/tacl_a_00051
  3. Burns, P. J. (2023). LatinCy: Synthetic Trained Pipelines for Latin NLP (Version 1). arXiv. 10.48550/ARXIV.2305.04365
  4. Denooz, J. (2004). Opera Latina: Une base de données sur internet. Euphrosyne, 32, 7988. 10.1484/J.EUPHR.5.125535
  5. Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2018). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. 10.48550/ARXIV.1810.04805
  6. Ehrmanntraut, A., Hagen, T., Konle, L., & Jannidis, F. (2021). Type- and Token-based Word Embeddings in the Digital Humanities. CHR 2021: Computational Humanities Research 2021, 2989, 23.
  7. Harris, Z. S. (1954). Distributional Structure. Word World, 10(2–3), 146162. 10.1080/00437956.1954.11659520
  8. Hedesan, G., Huber, A., Kodetová, J., Kříž, O., Kubíčková, J., Kaše, V., & Pavlas, P. (2025). EMLAP (Version v0.4) [Dataset]. Zenodo. 10.5281/ZENODO.14765294
  9. Lenci, A. (2018). Distributional Models of Word Meaning. Annual Review of Linguistics, 4, 151171. 10.1146/annurev-linguistics-030514-125254
  10. Lenci, A., Sahlgren, M., Jeuniaux, P., Cuba Gyllensten, A., & Miliani, M. (2022). A comparative evaluation and analysis of three generations of Distributional Semantic Models. Language Resources and Evaluation, 56(4), 12691313. 10.1007/s10579-021-09575-z
  11. Longree, D., Fantoli, M., & LASLA (ULiège). (2023). LASLAfiles_Latin_DATformat [Dataset]. ULiège Open Data Repository. 10.58119/ULG/27VZID
  12. McInnes, L., Healy, J., & Melville, J. (2018). UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction (Version 3). arXiv. 10.48550/ARXIV.1802.03426
  13. Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed Representations of Words and Phrases and their Compositionality. In C. J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani, & K. Q. Weinberger (Eds), Advances in Neural Information Processing Systems 26 (NIPS 2013), Vol. 26, pp. 31113119). New York City: Curran Associates, Inc. 10.48550/arXiv.1310.4546
  14. Montani, I., Honnibal, M., Honnibal, M., Van Landeghem, S., Boyd, A., Peters, H., McCann, P. O., Geovedi, J., O’Regan, J., Samsonov, M., Orosz, G., De Kok, D., Blättermann, M., Altinok, D., Kristiansen, S. L., Madeesh Kannan, Mitsch, R., Bournhonesque, R., Edward, … Tamura, Y. (2023). spaCy: Industrial-strength Natural Language Processing in Python (Version v3.5.1) [Computer software]. Zenodo. 10.5281/zenodo.10009823
  15. Passarotti, M. (2010). Leaving behind the less-resourced status. The case of latin through the experience of the index thomisticus treebank. 7th SaLTMiL Workshop on Creation and Use of Basic Lexical Resources for Less-Resourced Languages LREC 2010. Valetta, Malta, 23 May 2010 Workshop Programme, 27.
  16. Passarotti, M. (2015). What you can do with linguistically annotated data. From the Index Thomisticus to the Index Thomisticus Treebank. In P. Roszak & J. Vijgen (Eds.), Reading Sacred Scripture with Thomas Aquinas. Hermeneutical Tools. Theological Questions and New Perspectives (pp. 344). Turnhout: Brepols. 10.1484/M.TEMA-EB.4.000129
  17. Pražák, O., Přibáň, P., Taylor, S., & Sido, J. (2020). UWB at SemEval-2020 Task 1: Lexical Semantic Change Detection. In A. Herbelot, X. Zhu, A. Palmer, N. Schneider, J. May, & E. Shutova (Eds.), Proceedings of the Fourteenth Workshop on Semantic Evaluation (pp. 246254). International Committee for Computational Linguistics. 10.18653/v1/2020.semeval-1.30
  18. Řehůřek, R., & Sojka, P. (2010). Software Framework for Topic Modelling with Large Corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, 4550. http://is.muni.cz/publication/884893/en (last accessed 10 October 2025).
  19. Sahlgren, M. (2008). The distributional hypothesis. Rivista Di Linguistica, 20(1), 3353.
  20. Schlechtweg, D., McGillivray, B., Hengchen, S., Dubossarsky, H., & Tahmasebi, N. (2020). SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection. In A. Herbelot, X. Zhu, A. Palmer, N. Schneider, J. May, & E. Shutova (Eds.), Proceedings of the Fourteenth Workshop on Semantic Evaluation (pp. 123). International Committee for Computational Linguistics. 10.18653/v1/2020.semeval-1.1
  21. Sprugnoli, R., Moretti, G., & Passarotti, M. (2020). Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas. Italian Journal of Computational Linguistics, 6(1). 10.4000/ijcol.624
  22. Sprugnoli, R., Passarotti, M., & Moretti, G. (2019). Vir is to Moderatus as Mulier is to Intemperans—Lemma Embeddings for Latin. Proceedings of the Sixth Italian Conference on Computational Linguistics Bari, Italy, 13–15 November 2019. 10.5281/zenodo.3565572
  23. van der Maaten, L., & Hinton, G. (2008). Visualizing Data using t-SNE. Journal of Machine Learning Research, 9(86), 25792605.
  24. Venna, J., & Kaski, S. (2001). Neighborhood Preservation in Nonlinear Projection Methods: An Experimental Study. In G. Dorffner, H. Bischof, K. Hornik (Eds.), Artificial Neural Networks — ICANN 2001. ICANN 2001. Lecture Notes in Computer Science, vol 2130 (pp 485491). Berlin, Heidelberg: Springer. 10.1007/3-540-44668-0_68
  25. Zathammer, S. (2025). Noscemus Digital Sourcebook [Dataset]. Zenodo. 10.5281/ZENODO.15040256
DOI: https://doi.org/10.5334/johd.379 | Journal eISSN: 2059-481X
Language: English
Submitted on: Aug 22, 2025
|
Accepted on: Sep 30, 2025
|
Published on: Oct 28, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Vojtěch Kaše, Jana Švadlenková, Jan Tvrz, Georgiana Hedesan, Petr Pavlas, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.