Have a personal or library account? Click to login
Machine Learning Assisted Conversion of Biographical Records Into Wikidata Triples Cover

Machine Learning Assisted Conversion of Biographical Records Into Wikidata Triples

By: Daniel Baránek  
Open Access
|Jan 2026

References

  1. Baker, J., & Mahal, A. K. (2024, dec 23). “I have always found the whole area a minefield”: Wikidata, historical lives, and knowledge infrastructure. International Journal of Digital Humanities, 6(2), 217236. 10.1007/s42803-024-00090-5
  2. Baránek, D. (2024a). biography2wikidata. 10.57967/HF/1898
  3. Baránek, D. (2024b, mar 5). Kraken segmentation model for two-column prints. 10.5281/ZENODO.10783346
  4. Baránek, D. (2025). Research: Wikimedia versus traditional biographical encyclopedias. Meta-Wiki (Wikimedia Research project page). Retrieved 2025-10-15, from https://meta.wikimedia.org/w/index.php?title=Research:Wikimedia_versus_traditional_biographical_encyclopedias&oldid=29453744 (Last updated: 2025-10-15).
  5. Das, P., Karnam, S. K., Panda, A., Guda, B. P. R., Sarkar, S., & Mukherjee, A. (2023). Diversity matters: Robustness of bias measurements in Wikidata. arXiv. 10.48550/ARXIV.2302.14027
  6. Durdík, P. (1893). Docent. In Ottův slovník naučný (p. 745). J. Otto. Retrieved from https://ceskadigitalniknihovna.cz/uuid/uuid:3bf15a30-0a07-11e5-ae7e-001018b5eb5c.
  7. Fagerving, A. (2023, Oct.). Wikidata for authority control: sharing museum knowledge with the world. Digital Humanities in the Nordic and Baltic Countries Publications, 5(1), 222239. 10.5617/dhnbpub.10665
  8. Jaskulski, P., Latos, T., Ryńca, M., & Zapała, A. (2025, mar 5). Reliability of large language models as a tool for knowledge extraction from biographical dictionaries: the case of the Polish Biographical Dictionary. Digital Scholarship in the Humanities, 40(2), 538548. 10.1093/llc/fqaf014
  9. Neubert, J. (2017, sep 23). Wikidata as a linking hub for knowledge organization systems? Integrating an authority mapping into Wikidata and learning lessons for KOS mappings. Proceedings of the 17th European Networked Knowledge Organization Systems Workshop (pp. 1425). Retrieved from http://ceur-ws.org/Vol-1937/paper2.pdf.
  10. Redi, M., Gerlach, M., Johnson, I., Morgan, J., & Zia, L. (2020). A Taxonomy of Knowledge Gaps for Wikimedia Projects (Second Draft). 10.48550/ARXIV.2008.12314
  11. Zahra, T. (2008). Kidnapped Souls. Cornell University Press. 10.7591/9780801461910
DOI: https://doi.org/10.5334/johd.466 | Journal eISSN: 2059-481X
Language: English
Submitted on: Nov 10, 2025
|
Accepted on: Dec 27, 2025
|
Published on: Jan 16, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Daniel Baránek, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.