Have a personal or library account? Click to login
Creating a Historical Migration Dataset from Finnish Church Records, 1800–1920 Cover

Creating a Historical Migration Dataset from Finnish Church Records, 1800–1920

Open Access
|Aug 2025

References

  1. Alvarez-Palau, E. J., & Martí-Henneberg, J. (2020). Shaping the common ground: State-building, the railway network, and regional development in finland. The Journal of Interdisciplinary History, 51(2), 267296. 10.1162/jinh_a_01557 (Accessed: 2025-07-15).
  2. Anonymous. (1953). Sukututkimusaineiston valokuvaaminen. Genos, 3, 89. (In Finnish)
  3. Barker, E., O’Doherty, M., & Isaksen, L. (Eds.) (2021). Introduction to the pelagios special issue. (Vol. 15) (No. 1–2). Edinburgh: Edinburgh University Press. 10.3366/ijhac.2021.0259 (Accessed: 2025-07-15).
  4. Beise, J., & Voland, E. (2008). Intrafamilial resource competition and mate competition shaped social-group-specific natal dispersal in the 18th and 19th century Krummhörn population. American Journal of Human Biology, 20(3), 32536. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/18186514 (Accessed: 2025-07-15).
  5. Blomqvist, C., Enflo, K., Jakobsson, A., & Åström, K. (2022). Joint handwritten text recognition and word classification for tabular information extraction. In 26th international conference on pattern recognition (ICPR) (pp. 15641570). Montreal, QC, Canada, 21–25 August 2022. 10.1109/ICPR56361.2022.9956282
  6. Briga, M., Ketola, T., & Lummaa, V. (2022). The epidemic dynamics of three childhood infections and the impact of first vaccination in 18th and 19th century Finland. medRxiv. 10.1101/2022.10.30.22281707
  7. Clarke, A. L., & Low, B. S. (1992). Ecological correlates of human dispersal in 19th century Sweden. Animal Behaviour, 44(4), 677693. 10.1016/S0003-3472(05)80295-7
  8. Clinchant, S., Déjean, H., Meunier, J.-L., Lang, E. M., & Kleber, F. (2018). Comparing machine learning approaches for table recognition in historical register books. In 13th IAPR international workshop on document analysis systems (DAS) (pp. 133138). (Accessed: 2025-02-28). 10.1109/DAS.2018.44
  9. Colutto, S., Kahle, P., Guenter, H., & Muehlberger, G. (2019). Transkribus. a platform for automated text recognition and searching of historical documents. In 15th international conference on escience (escience) (pp. 463466). San Diego, CA, USA, 2019. 10.1109/eScience.2019.00060
  10. Deng, Q., Ibrayim, M., Hamdulla, A., & Zhang, C. (2024). The YOLO model that still excels in document layout analysis. Signal, Image and Video Processing, 18(2), 15391548. 10.1007/s11760-023-02838-y
  11. Ehrmann, M., Hamdi, A., Pontes, E. L., Romanello, M., & Doucet, A. (2023). Named entity recognition and classification in historical documents: A survey. ACM Computing Surveys, 56(2), 147. 10.1145/3604931
  12. Engman, M. (1978). Migration from Finland to Russia during the Nineteenth Century. Scandinavian Journal of History, 3, 155177. Retrieved 2024-06-20, from https://www.tandfonline.com/doi/abs/10.1080/03468757808578934
  13. Finland’s Family History Association (FFHA). (2025). Finland’s family history association website. https://www.sukuhistoria.fi/sshy/index.htm. (Accessed: 2025-01-08).
  14. Fiorucci, M., Khoroshiltseva, M., Pontil, M., Traviglia, A., Del Bue, A., & James, S. (2020). Machine learning for cultural heritage: A survey. Pattern Recognition Letters, 133, 102108. Retrieved from https://www.sciencedirect.com/science/article/pii/S0167865520300532 (Accessed: 2025-05-02).
  15. Granell, E., Romero, V., Prieto, J. R., Andrés, J., Quirós, L., Sánchez, J. A., & Vidal, E. (2023). Processing a large collection of historical tabular images. Pattern Recognition Letters, 170, 916. 10.1016/j.patrec.2023.04.007
  16. He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask r-cnn. In 2017 ieee international conference on computer vision (iccv) (pp. 29802988). Venice, Italy, 22–29 October 2017. Retrieved from https://ieeexplore.ieee.org/document/8237584 (Accessed: 2024-10-31).
  17. Hietala, K. (1981). Internal migration and technological development. Finnish Yearbook of Population Research, 19, 2846. 10.23979/fypr.44751
  18. Honkola, T., Ruokolainen, K., Syrjänen, K. J. J., Leino, U.-P., Tammi, I., Wahlberg, N., & Vesakoski, O. (2018). Evolution within a language: environmental differences contribute to divergence of dialect groups. BMC Evolutionary Biology, 18(1), 132. Retrieved from 10.1186/s12862-018-1238-6 (Accessed: 2025-07-15).
  19. Huang, Y., Yan, Q., Li, Y., Chen, Y., Wang, X., Gao, L., & Tang, Z. (2019). A YOLO-Based Table Detection Method. In 2019 international conference on document analysis and recognition (icdar) (pp. 813818). Sydney, NSW, Australia, 2019. 10.1109/ICDAR.2019.00135
  20. Kansallisarkisto. (2024). Multicentury htr model: Handwritten text recognition. Hugging Face. https://huggingface.co/Kansallisarkisto/multicentury-htr-model/. (Accessed: 2024-12-27)
  21. Kerminen, S., Cerioli, N., Pacauskas, D., Havulinna, A. S., Perola, M., Jousilahti, P., … Pirinen, M. (2021). Changes in the fine-scale genetic structure of Finland through the 20th century. PLOS Genetics, 17(3). Retrieved from https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1009347 (Accessed: 2025-07-15).
  22. Kesztenbaum, L. (2008). Cooperation and coordination among siblings: Brothers’ migration in France, 1870–1940. The history of the family, 13(1), 85104. Retrieved from http://www.sciencedirect.com/science/article/pii/S1081602X08000067
  23. Ketola, T., Briga, M., Honkola, T., & Lummaa, V. (2021). Town population size and structuring into villages and households drive infectious disease risks in pre-healthcare finland. Proceedings of the Royal Society B: Biological Sciences, 288(1949), 20210356. 10.1098/rspb.2021.0356
  24. Lehenmeier, C., Burghardt, M., & Mischka, B. (2020). Layout detection and table recognition – recent challenges in digitizing historical documents and handwritten tabular data. In M. Hall, T. Merčun, T. Risse, & F. Duchateau (Eds.), Digital libraries for open knowledge (pp. 229242). Cham: Springer International Publishing. 10.1007/978-3-030-54956-5_17
  25. Li, M., Lv, T., Chen, J., Cui, L., Lu, Y., Florencio, D., … Wei, F. (2023). TrOCR: Transformer-based optical character recognition with pre-trained models. In Proceedings of the AAAI conference on artificial intelligence (Vol. 37, pp. 1309413102). 10.1609/aaai.v37i11.26538
  26. Nitsch, A., Lummaa, V., & Faurie, C. (2016). Sibship effects on dispersal behaviour in a pre-industrial human population. Journal of Evolutionary Biology, 29, 19861998. Retrieved from http://onlinelibrary.wiley.com/doi/10.1111/jeb.12922/abstract
  27. Nitsch, A., Lummaa, V., & Faurie, C. (2023). Sibling competition, dispersal and fitness outcomes in humans. Scientific Reports, 13(7539). 10.1038/s41598-023-33700-3
  28. Nitsch, A., Lummaa, V., Ketola, T., Honkola, T., Vesakoski, O., & Briga, M. (2025). The spatial distribution of pertussis, but not measles or smallpox, in pre-industrial Finland matches dialects. iScience, 28(6). 2025 Apr 26. 10.1016/j.isci.2025.112530
  29. Nockels, J., Gooding, P., Ames, S., & Terras, M. (2022). Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published research. Archival science, 22(3), 367392. 10.1007/s10502-022-09397-0
  30. Paikkala, S., Mikkonen, P., Pitkänen, R. L., Slotte, P., & Aapala, K. (2007). Suomalainen paikannimikirja. Karttakeskus. (In Finnish)
  31. Pasanen, T.-M., Helske, J., Högmander, H., & Ketola, T. (2024). Spatio-temporal modeling of co-dynamics of smallpox, measles, and pertussis in pre-healthcare Finland. PeerJ, 12, e18155. 10.7717/peerj.18155
  32. Pitkänen, K. (1980). Registering people in a changing society-the case of Finland. Finnish Yearbook of Population Research, 18, 6079. 10.23979/fypr.44745
  33. Pontes, E. L., Cabrera-Diego, L. A., Moreno, J. G., Boros, E., Hamdi, A., Doucet, A., … Coustaty, M. (2022). Melhissa: a multilingual entity linking architecture for historical press articles. International Journal on Digital Libraries, 23(2), 133160. 10.1007/s00799-021-00319-6 (Accessed: 2025-07-15)
  34. Svalestuen, A. A. (1977). Five local studies of Nordic emigration and migration. American Studies in Scandinavia, 9, 1763. 10.22439/asca.v9i1.2577
  35. Tkachenko, M., Malyuk, M., Holmanyuk, A., & Liubimov, N. (2020–2025). Label Studio: Data labeling software. Retrieved from https://github.com/HumanSignal/label-studio (Accessed: 2025-05-02)
  36. Ultralytics. (2024). Yolov11 implementation. GitHub. https://github.com/ultralytics/ultralytics. (Retrieved from GitHub on 2024-12-01).
DOI: https://doi.org/10.5334/johd.345 | Journal eISSN: 2059-481X
Language: English
Submitted on: Jun 6, 2025
|
Accepted on: Jul 21, 2025
|
Published on: Aug 29, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Ari Vesalainen, Jenna Kanerva, Aïda Nitsch, Kiia Korsu, Ilari Larkiola, Laura Ruotsalainen, Filip Ginter, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.