Have a personal or library account? Click to login
Creating a Historical Migration Dataset from Finnish Church Records, 1800–1920 Cover

Creating a Historical Migration Dataset from Finnish Church Records, 1800–1920

Open Access
|Aug 2025

Abstract

This article presents a large-scale effort to create a structured dataset of internal migration in Finland between 1800 and 1920 using digitized church moving records. These records, maintained by Evangelical-Lutheran parishes, document the migration of individuals and families and offer a valuable source for studying historical demographic patterns. The dataset includes over six million entries extracted from approximately 200,000 images of handwritten migration records.

The data extraction process was automated using a deep learning pipeline that included layout analysis, table detection, cell classification, and handwriting recognition. The complete pipeline was applied to all images, resulting in a structured dataset suitable for research.

The dataset can be used to study internal migration, urbanization, and family migration, and the spread of disease in preindustrial Finland. A case study from the Elimäki parish shows how local migration histories can be reconstructed. The work demonstrates how large volumes of handwritten archival material can be transformed into structured data to support historical and demographic research.

DOI: https://doi.org/10.5334/johd.345 | Journal eISSN: 2059-481X
Language: English
Submitted on: Jun 6, 2025
|
Accepted on: Jul 21, 2025
|
Published on: Aug 29, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Ari Vesalainen, Jenna Kanerva, Aïda Nitsch, Kiia Korsu, Ilari Larkiola, Laura Ruotsalainen, Filip Ginter, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.