Have a personal or library account? Click to login
Automated Transcription of Historical Encrypted Manuscripts Cover

Automated Transcription of Historical Encrypted Manuscripts

By: Eugen Antal and  Pavol Marák  
Open Access
|Feb 2023

Abstract

This paper deals with historical encrypted manuscripts and introduces an automated method for the detection and transcription of ciphertext symbols for subsequent cryptanalysis. Our database contains documents used in the past by aristocratic families living in the territory of Slovakia. They are encrypted using a nomenclator which is a specific type of substitution cipher. In our case, the nomenclator uses digits as ciphertext symbols. We have proposed a method for the detection, classification, and transcription of handwritten digits from the original documents. Our method is based on Mask R-CNN which is a deep convolutional neural network for instance segmentation. Mask R-CNN was trained on a manually collected database of digit annotations. We employ a specific strategy where the input image is first divided into small blocks. The image blocks are then passed to Mask R-CNN to obtain detections. This way we avoid problems related to the detection of a large number of small dense objects in a high-resolution image. Experiments have shown promising detection performance for all digit types with minimum false detections.

DOI: https://doi.org/10.2478/tmmp-2022-0019 | Journal eISSN: 1338-9750 | Journal ISSN: 12103195
Language: English
Page range: 65 - 86
Submitted on: Oct 2, 2022
Published on: Feb 15, 2023
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 3 issues per year

© 2023 Eugen Antal, Pavol Marák, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.