Deep Learning in Archiving Indus Script and Motif Information

Vaishnavi Dixit; Nushrat Hussain; Shubham Basak; Deva Atturu; Debasis Mitra; Ujjwal Bhattacharya

doi:10.5334/jcaa.175

Abstract

This work presents a novel computational system for the automated digitization of image-based data from seals of the ancient Indus Valley Civilization (IVC). The objective of this system’s design is to automatically extract and archive key information from seals or images, including the script and motifs. The system operates as a pipeline comprising three deep learning models integrated with a custom-designed database. Two models form the Ancient Script Recognition network (ASR-net), which digitizes sequences of graphemes from Indus seals, similar to Optical Character Recognition for modern languages. The third model, the Motif Identification network (MI-net), identifies recurring motifs—distinctive symbols or iconographic elements with specific functional significance in the IVC. The database stores the extracted information, linking it to the respective seal images in a structured format. This end-to-end pipeline has been fully implemented, from image input to database archival. The overarching aim of this work is to support the application of automated statistical methods in the ongoing efforts to decipher the Indus script.

References

1Atturu, DMR. 2024. Deep Learning in Indus Valley Script Digitization. Master’s thesis. Florida Institute of Technology. Available at: https://repository.fit.edu/etd/1416.
Back to article
2Barucci, A, Amendola, M, Argenti, F, Canfailla, C, Cucci, C, Guidi, T, Python, L and Franci, M. 2023. Discovering Ancient Egyptian Hieroglyphs with Deep Learning. Italian National Council (CNR). Available at: https://www.ifac.cnr.it/wp-content/BOOKS/BOOK/HORUS/testoHORUS.pdf.
Back to article
3Daggumati, S and Revesz, PZ. 2021. ‘A method of identifying allographs in undeciphered scripts and its application to the indus valley script’. Humanities and Social Sciences Communications, 8: 1–14. DOI: 10.1057/s41599-021-00713-0
Back to article
4Fuls, A. 2013. ‘Positional analysis of indus signs’. Epigrafika, 7: 253–275.
Back to article
5Fuls, A. 2015. ‘Appendix I: Automated Segmentation of Indus Texts’. Archaeopress Publishing Ltd. pp. 100–118. DOI: 10.2307/j.ctvr43jmf.15
Back to article
6He, K, Zhang, X, Ren, S and Sun, J. 2016. ‘Deep residual learning for image recognition’. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778. DOI: 10.1109/CVPR.2016.90
Back to article
7Howard, AG, Zhu, M, Chen, B, Kalenichenko, D, Wang, W, Weyand, T, Andreetto, M and Adam, H. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. CoRR abs/1704.04861. Available at: http://arxiv.org/abs/1704.04861, arXiv:1704.04861.
Back to article
8Hu, W, Zhan, HJ, Liu, C, Yin, B and Lu, Y. 2023. Ots: A one-shot learning approach for text spotting in historical manuscripts. arXiv preprint arXiv:2304.00746. DOI: 10.2139/ssrn.4419850
Back to article
9Huang, G, Liu, Z, Van Der Maaten, L and Weinberger, KQ. 2017. ‘Densely connected convolutional networks’. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2261–2269. DOI: 10.1109/CVPR.2017.243
Back to article
10Jocher, G, Chaurasia, A, Stoken, A, Borovec, J, NanoCode012, Kwon, Y, Michael, K, Xie, T, Fang, J, Imyhxy, Lorna, Yifu, Z, Wong, C, Abhiram V, Montes, D, Wang, Z, Fati, C, Nadar, J, Laughing, UnglvKitDe, Sonck, V, Tkianai, YxNONG, Skalski, P, Hogan, A, Nair, D, Strobel, M and Jain, M. 2022. ultralytics/yolov5: v7.0 – yolov5 sota realtime instance segmentation. Available at: https://zenodo.org/record/3908559.
Back to article
11Joshi, JP and Parpola, A. 1987. Corpus of Indus Seals and Inscriptions, Volumes I and II. Memoirs of the Archaeological Survey of India; no. 86, 96, 116; nide 359, 383, 386; tom. 239–240, 359, 383, 386. Suomalainen Tiedeakatemia, Helsinki.
Back to article
12Jun. 2022. ‘Historical review of mohenjo-daro and harappan civilization in pakistan’. Pacific International Journal, 5: 31–42. DOI: 10.55014/pij.v5i2.185
Back to article
13Kenoyer, J. 1998. Ancient cities of indus valley civilization. Karachi, Islamabad: Oxford University Press.
Back to article
14Mahadevan, I. 1977. The indus script: Text, concordance and tables. Issue 77 of Memoirs of the Archaeological Survey of India.
Back to article
15Mukhopadhyay, A. 2023. ‘Semantic scope of indus inscriptions comprising taxation, trade and craft licensing, commodity control and access control: archaeological and script-internal evidence’. Humanities and Social Sciences Communications, 10: 1–12. DOI: 10.1057/s41599-023-02320-7
Back to article
16Mukhopadhyay, BA. 2019. ‘Interrogating indus inscriptions to unravel their mechanisms of meaning conveyance’. Palgrave Communications, 5: 73. DOI: 10.1057/s41599-019-0274-1
Back to article
17Oakes, MP. 2019. ‘Statistical analysis of the tables in mahadevan’s concordance of the indus valley script’. Journal of Quantitative Linguistics, 26: 401–422. DOI: 10.1080/09296174.2017.1406294
Back to article
18Palaniappan, S and Adhikari, R. 2017. Deep learning indus script. PLOS Submission: arXiv preprint arXiv:1702.00523.DOI: 10.48550/arXiv.1702.00523
Back to article
19Parpola, A. 1994. Deciphering the Indus Script. Cambridge University Press.
Back to article
20Rao, VN and Mohanty, MK. 2015. ‘Comparative visual analysis of symbolic and illegible indus valley script with other languages’. IOSR Journal of Humanities and Social Science (IOSR-JHSS), 20: 66–72.
Back to article
21Redmon, J and Farhadi, A. 2018a. YOLOv3: An incremental improvement. arXiv preprint arXiv:1804.0276.
Back to article
22Redmon, J and Farhadi, A. 2018b. Yolov3: An incremental improvement. Available at: https://arxiv.org/abs/1804.02767, arXiv:1804.02767.
Back to article
23Sobhy, A, Helmy, M, Khalil, M, Elmasry, S, Boules, Y and Negied, N. 2023. ‘An ai based automatic translator for ancient hieroglyphic language—from scanned images to english text’. IEEE Access, 11: 38796–38804. DOI: 10.1109/ACCESS.2023.3267981
Back to article
24Szegedy, C, Ioffe, S and Vanhoucke, V. 2016a. ‘Inception-v4, inception-resnet and the impact of residual connections on learning’. In: CoRR. Available at: http://arxiv.org/abs/1602.07261, arXiv:1602.07261.
Back to article
25Szegedy, C, Vanhoucke, V, Vinyals, O and Wojna, Z. 2016b. ‘Rethinking the inception architecture for computer vision’. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2818–2826. DOI: 10.1109/CVPR.2016.308
Back to article
26Tan, M and Le, QV. 2019. ‘Efficientnet: Rethinking model scaling for convolutional neural networks’. In: 2019 International Conference on Machine Learning (ICML), pp. 6105–6114. Available at: http://proceedings.mlr.press/v97/tan19a.html.
Back to article
27Venkatesan, R and Li, B. 2017. Convolutional Neural Networks in Visual Computing: A Concise Guide. CRC Press. DOI: 10.4324/9781315154282
Back to article
28Wang, W, Duan, L, En, Q, Zhang, B and Liang, F. 2022. ‘Tpsn: Transformer-based multi-prototype search network for few-shot semantic segmentation’. Computers and Electrical Engineering, 103: 108326. DOI: 10.1016/j.compeleceng.2022.108326
Back to article
29Wells, B. 2006. Epigraphic approaches to indus writing. Dissertation, Harvard University.
Back to article
30Wells, B and Fuls, A. 2024. Online indus writing database. www.indus.epigraphica.de.
Back to article
31Yadav, N, Joglekar, H, Rao, RPN, Vahia, MN, Adhikari, R and Mahadevan, I. 2010. ‘Statistical analysis of the indus script using n-grams’. PLoS ONE, 5: e9506. DOI: 10.1371/journal.pone.0009506
Back to article

Deep Learning in Archiving Indus Script and Motif Information

Abstract

Paradigm

My account