Langevin Resonance Transformer for Cross-Script Writer Identification
Abstract
Vision Transformers (ViTs) have emerged as a powerful architecture for various computer vision tasks, including writer identification. However, like their CNN counterparts, they are susceptible to performance degradation in cross-script scenarios because their standard self-attention mechanism learns script-specific visual correlations. To overcome this, we propose a novel architecture, the Langevin Resonance Transformer (LRT). The LRT fundamentally redefines self-attention by replacing the abstract mathematical operation with a physically grounded dynamic simulation in which each image patch is treated as a particle. The core of the LRT is a novel Langevin Attention Layer, where the interaction between pairs of particles is governed by a learnable potential energy function. The net force on each particle is aggregated from all other particles, and its state is evolved according to the Langevin equation, which models motion under both deterministic and stochastic forces. The LRT thus treats a writer's style as a physical system defined by an energy landscape. Because the LRT models writing biomechanics rather than script-specific visual shapes, the learned representation is largely script-invariant. We evaluate the LRT on the BRS-ID dataset with a custom augmentation strategy. The results show that the LRT achieves higher accuracy than standard Vision Transformers and other state-of-the-art models on the cross-script identification task.
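The attention mechanism described above can be illustrated with a minimal sketch. This is not the paper's implementation: the learnable potential is stood in for by a fixed quadratic pair potential U(x_i, x_j) = ½k‖x_i − x_j‖², and the state update follows the overdamped Langevin equation, combining the deterministic force with Gaussian thermal noise. The function name, parameters, and hyperparameter values are illustrative assumptions.

```python
import numpy as np

def langevin_attention_step(x, k=0.1, dt=0.01, temperature=0.05, rng=None):
    """One overdamped Langevin update over patch states.

    x : (n_patches, dim) array; each row is one patch treated as a particle.
    NOTE: the quadratic pair potential below is an illustrative stand-in
    for the paper's learnable potential energy function.
    """
    if rng is None:
        rng = np.random.default_rng(0)
    # Pairwise displacements x_i - x_j, shape (n, n, dim)
    diffs = x[:, None, :] - x[None, :, :]
    # Net deterministic force on particle i: F_i = -sum_j dU/dx_i = -k * sum_j (x_i - x_j)
    force = -k * diffs.sum(axis=1)
    # Langevin equation: drift from the force plus stochastic thermal kicks
    noise = rng.standard_normal(x.shape)
    return x + dt * force + np.sqrt(2.0 * temperature * dt) * noise
```

With an attractive quadratic potential and low temperature, repeated steps pull the particles toward their common mean, i.e. the system relaxes toward a low-energy configuration; a learned potential would instead shape writer-specific equilibria.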
© 2026 Sk Golam Sarowar Hossain, Mridul Ghosh, Tonmoy Mete, Mária Ždímalová, Kaushik Roy, Sk Md Obaidullah, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.