Skip to main content
Have a personal or library account? Click to login
Contextualized Vision Transformers (CVT): Adaptive Spectral Embedding and Feature Gating for Precise Text-Graphics Classification Cover

Contextualized Vision Transformers (CVT): Adaptive Spectral Embedding and Feature Gating for Precise Text-Graphics Classification

Open Access
|Mar 2026

Authors

Mridul Ghosh

mridulxyz@gmail.com

Department of Computer Science, Shyampur Siddheswari Mahavidyalaya, Howrah, India
Risk and location analysis, Fraunhofer IIS, Nuremberg, Germany

Konrad Dürrbeck

konrad.duerrbeck@iis.fraunhofer.de

Risk and location analysis, Fraunhofer IIS, Nuremberg, Germany

Roland Fischer

roland.fischer@iis.fraunhofer.de

Risk and location analysis, Fraunhofer IIS, Nuremberg, Germany

Mária Ždímalová

maria.zdimalova@stuba.sk

Department of Mathematics and Descriptive Geometry, Slovak University of Technology in Bratislava, Bratislava, Slovakia

Tonmoy Mete

tonmoy.mete@asutoshcollege.in

Department of Computer Science, Asutosh College, Kolkata, India
DOI: https://doi.org/10.2478/tmmp-2026-0003 | Journal eISSN: 1338-9750 | Journal ISSN: 1210-3195
Language: English
Submitted on: Oct 8, 2025
Accepted on: Nov 27, 2026
Published on: Mar 17, 2026
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Mridul Ghosh, Konrad Dürrbeck, Roland Fischer, Mária Ždímalová, Tonmoy Mete, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

AHEAD OF PRINT