Skip to main content

Contextualized Vision Transformers (CVT): Adaptive Spectral Embedding and Feature Gating for Precise Text-Graphics Classification Cover

.blurhash-client-img { display: none !important; }

Contextualized Vision Transformers (CVT): Adaptive Spectral Embedding and Feature Gating for Precise Text-Graphics Classification

Tatra Mountains Mathematical Publications

By: Mridul Ghosh, Konrad Dürrbeck, Roland Fischer, Mária Ždímalová and Tonmoy Mete

Open Access

|Mar 2026

Authors

Mridul Ghosh

mridulxyz@gmail.com

Department of Computer Science, Shyampur Siddheswari Mahavidyalaya, Howrah, India

Risk and location analysis, Fraunhofer IIS, Nuremberg, Germany

Konrad Dürrbeck

konrad.duerrbeck@iis.fraunhofer.de

Risk and location analysis, Fraunhofer IIS, Nuremberg, Germany

Roland Fischer

roland.fischer@iis.fraunhofer.de

Risk and location analysis, Fraunhofer IIS, Nuremberg, Germany

Mária Ždímalová

maria.zdimalova@stuba.sk

Department of Mathematics and Descriptive Geometry, Slovak University of Technology in Bratislava, Bratislava, Slovakia

Tonmoy Mete

tonmoy.mete@asutoshcollege.in

Department of Computer Science, Asutosh College, Kolkata, India

Articles in this issue

DOI: https://doi.org/10.2478/tmmp-2026-0003 | Journal eISSN: 1338-9750 | Journal ISSN: 1210-3195

Journal RSS Feed

Language: English

Submitted on: Oct 8, 2025

|

Accepted on: Nov 27, 2026

|

Published on: Mar 17, 2026

Published by: Slovak Academy of Sciences, Mathematical Institute

In partnership with: Paradigm Publishing Services

Publication frequency: 1 issue per year

Keywords:

vision transformer,

classification,

learnable patch decomposition

Related subjects:

General mathematics

© 2026 Mridul Ghosh, Konrad Dürrbeck, Roland Fischer, Mária Ždímalová, Tonmoy Mete, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.