
ARMDiaRD: A robust multi-class diabetic retinopathy detection using hybrid Swin Transformers with hierarchical fusion

Open Access
| Feb 2026

Abstract

Diabetic retinopathy (DR) is an eye disease caused by prolonged diabetes and remains a major cause of vision loss among middle-aged adults. Early detection of DR through advanced imaging and artificial intelligence (AI) techniques can significantly reduce the risk and severity of vision loss. The proposed ARMDiaRD combines state-of-the-art techniques from EfficientNet, Swin Transformers, and multi-scale feature fusion (MSFF) to enhance multi-class classification of DR severity levels across diverse datasets. In the proposed architecture, a global context (GC) block is integrated into EfficientNet to capture long-range dependencies and contextual relationships. This is followed by Swin Transformer layers equipped with an MSFF block that hierarchically aggregates features from multiple levels, enabling the model to learn richer and more discriminative representations. Fundus images from four publicly available DR datasets, the Asia Pacific Tele-Ophthalmology Society (APTOS) dataset, the Indian Diabetic Retinopathy Image Dataset (IDRiD), MESSIDOR-V1, and EyePACS, are combined through a comprehensive aggregation process to train and test the generalised model. This design demonstrates consistent improvements across multiple evaluation metrics, underlining its potential to reduce misclassification in medical diagnosis. To evaluate the proposed model, performance metrics such as accuracy, precision, recall, specificity, and F1-score are calculated, along with the quadratic weighted kappa (QWK), Spearman rank correlation coefficient, and mean absolute error (MAE). Simulation results show that the proposed model achieves an accuracy of 87.59%, a precision of 87.6%, a recall of 87.9%, a QWK of 91.47%, and a Spearman rank correlation coefficient of 92.53%. Importantly, the MAE, a critical metric for evaluating false predictions in medical diagnosis, is 0.1736 for the proposed model.
The results clearly demonstrate the superiority of the proposed model over competing models in handling multiple large datasets, and its generalisability in predicting severity grades on large, complex datasets. ARMDiaRD confirms that combining local, global, and hierarchical features is highly effective in preventing the overfitting issues seen in existing architectures, such as CNNs, when tested on new data for reliable clinical image analysis.
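Because DR grading is an ordinal task, the abstract reports QWK, the Spearman rank correlation coefficient, and MAE alongside the usual classification metrics. The sketch below shows one common way these three ordinal metrics are computed for 5-grade DR labels; the label and prediction arrays are hypothetical illustrations, not data from the paper.

```python
import numpy as np
from scipy.stats import spearmanr


def quadratic_weighted_kappa(y_true, y_pred, n_classes=5):
    """QWK: chance-corrected agreement that penalises larger
    grade disagreements quadratically."""
    # Observed confusion matrix
    O = np.zeros((n_classes, n_classes))
    for t, p in zip(y_true, y_pred):
        O[t, p] += 1
    # Quadratic disagreement weights: w[i, j] = (i - j)^2 / (N - 1)^2
    w = np.array([[(i - j) ** 2 for j in range(n_classes)]
                  for i in range(n_classes)]) / (n_classes - 1) ** 2
    # Expected confusion matrix under independence of the marginals
    E = np.outer(O.sum(axis=1), O.sum(axis=0)) / O.sum()
    return 1.0 - (w * O).sum() / (w * E).sum()


# Hypothetical 5-grade DR labels (0 = no DR ... 4 = proliferative DR)
y_true = np.array([0, 1, 2, 3, 4, 2, 1, 0])
y_pred = np.array([0, 1, 2, 2, 4, 3, 1, 1])

qwk = quadratic_weighted_kappa(y_true, y_pred)
mae = np.abs(y_true - y_pred).mean()       # mean absolute grade error
rho, _ = spearmanr(y_true, y_pred)         # rank correlation
print(f"QWK={qwk:.3f}  MAE={mae:.3f}  Spearman={rho:.3f}")
```

Unlike plain accuracy, QWK and MAE distinguish a near-miss (predicting grade 2 for grade 3) from a gross error (predicting grade 0 for grade 4), which is why the abstract highlights the low MAE of 0.1736 as clinically meaningful.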

Language: English
Submitted on: Aug 22, 2025 | Published on: Feb 20, 2026
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 J. Dhiviya Rose, Ved Prakash Bhardwaj, published by Professor Subhas Chandra Mukhopadhyay
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.