Have a personal or library account? Click to login
Exploring Data Preparation Strategies: A Comparative Analysis of Vision Transformer and ConvNext Architectures in Breast Cancer Histopathology Classification Cover

Exploring Data Preparation Strategies: A Comparative Analysis of Vision Transformer and ConvNext Architectures in Breast Cancer Histopathology Classification

Open Access
|Jun 2025

Abstract

Breast cancer remains a major global health challenge and the accurate classification of histopathological samples into benign and malignant categories is critical for effective diagnosis and treatment planning. This study offers a comparative analysis of two state-of-the-art deep learning architectures, Vision Transformer (ViT) and ConvNeXT for breast cancer histopathology image classification, focusing on the impact of data preparation strategies. Using the BreakHis benchmark dataset, we investigated six distinct preprocessing approaches, including image resizing, patch-based techniques, and cellular content filtering, applied across four magnification levels (40×, 100×, 200×, and 400×). Both models were fine-tuned and evaluated using multiple performance metrics: accuracy, precision, recall, F1 score, and area under the receiver operating characteristic curve (AUC). The results highlight the critical influence of data preparation on model performance. ViT achieved its highest accuracy of 95.6% and an F1 score of 96.8% at 40× magnification with randomly generated patches. ConvNeXT demonstrated strong robustness across scenarios, attaining a precision of 98.5% at 100× magnification using non-overlapping patches. These findings emphasize the importance of customized data preprocessing and informed model selection in improving diagnostic accuracy. Optimizing both architectural design and data handling is essential to enhancing the reliability of automated histopathological analysis and supporting clinical decision-making.

DOI: https://doi.org/10.61822/amcs-2025-0023 | Journal eISSN: 2083-8492 | Journal ISSN: 1641-876X
Language: English
Page range: 329 - 339
Submitted on: Jan 10, 2025
Accepted on: Mar 27, 2025
Published on: Jun 24, 2025
Published by: University of Zielona Góra
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2025 Mikołaj Kaczmarek, Marek Kowal, Józef Korbicz, published by University of Zielona Góra
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.