Have a personal or library account? Click to login
Hierarchical Arabic text classification: deep learning-based approach Cover

Abstract

Text classification is the task of assigning textual data to predefined categories, playing a crucial role in natural language processing. In recent years, deep learning models have demonstrated superior performance over traditional machine learning approaches in text classification tasks. This paper presents a supervised deep learning approach for hierarchical Arabic text classification. To facilitate this study, we developed WiHArD, a novel hierarchical Arabic text dataset, where each text is systematically labeled according to a structured category hierarchy. We then propose a deep learning model that integrates BERT-based feature extraction with a neural network classifier. BERT encodes textual inputs into dense vector representations, while the neural network learns to accurately classify texts within the hierarchical structure. Our comparative study demonstrates that the proposed BERT-ANN model achieves significant improvements in hierarchical classification performance, outperforming the existing HMATC model. These findings highlight the e ectiveness of deep learning-based approaches in advancing Arabic text classification.

DOI: https://doi.org/10.2478/awutm-2026-0001 | Journal eISSN: 1841-3307 | Journal ISSN: 1841-3293
Language: English
Page range: 1 - 15
Submitted on: Nov 25, 2024
Accepted on: Oct 26, 2025
Published on: Nov 4, 2025
Published by: West University of Timisoara
In partnership with: Paradigm Publishing Services
Publication frequency: Volume open

© 2025 Djelloul Bouchiha, Benamar Hamzaoui, Abdelghani Bouziane, Noureddine Doumi, Farouk Omar Berbouchi, Aymen Abdelghani Kebir, Nihad Mebarki, Badiâ Achouak Benameur, published by West University of Timisoara
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Volume 62 (2026): Issue 1 (January 2026)