Have a personal or library account? Click to login
Deepfake Voice Detection for Underrepresented Languages: A Romanian Case Study Cover

Abstract

The rapid advancement of artificial intelligence and its subsequent application in deepfake media present a significant global security concern. Despite this widespread issue, effective detection solutions are notably absent for less commonly used languages. This paper proposes a potential solution for identifying generated audio in Romanian. The solution centres on an SVM-based algorithm, previously demonstrated to perform effectively in English language tests, adapted with a dataset specifically tailored to the Romanian language. The resulting language-specific model exhibits better performance in differentiating between authentic and synthetic Romanian audio, thereby offering an improvement over general-purpose, Anglocentric systems. This constitutes an effective strategy for developing localised solutions for a language with limited resources.

DOI: https://doi.org/10.2478/ijasitels-2025-0003 | Journal eISSN: 2559-365X | Journal ISSN: 2067-354X
Language: English
Page range: 63 - 72
Published on: Dec 17, 2025
Published by: Lucian Blaga University of Sibiu
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2025 Bălășoiu Robert-Alin, Ivașcu Ioana-Daniela, Mihu Cantemir, Muntean Robert-Andrei, Olescu Marco Leon, Resiga Sorana-Ioana, published by Lucian Blaga University of Sibiu
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.