Large Language Model-Based Detoxification for Bahasa Indonesia

Badrus Zaman; Naufal Humam; Indra Kharisma Raharjana

doi:10.2478/cait-2025-0019

.blurhash-client-img { display: none !important; }

Large Language Model-Based Detoxification for Bahasa Indonesia

Cybernetics and Information Technologies

Volume 25 (2025): Issue 3 (September 2025)

By: Badrus Zaman, Naufal Humam and Indra Kharisma Raharjana

Open Access

|Sep 2025

Margono, H., M. Saud, A. Ashfaq. Dynamics of Hate Speech in Social Media: Insights from Indonesia. Global Knowledge, Memory, and Communication. 2024. DOI: 10.1108/GKMC-11-2023-0464.
Search in Google Scholar Back to article
Pamungkas, E. W., D. G. P. Putri, A. Fatmawati. Hate Speech Detection in Bahasa Indonesia: Challenges and Opportunities. – International Journal of Advanced Computer Science and Applications, Vol. 14, 2023, No 6. DOI: 10.14569/IJACSA.2023.01406125.
Search in Google Scholar Back to article
Zaman, B., A. Justitia, K. N. Sani, E. Purwanti. An Indonesian Hoax News Detection System Using Reader Feedback and Naïve Bayes Algorithm. – Cybernetics and Information Technologies, Vol. 20, 2020, No 1, pp. 82-94.
Search in Google Scholar Back to article
Ibrohim, M. O., M. A. Setiadi, I. Budi. Identification of Hate Speech and Abusive Language on Indonesian Twitter Using Word2vec, Part-of-Speech, and Emoji Features. – In: Proc. of 1st International Conference on Advanced Information Science and System, November 2019, pp. 1-5. DOI: 10.1145/3373477.3373495.
Search in Google Scholar Back to article
Kusuma, J. F., A. Chowanda. Indonesian Hate Speech Detection Using IndoBERTweet and BiLSTM on Twitter. – JOIV: International Journal on Informatics Visualization, Vol. 7, 2023, No 3, pp. 773-780. DOI: 10.30630/joiv 7.3.1035.
Search in Google Scholar Back to article
Dementieva, D., D. Moskovskiy, V. Logacheva, D. Dale, O. Kozlova, N. Semenov, A. Panchenko. Methods for Detoxification of Texts for the Russian Language. – Multimodal Technologies and Interaction, Vol. 5, 2021, No 9, p. 54. DOI: 10.3390/mti5090054.
Search in Google Scholar Back to article
Dale, D., A. Voronov, D. Dementieva, V. Logacheva, O. Kozlova, N. Semenov, A. Panchenko. Text Detoxification Using Large Pre-Trained Neural Models. – arXiv preprint arXiv:2109.08914. 2021. DOI: 10.18653/v1/2021.emnlp-main.629.
Search in Google Scholar Back to article
Hamtini, T., A. J. Assaf. Exploring the Efficacy of GenAI in Grading SQL Query Tasks: A Case Study. – Cybernetics and Information Technologies, Vol. 3, 2024, No 3, pp. 102-111.
Search in Google Scholar Back to article
Sourabrata, M., B. Akanksha, K. O. Atul, P. M. John, D. Ondrej. Text Detoxification as Style Transfer in English and Hindi. – In: Proc. of 20th International Conference on Natural Language Processing (ICON’23), December 2023, pp. 133-144. DOI: 10.48550/arXiv.2402.07767.
Search in Google Scholar Back to article
Dementieva, D., N. Babakov, A. Panchenko. Multiparadetox: Extending Text Detoxification with Parallel Data to New Languages. – arXiv preprint arXiv:2404.02037. 2024. DOI: 10.18653/v1/2024.naacl-short.12.
Search in Google Scholar Back to article
Rana, M. R. R., A. Nawaz, T. Ali, A. S. Alattas, D. S. AbdElminaam. Sentiment Analysis of Product Reviews Using Transformer Enhanced 1D-CNN and BiLSTM. – Cybernetics and Information Technologies, Vol. 24, 2024, No 3, pp. 112-131.
Search in Google Scholar Back to article
Harisanty, D., N. E. V. Anna, R. Sugihartati, K. Srimulyo, M. F. B. Hamzah. Netizen Views on Artificial Intelligence: A Social Media Content Analysis. – Kurdish Studies, Vol. 12, 2024, No 1, pp. 365-376.
Search in Google Scholar Back to article
Ibrohim, M. O., I. Budi. Hate Speech and Abusive Language Detection in Indonesian Social Media: Progress and Challenges. – Heliyon, Vol. 9, 2023, No 8. DOI: 10.1016/j.heliyon.2023.e18647.
Search in Google Scholar Back to article
Logacheva, V., D. Dementieva, S. Ustyantsev, D. Moskovskiy, D. Dale, I. Krotova et al. Paradetox: Detoxification with Parallel Data. – In: Proc. of 60th Annual Meeting of the Association for Computational Linguistics, Vol. 1: Long Papers, May 2022, pp. 6804-6818. DOI: 10.18653/v1/2022.acl-long.469.
Search in Google Scholar Back to article
Dementieva, D., S. Ustyantsev, D. Dale, O. Kozlova, N. Semenov, A. Panchenko, V. Logacheva. Crowdsourcing of Parallel Corpora: The Case of Style Transfer for Detoxification. – In: CSW@ VLDB, August 2021, pp. 35-49.
Search in Google Scholar Back to article
Sari, D. A. P., A. Y. Putri, M. Hanggareni, A. Anjani, M. L. O. Siswondo, I. K. Raharjana. Crowdsourcing as a Tool to Elicit Software Requirements. – In: AIP Conference Proceedings. Vol. 2329. No 1. February 2021, 050001. AIP Publishing LLC. DOI: 10.1063/5.0042134.
Search in Google Scholar Back to article
Romadhony, A., S. Al Faraby, R. Rismala, U. N. Wisesti, A. Arifianto. Sentiment Analysis on a Large Indonesian Product Review Dataset. – Journal of Information Systems Engineering & Business Intelligence, Vol. 10, 2024, No 1. DOI: 10.20473/jisebi.10.1.167-178.
Search in Google Scholar Back to article
Logacheva, V., D. Dementieva, I. Krotova, A. Fenogenova, I. Nikishina, T. Shavrina, A. Panchenko. A Study on Manual and Automatic Evaluation for Text Style Transfer: The Case of Detoxification. – In: Proc. of 2nd Workshop on Human Evaluation of NLP Systems (HumEval’22), May 2022, pp. 90-101. DOI: 10.18653/v1/2022.humeval-1.8.
Search in Google Scholar Back to article
Ibrohim, M. O., I. Budi. Multi-Label Hate Speech and Abusive Language Detection in Indonesian Twitter. – In: Proc. of 3rd Workshop on Abusive Language Online, August 2019, pp. 46-57. DOI: 10.18653/v1/w19-3506.
Search in Google Scholar Back to article
Fakhruzzaman, M. N., S. W. Gunawan. CekUmpanKlik: An Artificial Intelligence-Based Application to Detect Indonesian Clickbait. – IAES International Journal of Artificial Intelligence, Vol. 11, 2022, No 4, 1232. DOI: 10.11591/ijai.v11.i4.pp1232-1238.
Search in Google Scholar Back to article
Krishna, K., J. Wieting, M. Iyyer. Reformulating Unsupervised Style Transfer as Paraphrase Generation. – arXiv Preprint arXiv:2010.05700. 2020. DOI: 10.18653/v1/2020.emnlp-main.55.
Search in Google Scholar Back to article
Moskovskiy, D., S. Pletenev, A. Panchenko. LLMs to Replace Crowdsourcing for Parallel Data Creation? The Case of Text Detoxification. –In: Proc. of Findings of the Association for Computational Linguistics (EMNLP’24), November 2024, pp. 14361-14373. DOI: 10.18653/v1/2024.findings-emnlp.839.
Search in Google Scholar Back to article
Susanto, L., M. I. Wijanarko, P. A. Pratama, T. Hong, I. Idris, A. F. Aji, D. Wijaya. IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language. – arXiv Preprint arXiv:2406.19349. 2024. DOI: 10.48550/arXiv.2406.19349.
Search in Google Scholar Back to article
Touvron, H., L. Martin, K. Stone, P. Albert, A. Almahairi, Y. Babaei, et al. Llama 2: Open Foundation and Fine-Tuned Chat Models. – arXiv Preprint arXiv:2307.09288. 2023. http://arxiv.org/abs/2307.09288
Search in Google Scholar Back to article
GoToCompany, “Llama3 8B CPT Sahabat-AI v1 Instruct”. Online Accessed 20 February 2025. https://huggingface.co/GoToCompany/llama3-8b-cpt-sahabatai-v1-instruct
Search in Google Scholar Back to article
Brown, T., B. Mann, N. Ryder, M. Subbiah, J. D. Kaplan, P. Dhariwal et al. Language Models Are Few-Shot Learners. – Advances in Neural Information Processing Systems, Vol. 33, 2020, pp. 1877-1901.
Search in Google Scholar Back to article
Koto, F., A. Rahimi, J. H. Lau, T. Baldwin. IndoLEM and IndoBERT: A Benchmark Dataset and Pre-Trained Language Model for Indonesian NLP. – arXiv Preprint arXiv:2011.00677. 2020. DOI: 10.18653/v1/2020.coling-main.66.
Search in Google Scholar Back to article
Luo, Y., Z. Yang, F. Meng, Y. Li, J. Zhou, Y. Zhang. An Empirical Study of Catastrophic Forgetting in Large Language Models during Continual Fine-Tuning. – arXiv Preprint arXiv:2308.08747. 2023.
Search in Google Scholar Back to article
Ayele, A. A., N. Babakov, J. Bevendorff, X. B. Casals, B. Chulvi, D. Dementieva et al. Overview of PAN 2024: Multi-Author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification Condensed Lab Overview. – In: Proc. of International Conference of the Cross-Language Evaluation Forum for European Languages, September 2024, pp. 231-259. Cham, Switzerland, Springer Nature.
Search in Google Scholar Back to article
Iglesias, M., O. Araque, C. Á. Iglesias. A Toxic Style Transfer Method Based on the Delete-Retrieve-Generate Framework Exploiting Toxic Lexicon Semantic Similarity. – Applied Sciences, Vol. 13, 2023, No 15, 8590. DOI: 10.3390/app13158590.
Search in Google Scholar Back to article
Laugier, L., J. Pavlopoulos, J. Sorensen, L. Dixon. Civil Rephrases of Toxic Texts with Self-Supervised Transformers. – arXiv Preprint arXiv:2102.05456. 2021.
Search in Google Scholar Back to article
Dementieva, D., N. Babakov, A. Ronen, A. A. Ayele, N. Rizwan, F. Schneider et al. Multilingual and Explainable Text Detoxification with Parallel Corpora. – arXiv preprint arXiv:2412.11691. 2024.
Search in Google Scholar Back to article
Dementieva, D., V. Logacheva, I. Nikishina, A. Fenogenova, D. Dale, I. Krotova et al. Russe-2022: Findings of the First Russian Detoxification Shared Task Based on Parallel Corpora. – Computational Linguistics and Intellectual Technologies, 2022. DOI: 10.28995/2075-7182-2022-21-114-131.
Search in Google Scholar Back to article
Krause, B., A. D. Gotmare, B. McCann, N. S. Keskar, S. Joty, R. Socher, N. F. Rajani. Gedi: Generative Discriminator Guided Sequence Generation. – arXiv Preprint arXiv:2009.06367. 2020. DOI: 10.18653/v1/2021.findings-emnlp.424.
Search in Google Scholar Back to article
Roiqoh, S., B. Zaman, K. Kartono. Analisis Sentimen Berbasis Aspek Ulasan Aplikasi Mobile JKN Dengan Lexicon Based dan Naïve Bayes. – Jurnal Media Informatika Budidarma, Vol. 7, 2023, No 3, pp. 1582-1592.
Search in Google Scholar Back to article

Authors

Metrics

Articles in this issue

DOI: https://doi.org/10.2478/cait-2025-0019 | Journal eISSN: 1314-4081 | Journal ISSN: 1311-9702

Journal RSS Feed

Language: English

Page range: 3 - 21

Submitted on: May 2, 2025

Accepted on: Jun 26, 2025

Published on: Sep 25, 2025

Published by: Bulgarian Academy of Sciences, Institute of Information and Communication Technologies

In partnership with: Paradigm Publishing Services

Keywords:

Bahasa Indonesia,

Large language models,

Related subjects:

Information technology

© 2025 Badrus Zaman, Naufal Humam, Indra Kharisma Raharjana, published by Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Volume 25 (2025): Issue 3 (September 2025)