Have a personal or library account? Click to login

Large Language Model-Based Detoxification for Bahasa Indonesia

Open Access
|Sep 2025

References

  1. Margono, H., M. Saud, A. Ashfaq. Dynamics of Hate Speech in Social Media: Insights from Indonesia. Global Knowledge, Memory, and Communication. 2024. DOI: 10.1108/GKMC-11-2023-0464.
  2. Pamungkas, E. W., D. G. P. Putri, A. Fatmawati. Hate Speech Detection in Bahasa Indonesia: Challenges and Opportunities. – International Journal of Advanced Computer Science and Applications, Vol. 14, 2023, No 6. DOI: 10.14569/IJACSA.2023.01406125.
  3. Zaman, B., A. Justitia, K. N. Sani, E. Purwanti. An Indonesian Hoax News Detection System Using Reader Feedback and Naïve Bayes Algorithm. – Cybernetics and Information Technologies, Vol. 20, 2020, No 1, pp. 82-94.
  4. Ibrohim, M. O., M. A. Setiadi, I. Budi. Identification of Hate Speech and Abusive Language on Indonesian Twitter Using Word2vec, Part-of-Speech, and Emoji Features. – In: Proc. of 1st International Conference on Advanced Information Science and System, November 2019, pp. 1-5. DOI: 10.1145/3373477.3373495.
  5. Kusuma, J. F., A. Chowanda. Indonesian Hate Speech Detection Using IndoBERTweet and BiLSTM on Twitter. – JOIV: International Journal on Informatics Visualization, Vol. 7, 2023, No 3, pp. 773-780. DOI: 10.30630/joiv 7.3.1035.
  6. Dementieva, D., D. Moskovskiy, V. Logacheva, D. Dale, O. Kozlova, N. Semenov, A. Panchenko. Methods for Detoxification of Texts for the Russian Language. – Multimodal Technologies and Interaction, Vol. 5, 2021, No 9, p. 54. DOI: 10.3390/mti5090054.
  7. Dale, D., A. Voronov, D. Dementieva, V. Logacheva, O. Kozlova, N. Semenov, A. Panchenko. Text Detoxification Using Large Pre-Trained Neural Models. – arXiv preprint arXiv:2109.08914. 2021. DOI: 10.18653/v1/2021.emnlp-main.629.
  8. Hamtini, T., A. J. Assaf. Exploring the Efficacy of GenAI in Grading SQL Query Tasks: A Case Study. – Cybernetics and Information Technologies, Vol. 3, 2024, No 3, pp. 102-111.
  9. Sourabrata, M., B. Akanksha, K. O. Atul, P. M. John, D. Ondrej. Text Detoxification as Style Transfer in English and Hindi. – In: Proc. of 20th International Conference on Natural Language Processing (ICON’23), December 2023, pp. 133-144. DOI: 10.48550/arXiv.2402.07767.
  10. Dementieva, D., N. Babakov, A. Panchenko. Multiparadetox: Extending Text Detoxification with Parallel Data to New Languages. – arXiv preprint arXiv:2404.02037. 2024. DOI: 10.18653/v1/2024.naacl-short.12.
  11. Rana, M. R. R., A. Nawaz, T. Ali, A. S. Alattas, D. S. AbdElminaam. Sentiment Analysis of Product Reviews Using Transformer Enhanced 1D-CNN and BiLSTM. – Cybernetics and Information Technologies, Vol. 24, 2024, No 3, pp. 112-131.
  12. Harisanty, D., N. E. V. Anna, R. Sugihartati, K. Srimulyo, M. F. B. Hamzah. Netizen Views on Artificial Intelligence: A Social Media Content Analysis. – Kurdish Studies, Vol. 12, 2024, No 1, pp. 365-376.
  13. Ibrohim, M. O., I. Budi. Hate Speech and Abusive Language Detection in Indonesian Social Media: Progress and Challenges. – Heliyon, Vol. 9, 2023, No 8. DOI: 10.1016/j.heliyon.2023.e18647.
  14. Logacheva, V., D. Dementieva, S. Ustyantsev, D. Moskovskiy, D. Dale, I. Krotova et al. Paradetox: Detoxification with Parallel Data. – In: Proc. of 60th Annual Meeting of the Association for Computational Linguistics, Vol. 1: Long Papers, May 2022, pp. 6804-6818. DOI: 10.18653/v1/2022.acl-long.469.
  15. Dementieva, D., S. Ustyantsev, D. Dale, O. Kozlova, N. Semenov, A. Panchenko, V. Logacheva. Crowdsourcing of Parallel Corpora: The Case of Style Transfer for Detoxification. – In: CSW@ VLDB, August 2021, pp. 35-49.
  16. Sari, D. A. P., A. Y. Putri, M. Hanggareni, A. Anjani, M. L. O. Siswondo, I. K. Raharjana. Crowdsourcing as a Tool to Elicit Software Requirements. – In: AIP Conference Proceedings. Vol. 2329. No 1. February 2021, 050001. AIP Publishing LLC. DOI: 10.1063/5.0042134.
  17. Romadhony, A., S. Al Faraby, R. Rismala, U. N. Wisesti, A. Arifianto. Sentiment Analysis on a Large Indonesian Product Review Dataset. – Journal of Information Systems Engineering & Business Intelligence, Vol. 10, 2024, No 1. DOI: 10.20473/jisebi.10.1.167-178.
  18. Logacheva, V., D. Dementieva, I. Krotova, A. Fenogenova, I. Nikishina, T. Shavrina, A. Panchenko. A Study on Manual and Automatic Evaluation for Text Style Transfer: The Case of Detoxification. – In: Proc. of 2nd Workshop on Human Evaluation of NLP Systems (HumEval’22), May 2022, pp. 90-101. DOI: 10.18653/v1/2022.humeval-1.8.
  19. Ibrohim, M. O., I. Budi. Multi-Label Hate Speech and Abusive Language Detection in Indonesian Twitter. – In: Proc. of 3rd Workshop on Abusive Language Online, August 2019, pp. 46-57. DOI: 10.18653/v1/w19-3506.
  20. Fakhruzzaman, M. N., S. W. Gunawan. CekUmpanKlik: An Artificial Intelligence-Based Application to Detect Indonesian Clickbait. – IAES International Journal of Artificial Intelligence, Vol. 11, 2022, No 4, 1232. DOI: 10.11591/ijai.v11.i4.pp1232-1238.
  21. Krishna, K., J. Wieting, M. Iyyer. Reformulating Unsupervised Style Transfer as Paraphrase Generation. – arXiv Preprint arXiv:2010.05700. 2020. DOI: 10.18653/v1/2020.emnlp-main.55.
  22. Moskovskiy, D., S. Pletenev, A. Panchenko. LLMs to Replace Crowdsourcing for Parallel Data Creation? The Case of Text Detoxification. –In: Proc. of Findings of the Association for Computational Linguistics (EMNLP’24), November 2024, pp. 14361-14373. DOI: 10.18653/v1/2024.findings-emnlp.839.
  23. Susanto, L., M. I. Wijanarko, P. A. Pratama, T. Hong, I. Idris, A. F. Aji, D. Wijaya. IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language. – arXiv Preprint arXiv:2406.19349. 2024. DOI: 10.48550/arXiv.2406.19349.
  24. Touvron, H., L. Martin, K. Stone, P. Albert, A. Almahairi, Y. Babaei, et al. Llama 2: Open Foundation and Fine-Tuned Chat Models. – arXiv Preprint arXiv:2307.09288. 2023. http://arxiv.org/abs/2307.09288
  25. GoToCompany, “Llama3 8B CPT Sahabat-AI v1 Instruct”. Online Accessed 20 February 2025. https://huggingface.co/GoToCompany/llama3-8b-cpt-sahabatai-v1-instruct
  26. Brown, T., B. Mann, N. Ryder, M. Subbiah, J. D. Kaplan, P. Dhariwal et al. Language Models Are Few-Shot Learners. – Advances in Neural Information Processing Systems, Vol. 33, 2020, pp. 1877-1901.
  27. Koto, F., A. Rahimi, J. H. Lau, T. Baldwin. IndoLEM and IndoBERT: A Benchmark Dataset and Pre-Trained Language Model for Indonesian NLP. – arXiv Preprint arXiv:2011.00677. 2020. DOI: 10.18653/v1/2020.coling-main.66.
  28. Luo, Y., Z. Yang, F. Meng, Y. Li, J. Zhou, Y. Zhang. An Empirical Study of Catastrophic Forgetting in Large Language Models during Continual Fine-Tuning. – arXiv Preprint arXiv:2308.08747. 2023.
  29. Ayele, A. A., N. Babakov, J. Bevendorff, X. B. Casals, B. Chulvi, D. Dementieva et al. Overview of PAN 2024: Multi-Author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification Condensed Lab Overview. – In: Proc. of International Conference of the Cross-Language Evaluation Forum for European Languages, September 2024, pp. 231-259. Cham, Switzerland, Springer Nature.
  30. Iglesias, M., O. Araque, C. Á. Iglesias. A Toxic Style Transfer Method Based on the Delete-Retrieve-Generate Framework Exploiting Toxic Lexicon Semantic Similarity. – Applied Sciences, Vol. 13, 2023, No 15, 8590. DOI: 10.3390/app13158590.
  31. Laugier, L., J. Pavlopoulos, J. Sorensen, L. Dixon. Civil Rephrases of Toxic Texts with Self-Supervised Transformers. – arXiv Preprint arXiv:2102.05456. 2021.
  32. Dementieva, D., N. Babakov, A. Ronen, A. A. Ayele, N. Rizwan, F. Schneider et al. Multilingual and Explainable Text Detoxification with Parallel Corpora. – arXiv preprint arXiv:2412.11691. 2024.
  33. Dementieva, D., V. Logacheva, I. Nikishina, A. Fenogenova, D. Dale, I. Krotova et al. Russe-2022: Findings of the First Russian Detoxification Shared Task Based on Parallel Corpora. – Computational Linguistics and Intellectual Technologies, 2022. DOI: 10.28995/2075-7182-2022-21-114-131.
  34. Krause, B., A. D. Gotmare, B. McCann, N. S. Keskar, S. Joty, R. Socher, N. F. Rajani. Gedi: Generative Discriminator Guided Sequence Generation. – arXiv Preprint arXiv:2009.06367. 2020. DOI: 10.18653/v1/2021.findings-emnlp.424.
  35. Roiqoh, S., B. Zaman, K. Kartono. Analisis Sentimen Berbasis Aspek Ulasan Aplikasi Mobile JKN Dengan Lexicon Based dan Naïve Bayes. – Jurnal Media Informatika Budidarma, Vol. 7, 2023, No 3, pp. 1582-1592.
DOI: https://doi.org/10.2478/cait-2025-0019 | Journal eISSN: 1314-4081 | Journal ISSN: 1311-9702
Language: English
Page range: 3 - 21
Submitted on: May 2, 2025
Accepted on: Jun 26, 2025
Published on: Sep 25, 2025
Published by: Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2025 Badrus Zaman, Naufal Humam, Indra Kharisma Raharjana, published by Bulgarian Academy of Sciences, Institute of Information and Communication Technologies
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.