Have a personal or library account? Click to login
Machine learning methods for toxic comment classification: a systematic review Cover

Machine learning methods for toxic comment classification: a systematic review

By: Darko Andročec  
Open Access
|Jan 2021

References

  1. [1] H. Almerekhi, H. Kwak, J. Salminen, B. J. Jansen, Are These Comments Triggering? Predicting Triggers of Toxicity in Online Discussions, Proceedings of The Web Conference 2020, Taipei, Taiwan, Apr. 2020, pp. 3033–3040. ⇒20810.1145/3366423.3380074
  2. [2] M. Anand, R. Eswari, Classification of Abusive Comments in Social Media using Deep Learning, 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), Erode, India, Mar. 2019, pp. 974–977. ⇒20810.1109/ICCMC.2019.8819734
  3. [3] A. Bleiweiss, LSTM neural networks for transfer learning in online moderation of abuse context, ICAART 2019 - Proceedings of the 11th International Conference on Agents and Artificial Intelligence, Prague, Czech Republic,2019, pp. 112–122. ⇒20810.5220/0007358701120122
  4. [4] É. Brassard-Gourdeau, R. Khoury, Using Sentiment Information for Preemptive Detection of Toxic Comments in Online Conversations, ArXiv200610145 Cs, Jun. 2020, Accessed: Jul. 03, 2020. http://arxiv.org/abs/2006.10145.⇒208
  5. [5] S. Carta, A. Corriga, R. Mulas, D. R. Recupero, R. Saia, A supervised multi-class multi-label word embeddings approach for toxic comment classification, IC3K 2019 - Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, Vienna, Austria, 2019, pp. 105–112. ⇒20810.5220/0008110901050112
  6. [6] A. G. D’Sa, I. Illina, D. Fohr, Towards non-toxic landscapes: Automatic toxic comment detection using DNN, ArXiv191108395 Cs Stat, Nov. 2019, Accessed: Jul. 03, 2020. http://arxiv.org/abs/1911.08395.⇒208
  7. [7] S. Deshmukh, R. Rade, Tackling Toxic Online Communication with Recurrent Capsule Networks, 2018 Conference on Information and Communication Technology (CICT), Jabalpur, India, 2018. ⇒20810.1109/INFOCOMTECH.2018.8722433
  8. [8] A. Elnaggar, B. Waltl, I. Glaser, J. Landthaler, E. Scepankova, F. Matthes, Stop Illegal Comments: A Multi-Task Deep Learning Approach, ACM International Conference Proceeding Series, 2018, pp. 41–47. ⇒20810.1145/3299819.3299845
  9. [9] S. V. Georgakopoulos, S. K. Tasoulis, A. G. Vrahatis, V. P. Plagianakos, Convolutional Neural Networks for Toxic Comment Classification, Proceedings of the 10th Hellenic Conference on Artificial Intelligence, Patras, Greece, Jul. 2018, pp. 1–6. ⇒20810.1145/3200947.3208069
  10. [10] G. Haralabopoulos, I. Anagnostopoulos, D. McAuley, Ensemble Deep Learning for Multilabel Binary Classification of User-Generated Content, Algorithms, 13, 4 (2020). ⇒20810.3390/a13040083
  11. [11] O. Hosam, Toxic comments identification in arabic social media, Int. J. Comput. Inf. Syst. Ind. Manag. Appl., 11, (2019) 219–226. ⇒208
  12. [12] M. Ibrahim, M. Torki, N. El-Makky, Imbalanced Toxic Comments Classification using Data Augmentation and Deep Learning, Proceedings - 17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018, Orlando, USA, 2018, pp. 875–878. ⇒20810.1109/ICMLA.2018.00141
  13. [13] E. Jain et al., Adversarial Text Generation for Google’s Perspective API, 2018 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, USA, Dec. 2018, pp. 1136–1141. ⇒20810.1109/CSCI46756.2018.00220
  14. [14] Jigsaw, Data for Toxic Comment Classification Challenge. https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge/data
  15. [15] A. N. M. Jubaer, A. Sayem, Md. A. Rahman, Bangla Toxic Comment Classification (Machine Learning and Deep Learning Approach), 2019 8th International Conference System Modeling and Advancement in Research Trends (SMART), Moradabad, India, Nov. 2019, pp. 62–66. ⇒210, 21210.1109/SMART46866.2019.9117286
  16. [16] B. Kitchenham, S. Charters, Guidelines for performing Systematic Literature Reviews in Software Engineering(2007). ⇒208
  17. [17] V. Kumar, B. K. Tripathy, Detecting Toxicity with Bidirectional Gated Recurrent Unit Networks, Adv. Intell. Syst. Comput., vol. 1034,(2020) 591–600. ⇒206
  18. [18] S. Mestry, H. Singh, R. Chauhan, V. Bisht, K. Tiwari, Automation in Social Networking Comments With the Help of Robust fastText and CNN, 2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT), Chennai, India, Apr. 2019, pp. 1–4. ⇒20810.1109/ICIICT1.2019.8741503
  19. [19] F. Mohammad, Is preprocessing of text really worth your time for toxic comment classification?, CSCE 2018 - Proceedings of the 2018 International Conference on Artificial Intelligence, ICAI 2018, Las Vegas, USA, 2018, pp. 447–453. ⇒208
  20. [20] J. Moon, W. I. Cho, J. Lee, BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection, ArXiv200512503 Cs, May 2020, Accessed: Jul. 03, 2020. http://arxiv.org/abs/2005.12503.⇒208
  21. [21] S. Morzhov, Avoiding Unintended Bias in Toxicity Classification with Neural Networks, 2020 26th Conference of Open Innovations Association (FRUCT), Yaroslavl, Russia, Apr. 2020, pp. 314–320. ⇒20810.23919/FRUCT48808.2020.9087368
  22. [22] D. Noever, Machine Learning Suites for Online Toxicity Detection, ArXiv181001869 Cs Stat, Oct. 2018, Accessed: Jul. 03, 2020. http://arxiv.org/abs/1810.01869.⇒208
  23. [23] A. P. Patil, A. Mohammed, G. Elachitaya, M. Tiwary, Practical Significance of GA PartCC in Multi-Label Classification, Proceedings of the 2019 Ieee Region 10 Conference (tencon 2019): Technology, Knowledge, and Society, Kerala, India, 2019, pp. 2487–2490. ⇒20810.1109/TENCON.2019.8929317
  24. [24] Rahul, H. Kajla, J. Hooda, G. Saini, Classification of Online Toxic Comments Using Machine Learning Algorithms, 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, May 2020, pp. 1119–1123. ⇒20810.1109/ICICCS48265.2020.9120939
  25. [25] E. Reichert, H. Qiu, J. Bayrooti, Reading Between the Demographic Lines: Resolving Sources of Bias in Toxicity Classifiers, ArXiv200616402 Cs, Jun. 2020, Accessed: Jul. 03, 2020. http://arxiv.org/abs/2006.16402.⇒208
  26. [26] M. Rybinski, W. Miller, J. Del Ser, M. Nekane Bilbao, J. F. Aldana-Montes, On the Design and Tuning of Machine Learning Models for Language Toxicity Classification in Online Platforms, Intelligent Distributed Computing Xii, 798 (2018), pp. 329–343. ⇒208
  27. [27] H. H. Saeed, K. Shahzad, F. Kamiran, Overlapping Toxic Sentiment Classification using Deep Neural Architectures, 2018 18th Ieee International Conference on Data Mining Workshops (icdmw), Sentosa, Singapore, 2018, pp. 1361–1366. ⇒20810.1109/ICDMW.2018.00193
  28. [28] M. A. Saif, A. N. Medvedev, M. A. Medvedev, T. Atanasova, Classification of Online Toxic Comments Using the Logistic Regression and Neural Networks Models, Proceedings of the 44th International Conference Applications of Mathematics in Engineering and Economics, Sozopol, Bulgaria, 2018. ⇒20810.1063/1.5082126
  29. [29] S. Shtovba, O. Shtovba, M. Petrychko, Detection of social network toxic comments with usage of syntactic dependencies in the sentences, CEUR Workshop Proceedings, Otzenhausen, Germany, 2019, pp. 313–323. ⇒20810.32782/cmis/2353-25
  30. [30] S. Srivastava, P. Khurana, Detecting Aggression and Toxicity using a Multi Dimension Capsule Network. Stroudsburg: Assoc Computational Linguistics-Acl, 2019, pp. 157–162. ⇒208 ⇒20810.18653/v1/W19-3517
  31. [31] A. Vaidya, F. Mai, Y. Ning, Empirical Analysis of Multi-Task Learning for Reducing Model Bias in Toxic Comment Detection, ArXiv190909758 Cs, Mar. 2020, Accessed: Jul. 03, 2020. http://arxiv.org/abs/1909.09758.⇒208
  32. [32] B. van Aken, J. Risch, R. Krestel, A. Löser, Challenges for Toxic Comment Classification: An In-Depth Error Analysis, ArXiv180907572 Cs, Sep. 2018, Accessed: Jul. 03, 2020. http://arxiv.org/abs/1809.07572.⇒208
  33. [33] M. Yao, C. Chelmis, D.-S. Zois, Cyberbullying Ends Here: Towards Robust Detection of Cyberbullying in Social Media, The Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019, San Francisco, USA, 2019, pp. 3427–3433. ⇒20810.1145/3308560.3316474
Language: English
Page range: 205 - 216
Submitted on: Jul 23, 2020
|
Accepted on: Oct 5, 2020
|
Published on: Jan 29, 2021
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2021 Darko Andročec, published by Sapientia Hungarian University of Transylvania
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.