Have a personal or library account? Click to login
Tackling the Problem of Class Imbalance in Multi-class Sentiment Classification: An Experimental Study Cover

Tackling the Problem of Class Imbalance in Multi-class Sentiment Classification: An Experimental Study

By: Mateusz Lango  
Open Access
|Jun 2019

Abstract

Sentiment classification is an important task which gained extensive attention both in academia and in industry. Many issues related to this task such as handling of negation or of sarcastic utterances were analyzed and accordingly addressed in previous works. However, the issue of class imbalance which often compromises the prediction capabilities of learning algorithms was scarcely studied. In this work, we aim to bridge the gap between imbalanced learning and sentiment analysis. An experimental study including twelve imbalanced learning preprocessing methods, four feature representations, and a dozen of datasets, is carried out in order to analyze the usefulness of imbalanced learning methods for sentiment classification. Moreover, the data difficulty factors — commonly studied in imbalanced learning — are investigated on sentiment corpora to evaluate the impact of class imbalance.

DOI: https://doi.org/10.2478/fcds-2019-0009 | Journal eISSN: 2300-3405 | Journal ISSN: 0867-6356
Language: English
Page range: 151 - 178
Submitted on: Jan 18, 2019
Accepted on: Feb 24, 2019
Published on: Jun 6, 2019
Published by: Poznan University of Technology
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2019 Mateusz Lango, published by Poznan University of Technology
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.