An evaluation of machine learning and latent semantic analysis in text sentiment classification

Miazga, Justyna; Hachaj, Tomasz

An evaluation of machine learning and latent semantic analysis in text sentiment classification

Technical Transactions

Volume 117 (2020): Issue 1 (January 2020)

By:

Justyna Miazga

and Tomasz Hachaj

Open Access

|Oct 2020

Abstract

In this paper, we compare the following machine learning methods as classifiers for sentiment analysis: k – nearest neighbours (kNN), artificial neural network (ANN), support vector machine (SVM), random forest. We used a dataset containing 5,000 movie reviews in which 2,500 were marked as positive and 2,500 as negative. We chose 5,189 words which have an influence on sentence sentiment. The dataset was prepared using a term document matrix (TDM) and classical multidimensional scaling (MDS). This is the first time that TDM and MDS have been used to choose the characteristics of text in sentiment analysis. In this case, we decided to examine different indicators of the specific classifier, such as kernel type for SVM and neighbour count in kNN. All calculations were performed in the R language, in the program R Studio v 3.5.2. Our work can be reproduced because all of our data sets and source code are public.

DOI: https://doi.org/10.37705/TechTrans/e2020030 | Journal eISSN: 2353-737X | Journal ISSN: 0011-4561

Journal RSS Feed

Language: English

Submitted on: Jun 20, 2020

Accepted on: Sep 22, 2020

Published on: Oct 1, 2020

Published by: Cracow University of Technology

In partnership with: Paradigm Publishing Services

Publication frequency: 1 times per year

Keywords:

sentiment analysis,

machine learning,

sentiment classification,

K-nearest neighbours,

neural network,

support vector machine

Related subjects:

Architecture and design,

Architecture,

Architects, buildings,

Engineering,

Mechanical engineering,

Mechanics,

Industrial chemistry,

Chemical engineering,

Materials sciences,

Materials sciences, other

© 2020 Justyna Miazga, Tomasz Hachaj, published by Cracow University of Technology
This work is licensed under the Creative Commons Attribution-ShareAlike 4.0 License.

Previous article Volume 117 (2020): Issue 1 (January 2020)Next article