Automatic sentiment analysis in Honduran political tweets

Authors

  • Nicole Rodríguez Alcántara, Facultad de Ingeniería, Universidad Tecnológica Centroamericana, UNITEC, San Pedro Sula, Honduras
  • Angella Falck Durán, Facultad de Ingeniería, Universidad Tecnológica Centroamericana, UNITEC, San Pedro Sula, Honduras
  • Sergio Antonio Suazo Barahona, Facultad de Ingeniería, Universidad Tecnológica Centroamericana, UNITEC, Tegucigalpa, Honduras

DOI:

https://doi.org/10.5377/innovare.v11i3.15349

Keywords:

Sentiment analysis, Supervised machine learning, Honduran politics, Natural language processing, Twitter

Abstract

Introduction. Twitter has become a medium through which citizens express themselves on politics, conveying their feelings and opinions in tweets. Analyzing this data makes it possible to identify trends and turning points in political opinion. The aim of the study was to develop an automatic sentiment analysis process for Honduran political tweets using supervised machine learning techniques. Methods. A total of 1,800 Honduran political tweets posted between January and September 2022 were collected using filters based on users and hashtags, and the tweets were then labeled manually. Two natural language processing techniques were applied to represent the text: Bag of Words (BOW) and term frequency-inverse document frequency (TF-IDF). The classifiers considered were linear SVM, logistic regression, and multinomial Naïve Bayes (MNB). The metrics used to compare the classifiers were F1-score (the harmonic mean of precision and recall), accuracy, and time (training and validation). Results. MNB was selected because it had the highest F1-score (62.48%) and the shortest training time; linear SVM obtained 61.80% and logistic regression 61.34%. On new tweets, the final MNB model achieved an F1-score of 63.37%. Conclusion. For the data set presented, MNB was the best classifier. However, the performance gap between the classifiers is small, which suggests that preprocessing optimizations and larger-scale data collection should be considered.
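
As an illustration of the approach summarized above, the following is a minimal sketch of a Bag of Words representation feeding a multinomial Naïve Bayes classifier, scored with F1. It is not the authors' implementation: the abstract does not state which libraries, preprocessing steps, class labels, or F1 averaging mode were used, so the whitespace tokenizer, the "positive"/"negative" labels, the Laplace smoothing value, the macro averaging, and the toy English sentences are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase whitespace tokenization. A real pipeline would
    # also handle mentions, hashtags, URLs, and Spanish stop words.
    return text.lower().split()


class MultinomialNB:
    """Multinomial Naive Bayes over bag-of-words counts (illustrative)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing strength (assumed value)

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)  # label -> word counts
        for doc, label in zip(docs, labels):
            self.word_counts[label].update(tokenize(doc))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict_one(self, doc):
        n_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            total = sum(self.word_counts[label].values())
            denom = total + self.alpha * len(self.vocab)
            score = math.log(n / n_docs)  # log prior
            for w in tokenize(doc):
                if w in self.vocab:  # ignore words never seen in training
                    count = self.word_counts[label][w]
                    score += math.log((count + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label

    def predict(self, docs):
        return [self.predict_one(d) for d in docs]


def macro_f1(y_true, y_pred):
    # Per-class F1 is the harmonic mean of precision and recall;
    # macro averaging (an assumption here) weights each class equally.
    scores = []
    for label in sorted(set(y_true)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            scores.append(2 * precision * recall / (precision + recall))
        else:
            scores.append(0.0)
    return sum(scores) / len(scores)


# Hypothetical toy data, invented for this sketch (not from the study).
train_docs = [
    "great progress for the country",
    "proud of this new law",
    "terrible corruption scandal",
    "shameful decision by congress",
]
train_labels = ["positive", "positive", "negative", "negative"]

model = MultinomialNB().fit(train_docs, train_labels)
test_docs = ["proud of the progress", "shameful corruption"]
test_labels = ["positive", "negative"]
predictions = model.predict(test_docs)
score = macro_f1(test_labels, predictions)
```

A TF-IDF variant would replace the raw counts with weights that discount words appearing in many tweets. The same `macro_f1` function could then be used to compare the resulting classifiers on a held-out set, which mirrors the comparison described in the abstract.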

Published

2022-12-08

How to Cite

Rodríguez Alcántara, N., Falck Durán, A., & Suazo Barahona, S. A. (2022). Automatic sentiment analysis in Honduran political tweets. Innovare: Revista De Ciencia Y tecnología, 11(3), 158–165. https://doi.org/10.5377/innovare.v11i3.15349

Section

Original article