SciELO - Scientific Electronic Library Online

 
vol.26 issue1Learning an Artificial Neural Network for Discovering Combinations of Bit-Quads to Compute the Euler Characteristic of a 2-D Binary ImageProposal for a Forensic Action Guide for Internet of Things (IoT) Environments author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

AKHMETOV, Iskander; GELBUKH, Alexander  and  MUSSABAYEV, Rustam. Topic-Aware Sentiment Analysis of News Articles. Comp. y Sist. [online]. 2022, vol.26, n.1, pp.423-439.  Epub Aug 08, 2022. ISSN 2007-9737.  https://doi.org/10.13053/cys-26-1-4179.

We consider the problem of sentiment analysis in news media articles cast as a three-way classification task: negative, positive, or neutral. We show that subdividing the training corpus by topic (local news, sports, hi-tech, and others) and training separate sentiment classifiers for each sub-corpus improves classification F1 scores. We use topics since some words carry different sentiments in different domains: e.g., the word “force” is typically positive in the sports domain but negative in the political domain. Our experiments on the Kaggle dataset with sentiment-labeled Kazakhstani news articles in Russian language using the Convolutional Neural Network (CNN) model partially proved our hypothesis, showing that for the most prominent “kz” (local news) topic, we achieve an F1 score of 0.70, which is greater than the baseline model trained without the topic-awareness showing just 0.67. Topic-aware improves F1 scores in some topics, but due to the topic/class imbalance further research is needed. However, the performance in terms of F1 over all the corpus does not improve or the improvements are very small. Moreover, our approach shows better results on topics with many text samples than those with relatively small amounts of articles.

Keywords : Mass media; natural language processing; news articles; sentiment analysis.

        · text in English     · English ( pdf )