Sentiment intensity prediction using neural word embeddings

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

1 Downloads (Pure)

Abstract

Sentiment analysis is central to the process of mining opinions and attitudes from online texts. While much attention has been paid to the sentiment classification problem, much less work has tried to tackle the problem of predicting the intensity of the sentiment. The go to method is VADER --- an unsupervised lexicon based approach to scoring sentiment. However, such approaches are limited because of the vocabulary mismatch problem. In this paper, we present in detail and evaluate our AWESSOME framework (A Word Embedding Sentiment Scorer Of Many Emotions) for sentiment intensity scoring, that capitalizes on pre-existing lexicons, does not require training and provides fine grained and accurate sentiment intensity scores of words, phrases and text. In our experiments, we used seven Sentiment Collections to evaluate the proposed approach, against lexicon based approaches (e.g., VADER), and supervised methods such as deep learning based approaches (e.g., SentiBERT). The results show that despite not surpassing supervised approaches, the AWESSOME unsupervised approach significantly outperforms existing lexicon approaches and therefore provides a simple and effective approach for sentiment analysis. The AWESSOME framework can be flexibly adapted to cater for different seed lexicons and different neural word embeddings models in order to produce corpus specific lexicons -- without the need for extensive supervised learning or retraining.
Original languageEnglish
Title of host publicationICTIR '21 : Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval
Place of PublicationNew York, NY.
Pages93-102
Number of pages10
ISBN (Electronic)9781450386111
DOIs
Publication statusPublished - 11 Jul 2021
Event7th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2017 - Amsterdam, Netherlands
Duration: 1 Oct 20174 Oct 2017

Conference

Conference7th ACM SIGIR International Conference on the Theory of Information Retrieval, ICTIR 2017
Country/TerritoryNetherlands
CityAmsterdam
Period1/10/174/10/17

Keywords

  • sentiment intensity
  • pre-trained model language
  • lexicons
  • BERT
  • VADER
  • LabMT

Fingerprint

Dive into the research topics of 'Sentiment intensity prediction using neural word embeddings'. Together they form a unique fingerprint.

Cite this