Projects per year
Abstract
Sentiment analysis (SA) is the key element for a variety of opinion and attitude mining tasks. While various unsupervised SA tools already exist, a central problem is that they are lexicon-based where the lexicons used are limited, leading to a vocabulary mismatch. In this paper, we present an unsupervised word embedding-based sentiment scoring framework for sentiment intensity scoring (SIS). The framework generalizes and combines past works so that pre-existing lexicons (e.g. VADER, LabMT) and word embeddings (e.g. BERT, RoBERTa) can be used to address this problem, with no require training, and while providing fine grained SIS of words and phrases. The framework is scalable and extensible, so that custom lexicons or word embeddings can be used to core methods, and to even create new corpus specific lexicons without the need for extensive supervised learning and retraining. The Python 3 toolkit is open source, freely available from GitHub (https://github.com/cumulative-revelations/awessome ) and can be directly installed via pip install awessome.
Original language | English |
---|---|
Title of host publication | Advances in Information Retrieval - 43rd European Conference on IR Research, ECIR 2021, Proceedings |
Editors | Djoerd Hiemstra, Marie-Francine Moens, Josiane Mothe, Raffaele Perego, Martin Potthast, Fabrizio Sebastiani |
Place of Publication | Cham, Switzerland |
Publisher | Springer |
Chapter | 56 |
Pages | 509-513 |
Number of pages | 5 |
Volume | 12657 |
ISBN (Electronic) | 9783030722401 |
ISBN (Print) | 9783030722395 |
DOIs | |
Publication status | Published - 28 Mar 2021 |
Event | European Conference on Information Retrieval 2021 - Online, Lucca, Italy Duration: 28 Mar 2021 → 1 Apr 2021 Conference number: 43 https://www.ecir2021.eu/ |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12657 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | European Conference on Information Retrieval 2021 |
---|---|
Abbreviated title | ECIR 2021 |
Country/Territory | Italy |
City | Lucca |
Period | 28/03/21 → 1/04/21 |
Internet address |
Keywords
- sentiment intensity
- pre-trained language model
- lexicon
- BERT
- VADER
Fingerprint
Dive into the research topics of 'AWESSOME: An unsupervised sentiment intensity scoring framework using neural word embeddings'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Cumulative Revelations in Personal Data
EPSRC (Engineering and Physical Sciences Research Council)
1/04/19 → 31/03/22
Project: Research