Abstract
Twitter has emerged as a major social media platform and generated great interest from sentiment analysis researchers. Despite this attention, state-of-the-art Twitter sentiment analysis approaches perform relatively poorly with reported classification accuracies often below 70%, adversely impacting applications of the derived sentiment information. In this research, we investigate the unique challenges presented by Twitter sentiment analysis and review the literature to determine how the devised approaches have addressed these challenges. To assess the state-of-the-art in Twitter sentiment analysis, we conduct a benchmark evaluation of 28 top academic and commercial systems in tweet sentiment classification across five distinctive data sets. We perform an error analysis to uncover the causes of commonly occurring classification errors. To further the evaluation, we apply select systems in an event detection case study. Finally, we summarize the key trends and takeaways from the review and benchmark evaluation and provide suggestions to guide the design of the next generation of approaches.
Original language | English (US) |
---|---|
Article number | 5 |
Journal | ACM Transactions on Management Information Systems |
Volume | 9 |
Issue number | 2 |
DOIs | |
State | Published - Apr 2018 |
Keywords
- Benchmark evaluation
- Natural language processing
- Opinion mining
- Sentiment analysis
- Social media
- Text mining
ASJC Scopus subject areas
- Management Information Systems
- Computer Science(all)