Opinion mining and sentiment analysis
Bo Pang and Lillian Lee
Foundations and Trends in Information Retrieval 2(1-2),
pp. 1–135, 2008.
Also available as a book or e-book.
Abstract:
An important part of our information-gathering behavior has
always been to
find out what other people think. With the growing availability
and popularity of opinion-rich resources such as online review sites
and personal blogs, new opportunities and challenges arise as people
can, and do, actively use information technologies to seek out and
understand the opinions of others. The sudden eruption of activity in
the area of opinion mining and sentiment analysis, which deals with
the computational treatment of opinion, sentiment, and subjectivity in
text, has thus occurred at least in part as a direct response to the
surge of interest in new systems that deal directly with opinions as a
first-class object.
This survey covers techniques and approaches
that promise to directly enable opinion-oriented information-seeking
systems.
Our focus is on methods
that seek to address the new challenges raised by sentiment-aware
applications, as compared to those that are already present in more traditional fact-based analysis.
We include material on summarization of evaluative text and on
broader issues regarding privacy, vulnerability to manipulation, and
economic impact that the development of opinion-oriented
information-access services gives rise to.
To facilitate future work, a discussion of available resources,
benchmark datasets, and evaluation campaigns is also provided.
Mentions: “Congratulations”
(Matthew Hurst) | “THE survey to
read”, “tremendous resource” (Jeffrey Carr) |
“Excellent” (George
Tziralis) | “excellent points” (Jessica Hullman)
Table of Contents:
- Introduction
- The demand for information on opinions and sentiment
- What might be involved? An example examination of the construction of an opinion/review search engine
- Our charge and approach
- Early history
- A note on terminology: Opinion mining, sentiment analysis, subjectivity, and all that
- Applications
- Applications to review-related websites
- Applications as a sub-component technology
- Applications in business and government intelligence
- Applications across different domains
- General Challenges
- Contrasts with standard fact-based textual analysis
- Factors that make opinion mining difficult
- Classification and Extraction
Part One: Fundamentals
- Problem formulations and key concepts
- Sentiment polarity and degrees of positivity
- Subjectivity detection and opinion identification
- Joint topic-sentiment analysis
- Viewpoints and perspectives
- Other non-factual information in text
- Features
- Term presence vs. frequency
- Term-based features beyond term unigrams
- Parts of speech
- Syntax
- Negation
- Topic-oriented features
Part Two: Approaches
- The impact of labeled data
- Domain adaptation and topic-sentiment interaction
- Domain considerations
- Topic (and sub-topic or feature) considerations
- Unsupervised approaches
- Unsupervised lexicon induction
- Other unsupervised approaches
- Classification based on relationship information
- Relationships between sentences and between documents
- Relationships between discourse participants
- Relationships between product features
- Relationships between classes
- Incorporating discourse structure
- Language models
- Special considerations for extraction
- Identifying product features and opinions in reviews
- Problems involving opinion holders
- Summarization
- Single-document opinion-oriented summarization
- Multi-document opinion-oriented summarization
- Some problem considerations
- Textual summaries
- Non-textual summaries
- Review(er) quality
- Broader Implications
- Economic impact of reviews
- Surveys summarizing relevant economic literature
- Economic-impact studies employing automated text analysis
- Interactions with word of mouth (WOM)
- Implications for manipulation
- Publicly Available Resources
- Datasets
- Acquiring labels for data
- An annotated list of datasets
- Evaluation campaigns
- TREC opinion-related competitions
- NTCIR opinion-related competitions
- Lexical resources
- Tutorials, bibliographies, and other references
- Concluding Remarks
- References
Lillian Lee's home page | Lillian
Lee's co-authored papers on
sentiment analysis