Title: Learning Subjective Adjectives From Corpora
1Learning Subjective Adjectives From Corpora
- Janyce M. Wiebe
- New Mexico State University
Office of Naval Research grant N00014-95-1-0776.
2Introduction
- Learning evaluation and opinion clues
- Distributional similarity process
- Refinement with lexical features
- Improved results from both
3Subjectivity Tagging
- Recognizing opinions and evaluations
- (Subjective sentences) as opposed to
- material objectively presented as true
- (Objective sentences)
-
Banfield 1985, Fludernik 1993, Wiebe 1994
4Examples
- At several different levels, its a fascinating
tale. subjective - Bell Industries Inc. increased its quarterly to
10 cents from 7 cents a share. - objective
5Types
Complained You Idiot! Terrible product
6Subjectivity
?
Complained You Idiot! Terrible product
Speculated Maybe
7Subjectivity
- Same word, different types
- Great majority objective
- Great! positive evaluative
- Just great. negative evaluative
8Subjectivity
- Multiple types, sources, targets
- Its the best!, he gushed.
-
Writer He It
9Applications Flame Recognition
10Review Mining
From HoodoogthoodooBUGZAPPER_at_newnorth.netgt Newsgro
ups rec.gardens Subject Re Garden software I
bought a copy of Garden Encyclopedia from
Sierra. Well worth the time and money.
11Information Extraction
- Northwest Airlines settled the remaining
lawsuits, a federal judge said. objective - The cost of health care is eroding our standard
of living and sapping industrial strength,
complains Maher. subjective
12Other Applications
- Clustering documents by ideology
- Text summarization
- Style in machine translation and generation
13Overview
- Identify large set of candidate clues
- Existing resources are not sufficient
- Not consistently marked for subjectivity
- Not customized to the genre
- Learn lexical clues from corpora
14Corpus and Annotation
- Subjectivity tags assigned by multiple annotators
to 1001 WSJ sentences - Tags representing consensus opinions obtained
with EM
Wiebe et al. 1999 Bruce Wiebe 1999
15Adjectives
- Classifications correlated with adjectives
- Adjectives extracted from annotations
- They promised e 2 yet more for
- e 3 really good e? 1 stuff.
- "It's e? 3 really e- 3 bizarre," says Albert
Lerman, creative director at the Wells agency.
16Lins Distributional Similarity
Word R W I R1
have have R2 dog brown R3 dog
. . .
Lin 1998
17Lins Distributional Similarity
Word1
Word2
R W R W R W
R W R W R W
R W R W
R W R W
R W R W
18Bizarre
strange similar scary unusual
fascinating interesting curious tragic
different contradictory peculiar silly sad
absurd poignant crazy funny comic
compelling odd
19Bizarre
strange similar scary unusual
fascinating interesting curious tragic
different contradictory peculiar silly sad
absurd poignant crazy funny comic
compelling odd
20Bizarre
strange similar scary unusual
fascinating interesting curious tragic
different contradictory peculiar silly sad
absurd poignant crazy funny comic
compelling odd
21Bizarre
strange similar scary unusual
fascinating interesting curious tragic
different contradictory peculiar silly sad
absurd poignant crazy funny comic
compelling odd
22Good
- bad better best nice poor
- terrific great decent lousy
- dismal excellent positive exciting
- fantastic marvelous strong
- important dumb fair healthy
23Good
- bad better best nice poor
- terrific great decent lousy
- dismal excellent positive exciting
- fantastic marvelous strong
- important dumb fair healthy
24Experiments
25Experiments
Separate corpus
Distributional similarity
Seeds
26Experiments
Separate corpus
Distributional similarity
Seeds
Filtering
S gt Adj gt Majority
27Lexical features
- Polarity and Gradability
- Learned from corpora
- Statistical processing informed by linguistic
insights - Different data sets used
28Gradability
More additional Very additional
Hatzivassiloglou Wiebe 2000
29Polarity
Corrupt and brutal Corrupt but brutal
Hatzivassiloglou McKeown 1997
30Experiments
Distributional similarity
Seeds
Filtering
31Experiments
Distributional similarity
Seeds
Filtering
Lexical Classification
32Results
Seed 7.5
33Results
Lex
Seed Lex Pol,Grad 6.4
18.0 Pol-, Grad
19.9 21.4 Pol,- Grad
8.4 18.2
Seed 7.5
34Future Work
- Apply process to Netnews and Listservs
- Apply word-sense disambiguation techniques to
potentially subjective expressions - Flame recognition and review mining
35Conclusions
- Learning linguistic knowledge from corpora for a
pragmatic task - Linguistic information
- Manual annotation
- Linguistic constraints
- Processes improve each other
36Application 1 Flame recognition
From pattreck_at_aol.com (PattReck) Newsgroups
rec.gardens.roses Subject Re red(as in wine
red) roses My two favorite old reds Cramoisi
Superieure, especially great climbing, and
Francis Dubreuil. Also Prospero does well
in southern California - aren't you on the west
coast? -- Candace
37Flames (continued)
From Suzanne ltSuzanne_member_at_newsguy.comgt Newsgro
ups rec.gardens.roses Subject Re BILL
WARRINER!!!!
gtgtWow. You guys are really working poor Suzanne
over. gtpo thang. I thank she been workin over
her bottle of Kahlua. Up !!! I've been
working at a job - no Kahlua! You are
a snow-snorting dust-bowl dweller, the dustiest
of the dusties. Bill Bradley has the support of
the "environmentalists" ha ha ha!
38Likely
- likely possible willing probable
- receptive unlikely able logical
- rumored potential counterproductive
- moot significant hesitant worthy
- unwilling probably desirable
- weak forthcoming imminent