Distributional Clustering of English Words - PowerPoint PPT Presentation

Transcript and Presenter's Notes
1
Distributional Clustering of English Words
  • Fernando Pereira - AT&T Bell Laboratories
  • Naftali Tishby - Dept. of Computer Science, Hebrew
    University
  • Lillian Lee - Dept. of Computer Science, Cornell
    University
  • Presenter - Juan Ramos, Dept. of Computer Science,
    Rutgers University, juramos@cs.rutgers.edu

2
Overview
  • Purpose: evaluate a method for clustering words
    according to their distribution in particular
    syntactic contexts.
  • Methodology: find lowest-distortion sets of
    clusters of words to determine models of word
    co-occurrence.

3
Applications
  • Scientific POV: lexical acquisition of words
  • Practical POV: classification addresses data
    sparseness in grammar models.
  • Applicable to clustering in a large corpus of documents

4
Definitions
  • Context: function of a given word in its sentence.
  • E.g., a noun as a direct object
  • Sense class: hidden model describing word
    association tendencies
  • Mix of cluster and cluster probability given a
    word
  • Cluster: probabilistic concept of a sense class

5
Problem Setting
  • Restrict the problem to verbs (V) and nouns (N) in the
    main verb-direct object relationship
  • f(v, n): frequency of occurrence of the verb-noun
    pair
  • Text must be pre-formatted to fit specifications
  • For a given noun n, the conditional distribution is
    p(v | n) = f(v, n) / sum over v' of f(v', n)
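A minimal sketch of this estimate, using hypothetical pair counts in place of the parsed verb-direct-object data:

```python
from collections import Counter

# Hypothetical (verb, noun) pairs standing in for the parsed
# verb-direct-object data described above.
pairs = [("fire", "gun"), ("fire", "gun"), ("throw", "gun"),
         ("fire", "employee"), ("hire", "employee")]

f = Counter(pairs)  # f(v, n): frequency of each (verb, noun) pair

def p_v_given_n(v, n):
    """p(v | n) = f(v, n) / sum over v' of f(v', n)."""
    total = sum(cnt for (v2, n2), cnt in f.items() if n2 == n)
    return f[(v, n)] / total if total else 0.0

print(p_v_given_n("fire", "gun"))  # 2/3: "gun" occurs twice with "fire"
```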

6
Problem Setting cont.
  • Goal: create a set C of clusters and probabilities
    p(c | n).
  • Each c in C is associated with a cluster centroid p_c
  • p_c: average of the distributions p_n over all n in N.

7
Distributional Similarity
  • Given two distributions p, q, the KL distance is
    D(p || q) = sum over x of p(x) log(p(x) / q(x))
  • D(p || q) = 0 implies p = q
  • Small D(p || q) implies the two distributions are
    likely instances of a centroid p_c.
  • D(p || q) measures the loss of information from
    using q in place of p.
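The KL distance above can be sketched directly, assuming the usual conventions that terms with p(x) = 0 contribute nothing and that the divergence is infinite when q lacks support where p has it:

```python
import math

def kl(p, q):
    """D(p || q) = sum over x of p(x) * log(p(x) / q(x)).
    Terms with p(x) == 0 contribute nothing; q(x) == 0 where
    p(x) > 0 makes the divergence infinite."""
    total = 0.0
    for x, px in p.items():
        if px == 0.0:
            continue
        qx = q.get(x, 0.0)
        if qx == 0.0:
            return math.inf
        total += px * math.log(px / qx)
    return total

p = {"a": 0.5, "b": 0.5}
print(kl(p, p))                         # 0.0: D(p || p) = 0
print(kl(p, {"a": 0.9, "b": 0.1}) > 0)  # True: divergence is non-negative
```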

8
Theoretical Foundation
  • Given unstructured V, N, and training data of X
    independent pairs of verbs and nouns.
  • Problem: learn the joint distribution of pairs given
    X
  • Not quite unsupervised, not quite supervised
  • No internal structure in pairs
  • Learn the underlying distribution

9
Distributional Clustering
  • Approximately decompose p(n, v) as p(n, v) =
    sum over c in C of p(c | n) p(c, v).
  • p(c | n): membership probability of n in c
  • p(c, v) = p(v | c): probability of v given the centroid
    for c
  • Assuming the marginals p(n), p(v) coincide, p(n, v) =
    sum over c in C of p(c) p(n | c) p(v | c)
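A toy numeric check, with hypothetical probabilities, that the symmetric decomposition yields a proper joint distribution when each factor is normalized:

```python
# Toy check (hypothetical numbers) that the decomposition
# p(n, v) = sum over c of p(c) * p(n | c) * p(v | c)
# defines a proper joint distribution.
p_c = {0: 0.6, 1: 0.4}
p_n_c = {0: {"gun": 0.8, "employee": 0.2},
         1: {"gun": 0.1, "employee": 0.9}}
p_v_c = {0: {"fire": 0.7, "clean": 0.3},
         1: {"fire": 0.5, "hire": 0.5}}

def p_joint(n, v):
    return sum(p_c[c] * p_n_c[c].get(n, 0.0) * p_v_c[c].get(v, 0.0)
               for c in p_c)

total = sum(p_joint(n, v)
            for n in ("gun", "employee")
            for v in ("fire", "clean", "hire"))
print(round(total, 10))  # 1.0: the mixture sums to one
```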

10
Maximum Likelihood Cluster Centroids
  • Used to maximize goodness of fit between the data and
    p(n, v)
  • For a sequence of pairs S, S's model log-probability is
    l(S) = sum over (n, v) in S of
    log( sum over c in C of p(c | n) p(c, v) ).
  • Maximize with respect to p(n | c) and p(v | c).
  • The variation of l(S) vanishes at the maximum

11
Maximum Entropy Cluster Membership
  • Assume independence between the variations of p(n | c)
    and p(v | c).
  • Can find the Bayes inverses of p(n | c) given p(v | c)
    and p(v | n)
  • The p(v | c) that maximize l(S) also minimize the average
    distortion between the cluster model and the data

12
Maximum Entropy Cluster Membership cont.
  • Average cluster distortion:
    <D> = sum over n, c of p(c | n) D(p_n || p_c)
  • Entropy: H = - sum over n, c of p(c | n) log p(c | n)

13
Maximum Entropy Cluster Membership cont.
  • Class and membership distributions take a Gibbs form:
    p(c | n) = exp(-beta D(p_n || p_c)) / Z(n)
  • Z(c) and Z(n) are normalization sums
  • Substituting these equations simplifies the
    log-likelihood
  • At the maximum, the variation vanishes
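A minimal sketch of the membership distribution normalized by Z(n); here `distortions` holds hypothetical KL distances D(p_n || p_c) of one noun to each centroid:

```python
import math

def membership(distortions, beta):
    """Soft membership p(c | n) = exp(-beta * D(p_n || p_c)) / Z(n),
    where Z(n) is the normalization sum over clusters.
    `distortions` maps each cluster to its KL distance from noun n."""
    weights = {c: math.exp(-beta * d) for c, d in distortions.items()}
    z_n = sum(weights.values())
    return {c: w / z_n for c, w in weights.items()}

d = {0: 0.1, 1: 0.9}             # hypothetical distortions for one noun
print(membership(d, beta=0.0))   # beta = 0: uniform membership
print(membership(d, beta=50.0))  # large beta: nearly hard assignment to 0
```

Raising `beta` sharpens the assignment, which is exactly the lever the hierarchical procedure on a later slide exploits.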

14
KL Distortion
  • Attempt to minimize the KL distortion through the
    variation of the KL distances
  • Results in centroids that are weighted averages of the
    noun distributions.

15
Free Energy Function
  • Combined minimum distortion and maximum entropy is
    equivalent to minimizing the free energy F = <D> -
    H/beta
  • F determines <D> and H through partial
    derivatives
  • The minimum of F determines the balance between the
    disordering tendency of maximum entropy and the ordering
    tendency of distortion minimization.
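In the slide's notation, the partial-derivative relations are the standard deterministic-annealing identities (a sketch, following the usual thermodynamic analogy with temperature 1/beta):

```latex
F = \langle D \rangle - \frac{H}{\beta},
\qquad
\langle D \rangle = \frac{\partial (\beta F)}{\partial \beta},
\qquad
H = \beta^{2}\,\frac{\partial F}{\partial \beta}
```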

16
Hierarchical Clustering
  • The number of clusters is determined through a sequence
    of increases of beta.
  • Higher beta implies more local influence of a noun
    on the definition of centroids.
  • Start with low beta and a single c in C
  • Search for the lowest beta that splits c into two or
    more leaf c's.
  • Repeat until C reaches the desired size.
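The steps above can be sketched in Python-style pseudocode; `fit_centroids`, `has_split`, and `split` are hypothetical stand-ins for the EM-style centroid updates and the leaf-splitting test, which the slides do not spell out:

```python
# Pseudocode sketch of the hierarchical annealing loop (not runnable
# as-is: fit_centroids, has_split, and split are placeholders).
def anneal(nouns, beta=0.5, beta_max=64.0, rate=1.5, target_size=8):
    clusters = [nouns]                    # start with a single c in C
    while beta < beta_max and len(clusters) < target_size:
        beta *= rate                      # raise beta: more local influence
        clusters = fit_centroids(clusters, beta)
        # split any leaf whose centroid becomes unstable at this beta
        clusters = [leaf for c in clusters
                    for leaf in (split(c) if has_split(c, beta) else [c])]
    return clusters
```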

17
Experimental Results
  • Classify 64 nouns appearing as direct objects of
    verb fire in Associated Press documents, 1988,
    where V 2147.
  • Four words most similar to cluster centroid and
    KL distances for first splits.
  • Split 1 cluster of fire as discharging weapons
    vs. cluster of fire as releasing employees
  • Split 2 weapons as projectiles vs. weapons as
    guns.

18
Clustering on Verb "fire"
19
Evaluation
20
Evaluation cont.
21
Conclusions
  • Clustering is efficient, informative, and yields
    good predictions
  • Future work:
  • Make the clustering method more rigorous
  • Introduce human judgment, i.e. a more supervised
    approach
  • Extend the model to other word relationships
