Geometric Clustering and the Information Bottleneck

... 2004. CMU ML lunch. Key concepts of the NIPS 2003 paper by Susanne Still, William Bialek, and Leon Bottou ...

1
Geometric Clustering and the Information Bottleneck
  • Key concepts of the NIPS 2003 paper by Susanne
    Still, William Bialek, and Leon Bottou
  • with background from "The Information Bottleneck
    Method" by Naftali Tishby, Fernando C. Pereira,
    and William Bialek

Presented by Mark V. Albert
2
Introduction to RDT
Rate Distortion Theory
  • Minimize $I(X; \tilde{X})$ with a constraint on distortion
  • Solved: the Blahut-Arimoto algorithm
  • Difficulty: need a distortion function. What features are
    important?
Diagram labels: $X$, $\tilde{X}$, $p(\tilde{x} \mid x)$, $p(\tilde{x})$
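The Blahut-Arimoto iteration named on this slide can be sketched roughly as below. This is a minimal illustration, not code from the talk: the function name `blahut_arimoto`, the distortion matrix `d`, and the trade-off multiplier `beta` are assumptions introduced here. It alternates between the conditional p(x̃|x) and the marginal p(x̃) for a fixed distortion function.

```python
import numpy as np

def blahut_arimoto(p_x, d, beta, n_iter=200):
    """Sketch of Blahut-Arimoto for rate distortion (names are illustrative).

    p_x  : (n,) source distribution p(x)
    d    : (n, m) distortion matrix, d[x, x_tilde]
    beta : non-negative multiplier trading rate against distortion
    Returns p(x_tilde|x), p(x_tilde), rate in bits, expected distortion.
    """
    n, m = d.shape
    q = np.full(m, 1.0 / m)                      # initial marginal p(x_tilde)
    for _ in range(n_iter):
        # p(x_tilde|x) is proportional to p(x_tilde) * exp(-beta * d(x, x_tilde))
        w = q[None, :] * np.exp(-beta * d)
        cond = w / w.sum(axis=1, keepdims=True)
        # p(x_tilde) = sum_x p(x) p(x_tilde|x)
        q = p_x @ cond
    joint = p_x[:, None] * cond
    mask = joint > 0
    ratio = cond / np.maximum(q[None, :], 1e-300)
    rate = float(np.sum(joint[mask] * np.log2(ratio[mask])))   # I(X; X_tilde)
    distortion = float(np.sum(joint * d))
    return cond, q, rate, distortion

# Usage: uniform binary source with Hamming distortion.
p_x = np.array([0.5, 0.5])
d = np.array([[0.0, 1.0], [1.0, 0.0]])
cond, q, rate, distortion = blahut_arimoto(p_x, d, beta=2.0)
print(rate, distortion)
```

The sketch makes the slide's difficulty concrete: the matrix `d` has to be supplied by hand, which is exactly the choice of "what features are important".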
3
The Information Bottleneck method
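  • $\min \; I(X; \tilde{X}) - \beta \, I(Y; \tilde{X})$
  • Solved: a general, iterative method
  • Wide variety of applications
Diagram labels: $X$, $Y$, $\tilde{X}$, $p(\tilde{x} \mid x)$, $p(y \mid \tilde{x})$, $p(\tilde{x})$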
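The general iterative method mentioned on this slide can be sketched as follows, based on the self-consistent updates of the Tishby, Pereira, and Bialek formulation. This is a sketch under assumptions: the function name `information_bottleneck`, the random initialization, and the small constant `eps` are choices made here, and `t` stands for the compressed variable x̃. It assumes every row of the joint table `p_xy` has positive mass.

```python
import numpy as np

def information_bottleneck(p_xy, n_clusters, beta, n_iter=300, seed=0):
    """Sketch of the iterative IB updates for a discrete joint p(x, y)."""
    rng = np.random.default_rng(seed)
    eps = 1e-12                                   # numerical guard, an assumption
    p_x = p_xy.sum(axis=1)
    p_y_given_x = p_xy / p_x[:, None]
    # Random soft initial assignment p(t|x)
    p_t_given_x = rng.random((len(p_x), n_clusters))
    p_t_given_x /= p_t_given_x.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        # p(t) = sum_x p(x) p(t|x)
        p_t = p_x @ p_t_given_x
        # p(y|t) = sum_x p(y|x) p(t|x) p(x) / p(t)
        p_y_given_t = (p_t_given_x * p_x[:, None]).T @ p_y_given_x
        p_y_given_t /= p_t[:, None] + eps
        # KL[p(y|x) || p(y|t)] for every (x, t) pair
        log_ratio = (np.log(p_y_given_x[:, None, :] + eps)
                     - np.log(p_y_given_t[None, :, :] + eps))
        kl = np.sum(p_y_given_x[:, None, :] * log_ratio, axis=2)
        # p(t|x) is proportional to p(t) * exp(-beta * KL), normalized over t
        logits = np.log(p_t + eps)[None, :] - beta * kl
        logits -= logits.max(axis=1, keepdims=True)
        p_t_given_x = np.exp(logits)
        p_t_given_x /= p_t_given_x.sum(axis=1, keepdims=True)
    # p_t and p_y_given_t come from the last pass through the loop
    return p_t_given_x, p_t, p_y_given_t

# Usage: four x values, two of each sharing the same p(y|x).
p_xy = np.array([[0.9, 0.1], [0.9, 0.1], [0.1, 0.9], [0.1, 0.9]]) / 4.0
p_t_given_x, p_t, p_y_given_t = information_bottleneck(p_xy, n_clusters=2, beta=10.0)
print(p_t_given_x.round(3))
```

Unlike the rate distortion setting, no distortion function is passed in: the KL divergence between p(y|x) and p(y|x̃) plays that role and is determined by the joint distribution itself.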
4
IB method and K-means
  • $\max \; I(x, c) - \lambda \, I(c, i)$
  • IB method finds iterative equations for $p(c \mid i)$,
    $p(c)$, and $p(x \mid c)$
  • With $\lambda < 1$ and $n \to \infty$ this is equivalent to
    k-means clustering
Empirical results (% global optimality convergence):
  • Low dimensional: K-means 75.8, IB derived 100
  • Four high-dimensional Gaussian clusters: K-means 37.8,
    IB derived 78-81
Diagram labels: i, x, c
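To illustrate how soft assignments p(c|i) and cluster weights p(c) relate to k-means, here is a small soft-clustering sketch. It is not the paper's exact update equations: the squared-Euclidean cost, the `temperature` parameter (a hypothetical stand-in for the trade-off scale), and the function name `soft_kmeans` are assumptions made for this example. As the temperature shrinks, the assignments harden and each pass reduces to a k-means step.

```python
import numpy as np

def soft_kmeans(x, init_centers, temperature, n_iter=50):
    """Soft clustering sketch: p(c|i) proportional to p(c) * exp(-||x_i - x_c||^2 / temperature)."""
    centers = np.array(init_centers, dtype=float)
    k = centers.shape[0]
    p_c = np.full(k, 1.0 / k)                     # cluster weights p(c)
    for _ in range(n_iter):
        # Squared distances between every point i and every center c
        d2 = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        logits = np.log(p_c + 1e-12)[None, :] - d2 / temperature
        logits -= logits.max(axis=1, keepdims=True)
        p_c_given_i = np.exp(logits)
        p_c_given_i /= p_c_given_i.sum(axis=1, keepdims=True)
        # Re-estimate p(c) and the centers from the soft assignments
        p_c = p_c_given_i.mean(axis=0)
        weights = p_c_given_i.sum(axis=0)[:, None] + 1e-12
        centers = (p_c_given_i.T @ x) / weights
    return p_c_given_i, p_c, centers

# Usage: two small, well-separated groups; both initial centers start in the first group.
x = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]], dtype=float)
p_c_given_i, p_c, centers = soft_kmeans(x, [[0, 0], [1, 0]], temperature=0.1)
print(p_c_given_i.argmax(axis=1), centers)
```

At a very large temperature every point is shared almost equally between the clusters, while at a small one the result matches a hard k-means partition.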
5
IB method applications
Diagram labels: X position, indices, clusters
  • Geometric Clustering
6
IB method applications
Diagram labels: verbs, nouns, context
  • Geometric Clustering
  • Semantic Clustering
Pereira, Tishby, Lee. Distributional Clustering
of English Words
7
IB method applications
Diagram labels: words, document, category; document, words
  • Geometric Clustering
  • Semantic Clustering
  • Document Categorization
Slonim, Tishby. Document Clustering using Word
Clusters via the Information Bottleneck Method
8
IB method applications
Diagram labels: spike trains, stimuli, stimulus features
  • Geometric Clustering
  • Semantic Clustering
  • Document Categorization
  • Neural Coding
9
IB method applications
Diagram labels: galaxy, spectra, galaxy clusters
  • Geometric Clustering
  • Semantic Clustering
  • Document Categorization
  • Neural Coding
  • Spectral Analysis
Slonim, Somerville, Tishby, Lahav. Objective
Classification of Galaxy Spectra using the
Information Bottleneck Method
10
IB method applications
Diagram labels: ?, ?, ?
  • Geometric Clustering
  • Semantic Clustering
  • Document Categorization
  • Neural Coding
  • Spectral Analysis
  • ???