1
Probabilistic Latent Semantic Indexing
  • Written by Thomas Hofmann
  • Presentation by Chris Janneck
  • Lehigh University CSE 397/497
  • Sept. 24, 2004

2
Overview
  • Introduction
  • LSI review
  • PLSI introduction
  • Aspect Model
  • Tempered Expectation Maximization
  • Geometrical Description
  • LSI-PLSI comparison
  • Similarities
  • Differences
  • Experiments
  • Conclusion

3
Intro: If it weren't for the people…
  • Problem: effective information retrieval
  • Corpus of data
  • Constantly increasing
  • Primarily text
  • (Human) user requests answers to be retrieved from the corpus
  • Uses a natural-language query: a user-formulated query, often similar to spoken/written language
  • Human languages introduce ambiguity: polysemy, synonymy
  • Simple term matching is no longer sufficient

4
Latent Semantic Indexing (LSI)
  • Popular retrieval-enhancement procedure
  • Reduces dimensionality for faster and (ideally) more relevant results
  • Decomposes the term-doc matrix
  • Using Singular Value Decomposition (SVD)
  • Breaks it into:
  • Term-K matrix (U)
  • K-K (rank-rank) matrix (S)
  • Doc-K matrix (V)
  • Sorts the singular values and eliminates the smaller ones
  • Dimension reduction via truncation (keep only the K largest; see the sketch below)
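A minimal sketch of the truncation step in Python, assuming a toy term-document count matrix (all data here is illustrative):

import numpy as np

# Toy term-document matrix: rows = terms, columns = documents
X = np.array([[2., 0., 1.],
              [0., 3., 1.],
              [1., 1., 0.],
              [0., 1., 2.]])

K = 2  # number of latent dimensions to keep

# SVD: X = U @ diag(s) @ Vt, with singular values s sorted descending
U, s, Vt = np.linalg.svd(X, full_matrices=False)

# Truncate to the K largest singular values -> rank-K approximation of X
X_k = U[:, :K] @ np.diag(s[:K]) @ Vt[:K, :]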

5
Probabilistic LSI
  • Uses the LSI idea, but grounded in probability theory
  • Comes from the statistical Aspect Model
  • Generates a co-occurrence model based on a non-observed (latent) class
  • This is a mixture model
  • Models a distribution through a mixture (weighted sum) of other distributions, as in the sketch below
  • Independence assumptions
  • Observed pairs (doc, word) are generated randomly
  • Conditional independence: conditioned on the latent class, words are generated independently of the document
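A tiny sketch of the mixture idea: one document's word distribution as a weighted sum of per-class word distributions (all numbers are made up):

import numpy as np

# P(w|z): one word distribution per latent class, over a 4-word vocabulary
P_w_given_z = np.array([[0.70, 0.20, 0.05, 0.05],   # class z = 0
                        [0.05, 0.05, 0.30, 0.60]])  # class z = 1

# P(z|d): mixing weights for one particular document
P_z_given_d = np.array([0.4, 0.6])

# P(w|d) = sum_z P(z|d) P(w|z): a weighted sum of the class distributions
P_w_given_d = P_z_given_d @ P_w_given_z
print(P_w_given_d.sum())  # 1.0 -- still a proper distribution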

6
Aspect Model
  • Generation process (sketched in code below)
  • Choose a doc d with prob P(d)
  • There are N d's
  • Choose a latent class z with (generated) prob P(z|d)
  • There are K z's, and K << N
  • Generate a word w with (generated) prob P(w|z)
  • This creates the pair (d, w), without direct concern for z
  • Joining the probabilities gives P(d, w) = P(d) P(w|d), where P(w|d) = Σ_z P(w|z) P(z|d)

Remember: P(z|d) means the probability of z, given d
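A minimal sketch of this generative process, with made-up parameters (N documents, K latent classes, M vocabulary words; all names and values are illustrative):

import numpy as np

rng = np.random.default_rng(0)
N, K, M = 5, 2, 8  # documents, latent classes, vocabulary size

# Made-up model parameters; each row is a proper probability distribution
P_d = np.full(N, 1.0 / N)                        # P(d)
P_z_given_d = rng.dirichlet(np.ones(K), size=N)  # P(z|d), one row per document
P_w_given_z = rng.dirichlet(np.ones(M), size=K)  # P(w|z), one row per class

def sample_pair():
    d = rng.choice(N, p=P_d)             # choose a doc d with prob P(d)
    z = rng.choice(K, p=P_z_given_d[d])  # choose a latent class z with prob P(z|d)
    w = rng.choice(M, p=P_w_given_z[z])  # generate a word w with prob P(w|z)
    return d, w                          # the pair is observed; z is discarded

pairs = [sample_pair() for _ in range(10)]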
7
Aspect Model (2)
  • Log-likelihood: L = Σ_d Σ_w n(d, w) log P(d, w), where n(d, w) is the count of word w in doc d
  • Maximize this to find P(d), P(z|d), P(w|z)
  • In Bayes' (symmetric) format this ends up as P(d, w) = Σ_z P(z) P(d|z) P(w|z)
  • This is conceptually different from LSI
  • Doc-specific word distributions, P(w|d), are based on a combination of specific classes/factors/aspects, P(w|z)
  • Not just assigned to the nearest cluster (a direct computation of L is sketched below)
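A sketch of computing L from a count matrix, reusing the parameter arrays from the sampler above (the helper name is illustrative):

import numpy as np

def log_likelihood(n_dw, P_d, P_z_given_d, P_w_given_z):
    # L = sum_{d,w} n(d,w) log P(d,w), with P(d,w) = P(d) sum_z P(z|d) P(w|z)
    P_w_given_d = P_z_given_d @ P_w_given_z  # shape (N, M)
    P_dw = P_d[:, None] * P_w_given_d        # joint probability P(d, w)
    mask = n_dw > 0                          # only observed pairs contribute
    return np.sum(n_dw[mask] * np.log(P_dw[mask]))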

8
Tempered Expectation Maximization
  • EM is a common technique for maximum-likelihood estimation
  • Alternates between:
  • E-step: calculate the posterior probabilities of z based on the current parameter estimates
  • M-step: update the parameter estimates based on the calculated posteriors (one full iteration is sketched below)
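A sketch of one plain (untempered) EM iteration for the aspect model, assuming the same array layout as above; an illustration, not Hofmann's exact implementation:

import numpy as np

def em_step(n_dw, P_z_given_d, P_w_given_z):
    # E-step: posterior P(z|d,w) proportional to P(z|d) P(w|z)
    post = P_z_given_d[:, :, None] * P_w_given_z[None, :, :]  # shape (N, K, M)
    post /= post.sum(axis=1, keepdims=True)                   # normalize over z

    # M-step: re-estimate parameters from expected counts n(d,w) P(z|d,w)
    weighted = n_dw[:, None, :] * post                        # shape (N, K, M)
    P_w_given_z = weighted.sum(axis=0)
    P_w_given_z /= P_w_given_z.sum(axis=1, keepdims=True)     # normalize over w
    P_z_given_d = weighted.sum(axis=2)
    P_z_given_d /= P_z_given_d.sum(axis=1, keepdims=True)     # normalize over z
    P_d = n_dw.sum(axis=1) / n_dw.sum()                       # P(d) from counts
    return P_d, P_z_given_d, P_w_given_z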

9
Tempered Expectation Maximization (2)
  • Tempered: include a control parameter β, where β < 1
  • Use this β until performance plateaus, then update β ← ηβ, where η < 1
  • Stop when there is no better performance after a change (a tempered E-step is sketched below)
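A sketch of a tempered E-step under the asymmetric parameterization used above (an assumption; the unnormalized posterior is raised to the power β before normalizing, and η = 0.9 is an illustrative value):

import numpy as np

def tempered_e_step(P_z_given_d, P_w_given_z, beta):
    # Tempered posterior: P(z|d,w) proportional to (P(z|d) P(w|z)) ** beta
    post = (P_z_given_d[:, :, None] * P_w_given_z[None, :, :]) ** beta
    return post / post.sum(axis=1, keepdims=True)

# Schedule: run EM with the current beta until held-out performance plateaus,
# then set beta = eta * beta (eta < 1); stop once lowering beta no longer helps.
beta, eta = 1.0, 0.9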

10
Geometrical Description
  • Probability distributions can now be mapped in a (K-1)-dimensional space
  • Instead of (M-1)-dimensional
  • Since K-1 < M-1, this is a dimension reduction
  • M-1 is the dimension of the space of all possible multinomials
  • Even though discrete points are mapped, the convex hull provides a continuous space

11
Similarities: LSI and PLSI
  • Both use intermediate, latent, non-observed data for classification (hence the L)
  • Can compose a Joint Probability similar to LSI's SVD:
  • U → U_hat = (P(d_i | z_k))
  • V → V_hat = (P(w_j | z_k))
  • S → S_hat = diag(P(z_k))
  • JP = U_hat S_hat V_hat^T
  • JP is similar to the SVD term-doc matrix N
  • Values are calculated probabilistically (the composition is sketched below)
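A sketch of composing JP, assuming P_z is a length-K vector and P_d_given_z, P_w_given_z are (K, N) and (K, M) arrays (these layouts are assumptions for illustration):

import numpy as np

def joint_probability(P_z, P_d_given_z, P_w_given_z):
    U_hat = P_d_given_z.T           # (N, K): U_hat[i, k] = P(d_i | z_k)
    S_hat = np.diag(P_z)            # (K, K): diagonal of class priors P(z_k)
    V_hat = P_w_given_z.T           # (M, K): V_hat[j, k] = P(w_j | z_k)
    return U_hat @ S_hat @ V_hat.T  # (N, M) matrix of joint probabilities P(d_i, w_j)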

12
Differences: LSI and PLSI
  • Basis:
  • LSI: term frequencies (usually); performs dimension reduction via projection or zeroing out the weaker components
  • PLSI: statistical; generates a (mostly random) initial model of the probabilistic relations between W, D, and Z, then refines it until an effective model is produced

13
Experiments
14
Experiments (2)
(Figure: experimental result plots, panels R1-R4)
15
Experiments (3)
16
Experiments (4)
Perplexity: the inverse of the per-word likelihood on held-out data (lower is better); a sketch of the computation follows
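A sketch of the perplexity computation, assuming n_dw is a held-out count matrix and P_w_given_d holds the model's per-document word distributions (names are illustrative):

import numpy as np

def perplexity(n_dw, P_w_given_d):
    # exp( - sum_{d,w} n(d,w) log P(w|d) / sum_{d,w} n(d,w) )
    mask = n_dw > 0
    ll = np.sum(n_dw[mask] * np.log(P_w_given_d[mask]))
    return np.exp(-ll / n_dw.sum())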
17
Conclusions
  • PLSI is a good thing because:
  • Consistently better precision/recall curves than LSI
  • TEM is computationally comparable to SVD
  • Better from a modeling sense
  • Uses the likelihood of sampling and aims to maximize it
  • SVD uses the L2-norm or another implicit Gaussian-noise assumption
  • Polysemy is recognizable
  • By viewing P(w|z)
  • Similar handling of synonymy