Title: Personalizing Search via Automated Analysis of Interests and Activities
 1Personalizing Search via Automated Analysis of 
Interests and Activities
- Jaime Teevan, MIT, CSAIL 
- Susan, MS Research 
- Eric, MS Research
2Outline
- Introduction 
- Pursuit of Personalization 
- Corpus, User, Doc  Query Representation 
- Evaluation Framework 
- Results 
- Conclusion  Future Work 
3Introduction
- Web search for IR returns a wide range of 
 results etc.
- Beyond the commons Investigating the value of 
 personalizing Web search. (workshop on new
 technologies for PIA 2005)
- They found that there is an opportunity to 
 achieve significant improvement by
 custom-tailoring search results to individuals,
 and were thus motivated to pursue search
 algorithms that return personalized results
 instead of treating all users the same.
-  
4Pursuit of Personalization
- Focused on re-ranking the top research results 
 locally.
- They explored Web search personalization by 
 modifying BM25 as their Probabilistic weighting
 scheme.
- Corpus representation N, ni 
- User representation R, ri 
- Doc  Query representation terms 
-  
5Corpus, User, Doc  Query Representation
- Corpus Representation 
- Full-text VS Title  Snippet 
- User Representation 
- A rich index of personal content that captured a 
 users interests and computational activities
- Within last month VS Full index of documents 
- Doc  Query Representation 
- Full-text VS Title  Snippet 
-  
6Evaluation Framework
- 15 participants, top 50 search results. 
- For 10 self-selected Queries 
- 2 way to evaluate queries 
- Collected a total of 131 queries 
- DCG (Discounted Cumulative Gain) 
- Rank(D1,..Dn), Relevance(3,2,3,0,0,1,2,) 
-  
7Results
- One-way effects in which we hold all but one 
 variable constant.
- Baseline Comparisons 
- Combining Rankings 
8One-way effects
  9Best combination of parameters for personalized 
search
- Corpus Representation Approximated by the 
 result set title and snippets.
- User Representation Built from the users 
 entire personal index.
- Document  Query Representation Documents 
 represented by the title and snippet returned by
 the search engine, with query expansion based on
 words that occur near the query term.
-  
10Baseline Comparisons
  11Combining Rankings 
 12Conclusion
- They have investigated the feasibility of 
 personalizing Web search by using an
 automatically constructed user profile as
 relevance feedback in their ranking algorithm.
- Better than explicit relevance feedback. 
- Its possible to approximate the corpus, making 
 efficient client-side computation feasible.
-  
13Future Work