Title: Web Markov Skeleton Processes and Applications
1Web Markov Skeleton Processes and Applications
- Zhi-Ming Ma
- 10 June, 2013,
St.Petersburg - Email mazm_at_amt.ac.cn
- http//www.amt.ac.cn/member/mazhiming/index.html
2- Y. Liu, Z. M. Ma, C. Zhou
- Web Markov Skeleton Processes and Their
Applications, Tohoku Math J. 63 (2011), 665-695 - Y. Liu, Z. M. Ma, C. Zhou
- Further Study on Web Markov Skeleton
Processes, in Stochastic Analysis and
Applications to Finance,World Scientific,2012 - C. Zhou Some Results on Mirror Semi-Markov
Processes, manuscript
3Web Markov Skeleton Process
4(No Transcript)
5Simple WMSP
Many simple WMSPs are
Non-Markov Processes
6(No Transcript)
7Mirror Semi-Markov Process
Mirror Semi-Markov Process is not a Hou-Lius
Markov Skeleton Process, i.e. it does not satisfy
8Multivariate Point Process
associated with WMSP
9(No Transcript)
10(No Transcript)
11Let
12(No Transcript)
13Consequently
where
Define
We can prove that
14where
15Time-homogeneous mirror semi-Markov processes
are all independent of n
16More property of of time homogeneity
Renewal Theory
Contribution probability
Staying times and first entry times
Limit distribution for
semi-Markov process
Limit distribution for mirror
semi-Markov processes
Reconstruction of Mirror Semi-Markov
Processes
17Why it is called a Web Markov Skeleton
Process?
18(No Transcript)
19A simple Markov Skeleton Process
Page Rank, a ranking algorithm used by the
Google search engine.
1998, Sergey Brin and Larry Page ,
Stanford University
- From probabilistic point of view,
- PageRank is the stationary distribution
of a Markov chain. -
20Markov chain describing surfing behavior
21Markov chain describing surfing behavior
22- Web surfers usually have two basic ways to
access web pages - with probability a, they visit a web page by
clicking a hyperlink. - 2. with probability 1-a, they visit a web page
by inputting its URL address.
23(No Transcript)
24Weak points of PageRank
- Using only static web graph structure
- Reflecting only the will of web managers,
- but ignore the will of users e.g. the staying
time of users on a web. - Can not effectively against spam and junk pages.
BrowseRankSIGIR.ppt
25Data Mining
26Browsing Process
27(No Transcript)
28(No Transcript)
29(No Transcript)
30(No Transcript)
31Computation of the Stationary Distribution
- Stationary distribution
-
-
- is the mean of the staying time on page i.
- The more important a page is, the longer
staying time on it is. - is the mean of the first re-visit time at
page i. The more important a page is, the smaller
the re-visit time is, and the larger the visit
frequency is.
32(No Transcript)
33BrowseRank Letting Web Users Vote for Page
Importance
- Yuting Liu,
- Bin Gao, Tie-Yan Liu, Ying Zhang,
- Zhiming Ma, Shuyuan He, and Hang Li
- July 23, 2008, Singapore
- the 31st Annual International ACM SIGIR
Conference on Research Development on
Information Retrieval.
Best student paper !
34(No Transcript)
35(No Transcript)
36(No Transcript)
37(No Transcript)
38(No Transcript)
39- Browse Rank the next PageRank
- says Microsoft
40- Browsing Processes will be a
- Basic Mathematical Tool in
- Internet Information Retrieval
- Beyond
- --General fromework of Browsing Processes?
- --How about inhomogenous process?
- --Marked point process
- --Mobile Web not really Markovian
41ExtBrowseRank and semi-Markov
processes
42(No Transcript)
43MobileRank and Mirror Semi-Markov
Processes
44MobileRank and Mirror Semi-Markov
Processes
45Web Markov Skeleton Process
- 10 B. Gao, T. Liu, Z. M. Ma, T. Wang, and H. Li
- A general markov framework for page importance
computation, In proceedings of CIKM '2009, - 11 B. Gao, T. Liu, Y. Liu, T. Wang, Z. M. Ma
and H. LI - Page Importance Computation based on Markov
Processes, Information Retrieval - online first
- lthttp//www.springerlink.com/conten
t/7mr7526x21671131
46Research on Random Complex Networks and
Information Retrieval In recent years
we have been involved in the research direction
of Random Complex Netowrks and Information
Retrieval. Below are some of the related outputs
by our group (in collaboration with Microsoft
Research Asia)
47(No Transcript)
48(No Transcript)
49(No Transcript)
50More property of time homogeneity
51Theorem LMZ 2011a
for all n
Theorem LMZ 2011b General case
52The statistical properties of a time homogeneous
mirror semi-Markov process is completely
determined by
53Reconstruction of Mirror Semi-Markov
Processes
Theorem LMZ 2011b
We can construct
such that
54uniformly
55Limit distribution for
semi-Markov process
56(No Transcript)
57(No Transcript)
58Limit distribution for mirror
semi-Markov processes
59(No Transcript)
60Staying times and first entry times
Staying time on the state j
Distribution
Expectation
Distribution
Expectation
61Contribution probability
from state i to state j
62 Renewal Theory
Proposition
63Renewal Equation LMZ2011a
64Renewal functional
Below are the resuls on the renewal functional
LMZ2011a
65(No Transcript)
66Thank you !
67(No Transcript)
68Time Homogeneous WMSP
69(No Transcript)
70More property of of time homogeneity
Theorem LMZ 2011b
71Reconstruction of WMSP
LMZ2011b
72(No Transcript)
73Ranking Websites, a Probabilistic View
Internet Mathematics, Volume 3 (2007), Issue 3
- Ying Bao, Gang Feng, Tie-Yan Liu, Zhi-Ming Ma,
and Ying Wang
AggregateRank Bring Order to
Web Sites 29th Annual International Conference
on Research Development on Information
Retrieval (SIGIR06). G.Feng, T.Y. Liu, Ying
Wang, Y.Bao, Z.M.Ma et al
74(No Transcript)