Sampling from Large Graphs poster - PowerPoint PPT Presentation

About This Presentation
Title:

Sampling from Large Graphs poster

Description:

A: (at least) the 13 ones we list. Q: How to measure success ... Citation (HEP-TH, HEP-PH) A.S. epinions.com. 26K - 500K edges. KDD 2006. Leskovec & Faloutsos ... – PowerPoint PPT presentation

Number of Views:22
Avg rating:3.0/5.0
Slides: 13
Provided by: csC76
Learn more at: http://www.cs.cmu.edu
Category:
Tags: graphs | hep | poster | sampling

less

Transcript and Presenter's Notes

Title: Sampling from Large Graphs poster


1
??
??
2
Sampling from Large Graphsposter 305
  • Jurij (Jure) Leskovec
  • Christos Faloutsos
  • Carnegie Mellon University

3
Problems and recommendations
  • Q How to sample from a large graph?
  • A FF, RN
  • Q Which properties to preserve?
  • A (at least) the 13 ones we list
  • Q How to measure success/similarity?
  • A K-S, towards back-in-time version

4
Criteria
STATIC
TEMPORAL
  • in-degree out-degree distribution
  • distr. of WCC SCC
  • hop-plot hop-plot for WCC
  • distr. of first left singular vector values
  • scree plot
  • distr. of clustering coefficient
  • Densification power law
  • shrinking diameter
  • normalized size of largest c.c.
  • first eigenvalue

5
Targets
  • scale-down ( fewer nodes same diameter, same
    degree etc)
  • back-in-time (match an earlier, real, smaller
    version of the graph)

6
Sampling Methods
  • RN random nodes
  • RPN pageRank random nodes
  • RDN random nodes, degree-biased
  • RE random edges
  • RNE
  • HYB (Hybrid)
  • RNN
  • RJ random jump
  • RW random walk
  • FF Forest fire

7
4 Datasets
  • Arxiv (author-paper)
  • Citation (HEP-TH, HEP-PH)
  • A.S.
  • epinions.com
  • 26K - 500K edges

8
Diameter vs N CC vs degree
9
degree distribution avg CC vs N
10
diameter
DPL
11
D-statistic vs sample size
better
scale-down
back-in-time
12
Conclusions
  • random nodes a little exploration -gt FF
  • (RN, RJ are close)
  • 15 sample seems enough
  • back-in-time concept
Write a Comment
User Comments (0)
About PowerShow.com