Indexing of Dynamic Abstract Regions - PowerPoint PPT Presentation

1 / 29
About This Presentation
Title:

Indexing of Dynamic Abstract Regions

Description:

... clipping-based index for both overlapping and non-overlapping spatial data ... Combines dynamic segmentation, domain reduction and partial rebuilding ... – PowerPoint PPT presentation

Number of Views:74
Avg rating:3.0/5.0
Slides: 30
Provided by: kenn1
Category:

less

Transcript and Presenter's Notes

Title: Indexing of Dynamic Abstract Regions


1
Indexing of Dynamic Abstract Regions
  • Joxan Jaffar, Roland Yap
  • National Univerisity of Singapore
  • Kenny Q. Zhu
  • Microsoft
  • ICDE 2006

2
Outline
  • What are we indexing?
  • Existing Methods
  • The RC-tree
  • Clipping and Domain Reduction
  • Partial Rebuilding
  • Dynamic Segmentation
  • Heuristics
  • Technical Results
  • Experimental Results
  • Future Work
  • Conclusion

3
What are we indexing?
  • Stock trading system
  • Large number of triggers like these
  • Trader A if price of IBM 50, then sell 100
    shares of IBM otherwise if price of Dell 35,
    then buy 200 shares of dell
  • Trader B if the price of Dell 2 times the
    price of IBM, then buy 100 shares of IBM and sell
    100 shares of Dell
  • Need an index for them!

4
What are we indexing? (contd)
  • Conjunctions of linear constraints, e.g.
  • Abstract regions of interest or shapes
  • Multi-dimensional spatial objects
  • May overlap each other

5
Existing Representations
  • GIS applications usually rigid objects with no
    overlapping
  • Minimum Bounding Rectangles (MBR) approximations
  • Manual segmentation into smaller boxes (consider
    a freeway!)
  • Segmentation either too coarse (dead space) or
    too fine (space and time cost)

6
Existing Representations Highways in Germany
7
Existing Data Structures
  • R-tree object partitioning, height balanced
  • R-tree space partitioning (clipping),
    height-balanced
  • R-tree R-tree variant with a different object
    splitting method
  • Kd-tree space partitioning, weight-balanced
  • Quad-tree space partitioning, unbalanced
  • CR-tree cache-conscious R-tree

8
The RC-tree
  • Reducible Clip-tree
  • A general weight-balanced binary tree for dynamic
    search and update of k-dimensional spatial
    objects
  • Designed for main-memory use
  • Intermediate nodes space-partitioning
    hyper-planes as discriminators
  • Leaf nodes (overflow nodes) indexed objects
  • Objects that intersects with the discriminators
    are clipped and domain-reduced

9
Clipping and Domain Reduction
  • A Original objects
  • B MBR-based indexing
  • C RC-tree with discriminators d1, d2, and d3

10
Benefits of Domain Reduction
11
Dynamic Segmentation
  • A pre-defined leaf capacity L
  • A leaf is split only if number of (clipped)
    objects is larger than L
  • Splitting of a leaf node may cause segmentation
    (clipping) of object if the objects are
    overlapped
  • Compare with BSP
  • Only segment an object when necessary (on demand)
    as opposed to static segmentation

12
Dynamic Segmentation
13
Dynamic Segmentation
14
Dynamic Segmentation
15
Partial Rebuilding
  • Originally from Overmars
  • Rebuild part of the tree that is out of balance
  • Balancing criterion
  • where T is any sub-tree rooted at node T
  • Flush all objects (clipped) within T, merge them
    and rebuild the sub-tree
  • Controls clipping and space usage

16
Partial Rebuilding
partial rebuild
h
T
T
17
Some Heuristics
  • Additional MBRs at intermediate nodes to speed up
    ruling out false queries
  • Finding the right discriminator
  • RC-SWEEP sorts the objects and sweeps through to
    balance the weight and minimize clipping ? more
    optimized but slower
  • RC-MID quickly partitions the objects in the
    middle of the sub-space and ignore the effects of
    clipping ? less optimized but fast

18
Technical Results
  • Given a constant scaling factor s which bounds
    the relative extents of objects and assuming a
    fixed dimensionality k
  • The amortized insertion cost of a balanced
    RC-tree of n objects has O(sklog(n)) time
    complexity.
  • A point query has average O((1.5)k(log(n)sk))
    time
  • A RC tree has worst case O(skn) space
    complexity.

19
Experimental Results
  • Synthetic Datasets overlapping line segments and
    triangles
  • Traditional GIS Datasets e.g. German roads

20
Search cost synthetic data (log scale)
21
Search cost GIS data
22
Space Usage Synthetic Data
23
Space usage GIS data
24
Hit rates Synthetic Data (negative log scale)
25
Search time GIS Data
26
Insertion and Search Cost vs. log(n)
27
Other Empirical Results
  • Range queries better query accuracy
  • Dynamic vs. static segmentation
  • Cache oblivious RC-tree
  • Some limited performance improvement
  • Requires larger cache-line relative to the node
    size

28
Future Work
  • Indexing higher dimension regions
  • Application of domain reduction techniques in
    height-balanced indexes
  • More advanced cache study of RC-tree
  • Using RC-tree and its techniques in triggering of
    active database and other business applications

29
Conclusion
  • A good weight-balanced clipping-based index for
    both overlapping and non-overlapping spatial data
  • Allows the tuning of space-time trade off
  • Combines dynamic segmentation, domain reduction
    and partial rebuilding
  • Renews the interest in spacing partitioning and
    object clipping indexing methods
Write a Comment
User Comments (0)
About PowerShow.com