Motivation - PowerPoint PPT Presentation

About This Presentation
Title:

Motivation

Description:

Streams of trading data, stock tickers, news feeds ... No real-time services. Assume precise data ... Real-time requirements. Data stale/imprecise ... – PowerPoint PPT presentation

Number of Views:63
Avg rating:3.0/5.0
Slides: 7
Provided by: RajeevM4
Learn more at: http://web.cs.wpi.edu
Category:

less

Transcript and Presenter's Notes

Title: Motivation


1
Motivation
2
Data Streams
  • Traditional DBMS data stored in finite,
    persistent data sets
  • New Applications data input as continuous,
    ordered data streams
  • Network monitoring and traffic engineering
  • Telecom call records
  • Network security
  • Financial applications
  • Sensor networks
  • Manufacturing processes
  • Web logs and clickstreams
  • Massive data sets

3
Data Stream Management System
User/Application
Register Query
Results
Data Stream Management System (DSMS)
Stream Query Processor
Scratch Space (Memory and/or Disk)
4
Meta-Questions
  • Killer-apps
  • Application stream rates exceed DBMS capacity?
  • Can DSMS handle high rates anyway?
  • Motivation
  • Need for general-purpose DSMS?
  • Not ad-hoc, application-specific systems?
  • Non-Trivial
  • DSMS merely DBMS with enhanced support for
    triggers, temporal constructs, data rate mgmt?

5
Sample Applications
  • Network security
    (e.g., iPolicy, NetForensics/Cisco,
    Niksun)
  • Network packet streams, user session information
  • Queries URL filtering, detecting intrusions
    DOS attacks viruses
  • Financial applications
    (e.g., Traderbot)
  • Streams of trading data, stock tickers, news
    feeds
  • Queries arbitrage opportunities, analytics,
    patterns

6
DBMS versus DSMS
  • Persistent relations
  • One-time queries
  • Random access (pull)
  • Unbounded disk store
  • Only current state matters
  • Passive repository
  • Relatively low update rate
  • No real-time services
  • Assume precise data
  • Access plan determined by query processor,
    physical DB design
  • Transient streams
  • Continuous queries
  • Sequential access (push)
  • Bounded main memory
  • History/arrival-order is critical
  • Active stores
  • Possibly multi-GB arrival rate
  • Real-time requirements
  • Data stale/imprecise
  • Unpredictable/variable data arrival and
    characteristics
Write a Comment
User Comments (0)
About PowerShow.com