Fast Subsequence Matching in Time-Series Databases

Provided by: arturola

Transcript and Presenter's Notes

1
Fast Sub-Sequence Matching in Time-Series
Databases

Michael Käser

2
Outline

Time-series databases
Building an index
Answering queries
Evaluation of the method
Conclusion

3
The paper

Published in 1994
Awarded Best paper at SIGMOD 1994

4
Definition of time-series databases

Each row is a sequence of numbers
Sequence lengths can vary
How does this differ from other sequence data such as text or
DNA?

The values come from a continuous signal that was sampled
at a fixed interval
They are not discrete symbols

5
Applications for time-series databases

Financial data
Astronomical data
Weather data
Sociological data
many more

6
Example database
7
Searching

A query on the database has two properties
Query sequence R
Query distance ε
Queries can be categorized by their distance and
by the length of R

8
Query distance

Allows searching for similar data
Distance of 0 is exact search
Distances between sequences are calculated using
the Euclidean distance function

9
Length of query

Same length as data: searching is easy
Shorter than data: do a comparison at every
possible offset
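
The baseline described on this slide can be sketched as a sequential scan. This is only an illustration: the function names `euclidean` and `sequential_scan` and the sample data are assumptions made for this sketch and do not come from the presentation.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def sequential_scan(data, query, eps):
    """Return every offset at which the query lies within eps of the data."""
    n, m = len(data), len(query)
    return [k for k in range(n - m + 1)
            if euclidean(data[k:k + m], query) <= eps]

# Hypothetical sample sequence, chosen only for illustration.
data = [1.0, 2.0, 3.0, 2.0, 1.0, 2.0, 3.0]
exact = sequential_scan(data, [2.0, 3.0], 0.0)    # distance 0: exact search
similar = sequential_scan(data, [2.0, 3.0], 1.5)  # larger distance: similarity search
print(exact, similar)
```

Every offset costs a full distance computation, which is the work the index-based method tries to avoid.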

10
What should be achieved

Sequential searching on the sequences is slow
The new search method should
Improve performance for all query types
Require little space overhead
Not miss any matching sequences
(But may generate a few false alarms)

11
How it is achieved

Step 1: Extract information from the sequences
Step 2: Add support for short queries
Step 3: Store in an efficient data structure
Step 4: Query the index

12
Step 1: Extracting features

Compress the information of a complete sequence
into a smaller number of features
Number of features f should be defined in advance
Transform each sequence to a point in the
f-dimensional feature space

13
Discrete Fourier Transform

Transforms a sequence into another sequence of the same
length
Each element of the transformed sequence holds
information about all elements of the original
sequence
Transformed elements are complex numbers

14
DFT for feature extraction

Cut off the transformed sequence after f elements
Use the amplitude of each complex number
The distance between transformed sequences is never
larger than the original distance, so no match is missed
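
A minimal sketch of this feature extraction, assuming an orthonormal DFT (scaled by 1/sqrt(n)) so that Parseval's theorem applies. The names `dft` and `features` and the two sample sequences are assumptions for illustration; the naive O(n^2) transform is used only to keep the sketch free of dependencies.

```python
import cmath
import math

def dft(x):
    """Naive orthonormal DFT: scaling by 1/sqrt(n) preserves Euclidean distance."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            / math.sqrt(n)
            for k in range(n)]

def features(x, f):
    """Amplitudes of the first f DFT coefficients."""
    return [abs(c) for c in dft(x)[:f]]

def euclidean(a, b):
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

# Hypothetical sequences, chosen only for illustration.
a = [1.0, 2.0, 4.0, 3.0, 1.0, 0.0, 2.0, 5.0]
b = [1.5, 2.5, 3.0, 3.0, 0.5, 1.0, 2.0, 4.0]
d_orig = euclidean(a, b)
d_feat = euclidean(features(a, 3), features(b, 3))
print(d_feat <= d_orig)
```

Dropping coefficients and taking amplitudes can only shrink the distance, which is why filtering in feature space may produce false alarms but does not lose true matches.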

15
Extracting features in the example
16
Step 2: Extend index for subsequences

Define a minimum query length w
Use a sliding window over the original data
At each window position extract features
All transformed points of subsequences form the
trail of a sequence in the feature space
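
The sliding-window step might look like the sketch below. The names `features` and `trail`, the sample sequence, and the small values of w and f are assumptions for illustration; the features are DFT amplitudes as on the earlier slide.

```python
import cmath
import math

def features(window, f):
    """Amplitudes of the first f coefficients of an orthonormal DFT."""
    n = len(window)
    return tuple(
        abs(sum(window[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                for t in range(n))) / math.sqrt(n)
        for k in range(f))

def trail(sequence, w, f):
    """One feature point per window position; the points form the trail."""
    return [features(sequence[i:i + w], f)
            for i in range(len(sequence) - w + 1)]

# Hypothetical sequence; w and f are kept tiny so the trail is easy to inspect.
seq = [3.0, 4.0, 6.0, 5.0, 4.0, 6.0, 7.0, 9.0, 8.0, 7.0]
points = trail(seq, w=4, f=2)
print(len(points))
```

A sequence of length n yields n - w + 1 points, and consecutive points tend to lie close together because neighbouring windows share most of their values.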

17
Generating trails in the example
18
Example of trails
19
Step 3: Storage of trails

Storing all the points in a trail requires a lot
of space
Searching an index over all these points is slower than
a plain sequential scan
An efficient data structure for spatial data has
to be used

20
The R-Tree

Data structure for storing multi-dimensional regions
(i.e. rectangles)
Content is in leaf nodes
Other nodes are minimum bounding rectangles
around the child nodes
Rectangles can overlap
Good algorithms for inserting and deleting exist

21
R-tree example
22
Using the R-tree to store the trails

Split each trail into a number of sub-trails
Put a rectangle around each sub-trail
Save it together with the sequence id and offsets
How should the trails be split?
A fixed number of points per sub-trail is not
optimal
Use an adaptive algorithm that minimizes the
number of disk accesses
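
One way to sketch an adaptive split is a greedy pass that starts a new sub-trail whenever adding the next point would raise a per-point cost. The cost used here, the product of (side length + 0.5) over all dimensions divided by the number of points, is an assumption standing in for the estimated number of disk accesses; the names `mbr`, `marginal_cost` and `split_trail` and the sample points are hypothetical.

```python
def mbr(points):
    """Minimum bounding rectangle of a list of points, as (lows, highs)."""
    dims = range(len(points[0]))
    lows = tuple(min(p[d] for p in points) for d in dims)
    highs = tuple(max(p[d] for p in points) for d in dims)
    return lows, highs

def marginal_cost(points):
    """Assumed cost model: product of (side + 0.5), divided by point count."""
    lows, highs = mbr(points)
    cost = 1.0
    for lo, hi in zip(lows, highs):
        cost *= (hi - lo) + 0.5
    return cost / len(points)

def split_trail(points):
    """Greedy split into (start_offset, end_offset, mbr) sub-trails."""
    subtrails = []
    start, current = 0, [points[0]]
    for i in range(1, len(points)):
        if marginal_cost(current + [points[i]]) > marginal_cost(current):
            # The new point would make the rectangle too expensive: close it.
            subtrails.append((start, i - 1, mbr(current)))
            start, current = i, [points[i]]
        else:
            current.append(points[i])
    subtrails.append((start, len(points) - 1, mbr(current)))
    return subtrails

# Hypothetical trail with a jump between the third and fourth point.
pts = [(0.0, 0.0), (0.1, 0.1), (0.2, 0.1), (5.0, 5.0), (5.1, 5.2)]
result = split_trail(pts)
print(result)
```

In this example the jump in the trail triggers a split, so two small rectangles are stored instead of one large one that would intersect many more queries.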

23
Example: Selecting sub-trails
24
Step 4: Querying the index

Use only the first w elements of query
Extract the features of the query
Represent it as a sphere around the feature point
with the query distance as radius
Intersect with R-tree nodes
Add the offsets associated with each matching
child node to the result set
Recalculate every distance in the result set and
discard false alarms
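
The query steps can be sketched end to end as follows. To stay short, the sketch replaces the R-tree with a flat list of rectangles and uses fixed-size sub-trails; both are simplifications assumed for illustration, as are all function names, the sample data, and the parameter values.

```python
import cmath
import math

def features(window, f):
    """Amplitudes of the first f coefficients of an orthonormal DFT."""
    n = len(window)
    return tuple(
        abs(sum(window[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                for t in range(n))) / math.sqrt(n)
        for k in range(f))

def euclidean(a, b):
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

def build_index(seq, w, f, chunk):
    """Flat stand-in for the R-tree: one rectangle per fixed-size sub-trail."""
    pts = [features(seq[i:i + w], f) for i in range(len(seq) - w + 1)]
    entries = []
    for s in range(0, len(pts), chunk):
        part = pts[s:s + chunk]
        lows = tuple(min(p[d] for p in part) for d in range(f))
        highs = tuple(max(p[d] for p in part) for d in range(f))
        entries.append((s, s + len(part) - 1, lows, highs))
    return entries

def mindist(point, lows, highs):
    """Distance from a point to the nearest point of a rectangle (0 if inside)."""
    return math.sqrt(sum(max(lo - x, 0.0, x - hi) ** 2
                         for x, lo, hi in zip(point, lows, highs)))

def range_query(seq, entries, query, eps, w, f):
    q = features(query[:w], f)          # use only the first w elements
    m = len(query)
    hits = []
    for start, end, lows, highs in entries:
        if mindist(q, lows, highs) <= eps:       # sphere intersects rectangle
            for k in range(start, end + 1):      # candidate offsets
                if k + m <= len(seq) and euclidean(seq[k:k + m], query) <= eps:
                    hits.append(k)               # survived the false-alarm check
    return hits

# Hypothetical data and parameters.
seq = [1.0, 2.0, 3.0, 2.0, 1.0, 2.0, 3.0, 4.0, 3.0, 2.0, 1.0, 2.0]
entries = build_index(seq, w=4, f=2, chunk=3)
query = seq[4:10]
print(range_query(seq, entries, query, eps=0.0, w=4, f=2))
```

Because the feature distance never exceeds the true distance, every true match falls in a rectangle that the sphere touches; the final distance check only removes false alarms.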

25
Better method

Split query into p parts of length w
Do a query for each part
Merge the results
The query distance for each part can be reduced to ε / √p
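
The reduced radius usually given for this multi-piece search is ε / √p. The reasoning: if every one of the p pieces were farther apart than ε / √p, the squared distances would sum to more than ε², so two sequences within ε must agree within ε / √p on at least one piece. The small check below illustrates this; the name `piece_distances` and the sample data are assumptions for this sketch.

```python
import math

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def piece_distances(a, b, w):
    """Distance between corresponding pieces of length w."""
    return [euclidean(a[i:i + w], b[i:i + w]) for i in range(0, len(a), w)]

# Hypothetical sequences of length 8, split into p = 2 pieces of length w = 4.
a = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
b = [1.0, 2.5, 3.0, 4.5, 5.0, 6.0, 7.5, 8.0]
w = 4
p = len(a) // w
eps = euclidean(a, b)                  # take the actual distance as the tolerance
per_piece = piece_distances(a, b, w)
print(min(per_piece) <= eps / math.sqrt(p))
```

A smaller radius per piece means each sub-query intersects fewer rectangles, at the price of running p sub-queries and merging their candidates.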

26
Evaluation

Tested on a real database with 329,000 points
Minimum query length w of 512
Queries of length 512 were 3 to 100 times faster
Longer queries were 2 to 40 times faster
Index size was 5 KB

27
Evaluation
28
Conclusion

Proposed method works fast for real-world data
Influential paper
A lot of research based on it
Reducing false alarms
Adding constraints to the query
Streaming Time Series
Improvements in R-Trees
many more (250 citations)

29
Your questions?
