Mining the Stock Market: Which Measure is the Best ? Martin Gavrilov, Dragomir Anguelov, Piotr Indyk, Rajeev Motwani

About This Presentation

Title:

Mining the Stock Market: Which Measure is the Best ? Martin Gavrilov, Dragomir Anguelov, Piotr Indyk, Rajeev Motwani

Description:

Mining the Stock Market: Which Measure is the Best ? Martin Gavrilov, Dragomir Anguelov, Piotr Indyk, Rajeev Motwani Presented by Arun Qamra Main Idea Lot of interest ... – PowerPoint PPT presentation

Number of Views:97

Avg rating:3.0/5.0

Slides: 16

Provided by: Tobi91

Learn more at: https://web.ece.ucsb.edu

Category:

more less

Transcript and Presenter's Notes

Title: Mining the Stock Market: Which Measure is the Best ? Martin Gavrilov, Dragomir Anguelov, Piotr Indyk, Rajeev Motwani

1
Mining the Stock Market Which Measure is the
Best ?Martin Gavrilov, Dragomir Anguelov, Piotr
Indyk, Rajeev Motwani

Presented by
Arun Qamra

2
Main Idea

Lot of interest in mining Time Series data
But little work on identifying measures suitable
for specific class of data sets
This work attempts to
Study similarity measures suitable for stocks
Evaluate results

3
More specifically..

500 stocks, data for one year (S P index, 1998)
Opening price for 252 days
Time Series
Clustering to find similar stocks
Variety of similarity measures

4
Evaluation Technique

How do you evaluate clustering results ?
Each stock pre-assigned to a cluster/category
102 clusters (based on industry)
Abstracted into 62 super-clusters
Used as ground truth
Attempt to recreate this clustering

5
Feature Selection

Data Representation
Normalization
Dimensionality Reduction

6
Data Representation

Raw
Point in 252-dimensional space represents
sequence i.e. stock
First Derivative
i-th coordinate is equal to difference between
i-th and (i1)-th value of sequence

7
Normalization