Title: Entire Regularization Paths for Graph Data

1
Entire Regularization Paths for Graph Data

Max Planck Institute for Biological Cybernetics
Koji Tsuda

2
Graph Regression
Test
Training
3
Substructure Representation

0/1 vector of pattern indicators
Huge dimensionality!
Need feature selection

patterns
4
Overview

Entire regularization paths
LARS-LASSO (Efron et al., 2004), L1SVM
Forward selection of features
Trace the solution trajectory of L1-regularized
learning
Path following algorithm for graph data
Feature search -> pattern search
Branch-and-bound algorithm
DFS code tree, New Bound

5
Path Following Algorithms

LASSO regression
Follow the complete trajectory of the weight vector w(λ)
as the regularization parameter λ goes from infinity to zero
Active feature set
Features corresponding to nonzero weights

6
Piecewise Linear Path

At a turning point,
A new feature included into the active set, or
An existing feature excluded from the active set
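
A brief derivation of why the path is linear between turning points. It assumes the LASSO objective is written as (1/2)||y - Xw||^2 + λ||w||_1; the slide's own formula did not survive extraction, so the symbols X, y, w, A and s below are illustrative rather than the presenter's notation.

```latex
\begin{aligned}
&\text{Optimality (KKT) conditions at a fixed } \lambda:\\
&\quad x_j^\top (y - Xw) = \lambda\, s_j, \quad s_j = \operatorname{sign}(w_j), && j \in A \ (\text{active set}),\\
&\quad \lvert x_j^\top (y - Xw) \rvert \le \lambda, && j \notin A.\\
&\text{Solving the first line for the active weights:}\\
&\quad w_A(\lambda) = (X_A^\top X_A)^{-1}\,\bigl(X_A^\top y - \lambda\, s_A\bigr).
\end{aligned}
```

While A and s_A stay fixed, w_A is linear in λ. A turning point occurs when an inactive feature reaches equality in the second condition (inclusion) or an active weight reaches zero (exclusion).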

7
Practical Merit of Path Following

Cross validation by grid search
Has to solve QP many times
Especially time-consuming for graph data
Path following does not require solving a QP
Determines the CV-optimal regularization parameter
to the finest precision
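
The "finest precision" point can be illustrated with a small sketch. It assumes the path is already available as a list of breakpoints (λ, w) and that validation error is the sum of squared errors; the function name best_lambda and the toy numbers are hypothetical. Because the weights are linear in λ between breakpoints, the validation error is a quadratic in λ on each segment and can be minimized in closed form, with no grid.

```python
def best_lambda(path, X_val, y_val):
    """Return (error, lambda) minimizing squared validation error along a
    piecewise linear path given as breakpoints [(lam, w), ...], lam decreasing."""
    def errors(w):
        # Prediction error on every validation example for weights w.
        return [sum(xi * wi for xi, wi in zip(x, w)) - t
                for x, t in zip(X_val, y_val)]

    best = (float("inf"), None)
    for (lam0, w0), (lam1, w1) in zip(path, path[1:]):
        e0, e1 = errors(w0), errors(w1)
        # On this segment the error vector is e0 + t * d with t in [0, 1].
        d = [b - a for a, b in zip(e0, e1)]
        dd = sum(v * v for v in d)
        if dd == 0:
            t = 0.0
        else:
            # Unconstrained minimizer of the quadratic, clipped to the segment.
            t = -sum(a * v for a, v in zip(e0, d)) / dd
            t = min(1.0, max(0.0, t))
        err = sum((a + t * v) ** 2 for a, v in zip(e0, d))
        lam = lam0 + t * (lam1 - lam0)
        if err < best[0]:
            best = (err, lam)
    return best


# Toy path with two features and breakpoints at lambda = 3, 1, 0.
toy_path = [(3.0, [0.0, 0.0]), (1.0, [2.0, 0.0]), (0.0, [3.0, 1.0])]
err, lam = best_lambda(toy_path, [[1.0, 0.0], [0.0, 1.0]], [2.5, 0.5])
print(lam, err)
```

In this toy case the optimum lies strictly between two breakpoints, so a grid over λ in {3, 2, 1, 0} would miss it.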

8
Pseudo code of path following

Set initial point and direction
Do
d1 = step size if next event is inclusion
d2 = step size if next event is exclusion
d = min(d1, d2)
Update the active feature set
Set the next direction
Until all features are included
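
The loop above can be sketched for ordinary vector data as follows. This is a minimal pure-Python homotopy for the objective (1/2)||y - Xw||^2 + λ||w||_1, not the presenter's implementation. The names solve and lasso_path are hypothetical, the active columns are assumed linearly independent, and the loop stops when λ reaches zero rather than when all features are included.

```python
def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for k in range(n):
        piv = max(range(k, n), key=lambda r: abs(M[r][k]))
        M[k], M[piv] = M[piv], M[k]
        for r in range(k + 1, n):
            f = M[r][k] / M[k][k]
            for c in range(k, n + 1):
                M[r][c] -= f * M[k][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][c] * x[c] for c in range(i + 1, n))) / M[i][i]
    return x


def lasso_path(X, y, eps=1e-12):
    """Return the breakpoints [(lam, w), ...] of the LASSO path."""
    n, p = len(X), len(X[0])
    cols = [[X[i][j] for i in range(n)] for j in range(p)]
    dot = lambda u, v: sum(a * b for a, b in zip(u, v))
    w = [0.0] * p
    corr = [dot(cols[j], y) for j in range(p)]  # correlations with residual
    lam = max(abs(v) for v in corr)             # initial point: w = 0
    active = [max(range(p), key=lambda j: abs(corr[j]))]
    path = [(lam, w[:])]
    while lam > eps:
        # Direction: active weights move so active correlations shrink equally.
        s = [1.0 if corr[j] > 0 else -1.0 for j in active]
        G = [[dot(cols[i], cols[j]) for j in active] for i in active]
        dw = solve(G, s)
        u = [sum(X[i][j] * d for j, d in zip(active, dw)) for i in range(n)]
        # d1: step size if the next event is an inclusion.
        d1, j_in = lam, None
        for j in range(p):
            if j in active:
                continue
            a = dot(cols[j], u)
            for num, den in ((lam - corr[j], 1 - a), (lam + corr[j], 1 + a)):
                if den > eps and eps < num / den < d1:
                    d1, j_in = num / den, j
        # d2: step size if the next event is an exclusion (weight hits zero).
        d2, j_out = lam, None
        for j, d in zip(active, dw):
            if d != 0 and eps < -w[j] / d < d2:
                d2, j_out = -w[j] / d, j
        step = min(d1, d2)
        for j, d in zip(active, dw):
            w[j] += step * d
        lam -= step
        resid = [y[i] - sum(X[i][j] * w[j] for j in range(p)) for i in range(n)]
        corr = [dot(cols[j], resid) for j in range(p)]
        # Update the active feature set.
        if d2 < d1:
            active.remove(j_out)
            w[j_out] = 0.0
        elif j_in is not None:
            active.append(j_in)
        path.append((lam, w[:]))
    return path


path = lasso_path([[1.0, 0.0], [0.0, 1.0]], [3.0, 1.0])
print(path)
```

For graph data, the inner loop over inactive features j is what the pattern search replaces, since the features cannot be enumerated explicitly.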

9
Feature space of patterns

Graph training data
Set of all subgraphs (patterns)
Each graph is represented as a 0/1 vector of pattern indicators

10
Main Search problem

Step size if pattern t is included next
Find pattern that minimizes

constants computed from active set
11
Tree-shaped Search Space

Each node has a pattern
Generate nodes from the root
Add an edge at each step

12
Tree Pruning

If it is guaranteed that the optimal pattern is
not in the downstream, the search tree can be
pruned

Not generated
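
A minimal sketch of pruning in a pattern tree, with two substitutions that should be read as assumptions. Itemsets stand in for subgraphs (the tree grows by adding one item instead of one edge), and the search criterion is the weighted gain |sum_i r_i x_it| with its standard subset-based bound, not the step-size criterion and bound of this talk, whose formulas did not survive extraction. The function name best_pattern and the toy data are hypothetical.

```python
def best_pattern(transactions, r):
    """Find the itemset t maximizing |sum_i r[i] * x_it| by depth-first
    search, pruning subtrees that cannot beat the best score so far."""
    items = sorted(set().union(*transactions))
    best = {"score": 0.0, "pattern": None, "visited": 0}

    def dfs(pattern, support, start):
        for k in range(start, len(items)):
            # Transactions that contain the extended pattern.
            new_support = [i for i in support if items[k] in transactions[i]]
            if not new_support:
                continue
            best["visited"] += 1
            new_pattern = pattern + (items[k],)
            pos = sum(r[i] for i in new_support if r[i] > 0)
            neg = -sum(r[i] for i in new_support if r[i] < 0)
            score = abs(pos - neg)
            if score > best["score"]:
                best["score"], best["pattern"] = score, new_pattern
            # Any descendant is supported by a subset of new_support, so its
            # score is at most max(pos, neg). Descend only if that can win.
            if max(pos, neg) > best["score"]:
                dfs(new_pattern, new_support, k + 1)

    dfs((), list(range(len(transactions))), 0)
    return best


data = [{"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
weights = [1.0, -1.0, 0.5, 2.0]
result = best_pattern(data, weights)
print(result)
```

In this toy run the nodes below ("a", "b") and below ("b",) are never generated, because the bound shows they cannot improve on the best score found so far.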
13
Theorem (Pruning condition)

Traversed up to pattern t
Minimum value so far
No better pattern in the downstream, if

where
14
Reusing the search space

Main search is solved repeatedly with different
parameters
More efficient to reuse the search space in next
iterations
Node generation is expensive due to the minimum
DFS code check
Whole tree of patterns is kept in memory and
progressively extended
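
A minimal sketch of keeping the tree in memory across searches, again with itemsets standing in for subgraphs and a weighted-gain criterion standing in for the talk's step-size criterion (both are assumptions). The classes Node and PatternTree and the function search are hypothetical names. Each node's children are generated at most once and cached, so later searches with different parameters only pay for nodes they newly reach.

```python
class Node:
    def __init__(self, pattern, support):
        self.pattern = pattern    # tuple of items
        self.support = support    # indices of transactions containing it
        self.children = None      # None means "not expanded yet"


class PatternTree:
    def __init__(self, transactions):
        self.transactions = transactions
        self.items = sorted(set().union(*transactions))
        self.root = Node((), list(range(len(transactions))))
        self.generated = 0        # number of nodes created so far

    def expand(self, node):
        """Return the children of node, generating them only on first use."""
        if node.children is None:
            start = self.items.index(node.pattern[-1]) + 1 if node.pattern else 0
            node.children = []
            for item in self.items[start:]:
                support = [i for i in node.support
                           if item in self.transactions[i]]
                if support:
                    node.children.append(Node(node.pattern + (item,), support))
                    self.generated += 1
        return node.children


def search(tree, r):
    """Branch-and-bound search for the pattern maximizing |sum_i r[i] x_it|."""
    best_score, best_pattern = 0.0, None
    stack = [tree.root]
    while stack:
        node = stack.pop()
        for child in tree.expand(node):
            pos = sum(r[i] for i in child.support if r[i] > 0)
            neg = -sum(r[i] for i in child.support if r[i] < 0)
            if abs(pos - neg) > best_score:
                best_score, best_pattern = abs(pos - neg), child.pattern
            if max(pos, neg) > best_score:   # subtree may still contain a winner
                stack.append(child)
    return best_pattern, best_score


tree = PatternTree([{"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}])
first = search(tree, [1.0, -1.0, 0.5, 2.0])
g_first = tree.generated
repeat = search(tree, [1.0, -1.0, 0.5, 2.0])
g_repeat = tree.generated          # unchanged: every node came from the cache
second = search(tree, [-1.0, -1.0, 1.0, 1.0])
g_second = tree.generated          # grows only by the newly reached nodes
print(first, g_first, g_repeat, second, g_second)
```

With real graph patterns the saving is larger than this toy suggests, since each cached node also avoids repeating the minimum DFS code check.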

15
Experiments

Naïve Method
Enumerate all patterns whose number of edges is smaller
than maxpat
Then, LARS-LASSO is applied
CPDB dataset
683 training graphs (chemical compounds)
Classification dataset (mutagenic or not)
Converted to regression problem (y = +1, -1)

16
How to measure the computational cost of our
method

Data divided into 90% train and 10% validation
Record
Number of nodes in tree
Computation time
at the point of minimum validation error

17
Computational Cost
18
Solution Path
19
Events
20
Conclusion

Path following implemented for graph data
Pattern search by the DFS code tree
Hinge loss: to do
Search criterion more complicated
Easily combined with itemset mining, tree mining,
sequence mining

21
gboost MATLAB toolbox

Graph classification by LPBoost and DFS code tree
Includes an implementation of gSpan
www.kyb.mpg.de/people/nowozin/gboost
Path following code will be available soon
