Title: Tree-Based Methods
1 Tree-Based Methods
- ENEE698A Communication Seminar
- Nov. 05, 2003
- He Huang
2 Outline
- Overview of Tree-Based Methods
- Regression Tree
- Classification Tree
- Spam Example (an application of Classification Trees)
3 Overview of Tree-Based Methods
- 1.1 General Tree-Based Methods
- a. Split the feature space into a set of regions.
- b. Fit a simple model (e.g. a constant) in each partition region.
- Problem: some of the resulting regions can be complicated to describe.
- Solution: use recursive binary partitioning.
[Figure: a general partition of the feature space into regions R1-R5, each fit with a constant c1-c5]
4 1.2 Recursive Binary Partition
- a. How it works (an example)
[Figure: an example recursive binary partition of the (x1, x2) plane at split points t1-t4, yielding regions R1-R5 with fitted constants c1-c5, and the corresponding binary tree. Panels: Binary Partition, Binary Tree]
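The binary tree in the figure can be read as a chain of nested threshold tests. Below is a minimal Python sketch of the piecewise-constant predictor such a tree encodes, assuming one plausible reading of the split order in the figure; the function name and the example t and c values are illustrative, not from the slides.

  def predict(x1, x2, t1, t2, t3, t4, c1, c2, c3, c4, c5):
      """Piecewise-constant model encoded by the binary tree in the
      figure (one plausible reading of its split order)."""
      if x1 <= t1:                         # first split, on x1 at t1
          return c1 if x2 <= t2 else c2    # regions R1, R2
      if x1 <= t3:                         # right branch splits on x1 again
          return c3                        # region R3
      return c4 if x2 <= t4 else c5        # regions R4, R5

  # Illustrative call with made-up split points and constants:
  print(predict(0.3, 0.8, t1=0.5, t2=0.5, t3=0.8, t4=0.6,
                c1=1.0, c2=2.0, c3=3.0, c4=4.0, c5=5.0))  # -> 2.0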
5 1.2 Recursive Binary Partition (Cont.)
- b. How to describe the model
- The fitted model predicts a constant in each region:
  f(x) = \sum_{m=1}^{M} c_m I(x \in R_m)
- c_m: the regression model's prediction value corresponding to the region R_m
- 1.3 Basic Issues in Tree-Based Methods
- 1. How to decide the splitting points?
- 2. How to control the size of the tree?
6 2. Regression Tree
- 2.1 Basics for Regression Trees
- For each of N observations, the input is x_i = (x_{i1}, x_{i2}, ..., x_{ip}) and the output is y_i.
- Split the space into M regions R_1, R_2, ..., R_M, and model each region with a constant c_m.
- The optimal value of each c_m is decided by minimizing the sum of squared errors
  \sum_i (y_i - f(x_i))^2
  which gives c_m as the average of the y_i falling in region R_m:
  \hat{c}_m = ave(y_i | x_i \in R_m)
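A quick numerical check of this fact, as a minimal sketch with made-up data: for squared-error loss, the best constant for a region is the mean of the responses falling in it.

  import numpy as np

  y = np.array([1.0, 2.0, 2.5, 4.0])        # responses in one region Rm
  candidates = np.linspace(0.0, 5.0, 501)   # candidate constants c
  sse = [np.sum((y - c) ** 2) for c in candidates]
  best = candidates[np.argmin(sse)]
  print(best, y.mean())                     # both approximately 2.375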
7 2. Regression Tree (Cont.)
- 2.2 How to decide each splitting point, i.e. the pair (j, s)?
- Greedy algorithm: for each splitting variable j, the optimal splitting point s can be decided by solving the criterion (2-2)
  \min_{j,s} [ \min_{c_1} \sum_{x_i \in R_1(j,s)} (y_i - c_1)^2 + \min_{c_2} \sum_{x_i \in R_2(j,s)} (y_i - c_2)^2 ]
  where R_1(j,s) = {x | x_j <= s} and R_2(j,s) = {x | x_j > s}.
- The best pair (j, s) is then decided after going through all splitting variables j (a sketch of this search follows).
8 2. Regression Tree (Cont.)
- A very large tree overfits; a small tree might not capture the structure.
- 2.3 How to control the size of the tree? (Where to stop the splitting?)
- Strategies (compared in the sketch below)
- 1. Split only when the decrease in error exceeds some threshold (short-sighted: a seemingly worthless split may lead to a very good split further down).
- 2. Grow the tree to a pre-defined size, then apply cost-complexity pruning (preferred).
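Both strategies correspond to standard knobs in tree libraries. A minimal scikit-learn sketch, where the threshold and alpha values are illustrative assumptions, not tuned:

  from sklearn.datasets import make_regression
  from sklearn.tree import DecisionTreeRegressor

  X, y = make_regression(n_samples=200, n_features=5, random_state=0)

  # Strategy 1: stop early, splitting only when the impurity decrease
  # exceeds a threshold (the short-sighted rule).
  early_stop = DecisionTreeRegressor(min_impurity_decrease=100.0).fit(X, y)

  # Strategy 2: grow a large tree, then prune it back; ccp_alpha is the
  # cost-complexity tuning parameter alpha of the next slide.
  grown_then_pruned = DecisionTreeRegressor(ccp_alpha=100.0).fit(X, y)

  print(early_stop.get_n_leaves(), grown_then_pruned.get_n_leaves())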
9 2. Regression Tree - Cost-Complexity Pruning
- Pruning: collapse some internal nodes to minimize the cost-complexity criterion (2-3)
  C_\alpha(T) = \sum_{m=1}^{|T|} N_m Q_m(T) + \alpha |T|,
  where Q_m(T) = \frac{1}{N_m} \sum_{x_i \in R_m} (y_i - \hat{c}_m)^2
- The first term is the cost (the sum of squared errors); the second is a penalty on the complexity/size of the tree.
- N_m: number of observations falling in the region R_m
- m: index of the terminal nodes of the binary tree T
- |T|: the number of terminal nodes in T
- \alpha: tuning parameter trading off tree size against goodness of fit
10 2. Regression Tree - Pruning
- For each \alpha, there is a unique smallest tree T_\alpha that minimizes C_\alpha(T).
- To find T_\alpha: weakest-link pruning.
- Each time, collapse the internal node that produces the smallest increase in \sum_m N_m Q_m(T); continue until reaching the single-node tree.
- From this tree sequence, choose the tree T_\alpha for which C_\alpha(T) is minimal.
- Estimate \alpha by five- or tenfold cross-validation: choose the value \hat{\alpha} that minimizes the cross-validated sum of squares. The final tree is T_{\hat{\alpha}} (see the sketch below).
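scikit-learn implements this same weakest-link scheme; a minimal sketch, assuming five-fold cross-validated squared error as the selection criterion:

  import numpy as np
  from sklearn.datasets import make_regression
  from sklearn.model_selection import cross_val_score
  from sklearn.tree import DecisionTreeRegressor

  X, y = make_regression(n_samples=300, n_features=5, noise=10.0, random_state=0)

  # Weakest-link pruning: ccp_alphas holds the alpha at which each
  # successive internal-node collapse occurs, down to the root tree.
  path = DecisionTreeRegressor(random_state=0).cost_complexity_pruning_path(X, y)

  # Choose alpha by five-fold cross-validated squared error.
  scores = [cross_val_score(DecisionTreeRegressor(ccp_alpha=a, random_state=0),
                            X, y, cv=5,
                            scoring="neg_mean_squared_error").mean()
            for a in path.ccp_alphas]
  best_alpha = path.ccp_alphas[int(np.argmax(scores))]
  final_tree = DecisionTreeRegressor(ccp_alpha=best_alpha,
                                     random_state=0).fit(X, y)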
11 3. Classification Trees
- 3.1 Basics of Classification Trees
- a. For each observation, the output y_i takes values in 1, 2, ..., K (not continuous values as in regression trees).
- R_m is the partition region corresponding to the terminal node m, with N_m observations. The proportion of class-k observations in node m is
  \hat{p}_{mk} = \frac{1}{N_m} \sum_{x_i \in R_m} I(y_i = k)
- b. The majority class in node m is
  k(m) = \arg\max_k \hat{p}_{mk}
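A minimal sketch of these two quantities for a single node, with made-up labels:

  import numpy as np

  labels_in_node = np.array([1, 1, 2, 1, 3, 1])   # y_i values in node m, K = 3
  classes, counts = np.unique(labels_in_node, return_counts=True)
  p_mk = counts / counts.sum()                    # class proportions p_mk
  k_m = classes[np.argmax(p_mk)]                  # majority class k(m)
  print(dict(zip(classes, np.round(p_mk, 3))), k_m)   # {1: 0.667, 2: 0.167, 3: 0.167} 1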
12 3. Classification Trees (Cont.)
- 3.2 The criteria applied for splitting and pruning
- Misclassification error: 1 - \hat{p}_{m,k(m)}
- Gini index: \sum_k \hat{p}_{mk} (1 - \hat{p}_{mk})
- Cross-entropy or deviance: -\sum_k \hat{p}_{mk} \log \hat{p}_{mk}
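The three criteria are direct to compute from a node's class proportions; a minimal sketch of the formulas above (not a library API):

  import numpy as np

  def node_impurities(p):
      """Misclassification error, Gini index, and cross-entropy for a
      vector of class proportions p_mk (assumed to sum to 1)."""
      p = np.asarray(p, dtype=float)
      misclass = 1.0 - p.max()                        # 1 - p_{m,k(m)}
      gini = np.sum(p * (1.0 - p))                    # sum_k p (1 - p)
      entropy = -np.sum(p[p > 0] * np.log(p[p > 0]))  # treat 0 log 0 as 0
      return misclass, gini, entropy

  print(node_impurities([0.5, 0.5]))   # all three maximal at p = 1/2
  print(node_impurities([0.9, 0.1]))   # purer node: all three smaller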
13 3. Classification Trees (Cont.)
- [Figure: the three cost criteria for 2-class classification, plotted as functions of the proportion p in one class]
- Cross-entropy and the Gini index are more sensitive to changes in the node probabilities than misclassification error, which is why they are preferred for growing the tree.
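The lost figure is straightforward to regenerate; a minimal matplotlib sketch of the 2-class criteria (Gini and cross-entropy are sometimes rescaled to pass through (0.5, 0.5); they are plotted unscaled here):

  import numpy as np
  import matplotlib.pyplot as plt

  p = np.linspace(0.001, 0.999, 200)                 # proportion in class 1
  plt.plot(p, 1 - np.maximum(p, 1 - p), label="Misclassification error")
  plt.plot(p, 2 * p * (1 - p), label="Gini index")
  plt.plot(p, -p * np.log(p) - (1 - p) * np.log(1 - p), label="Cross-entropy")
  plt.xlabel("p"); plt.legend(); plt.show()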
14 4. Spam Example - Classification Tree
- 4601 observations
- Inputs: 58 attributes indicating whether a particular word or character is frequently used in the email.
- Output: spam or non-spam
- The purpose: to build a spam filter.
- Grow the tree by cross-entropy; prune the tree by misclassification error (a sketch follows).
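A minimal scikit-learn sketch of this recipe. It assumes the UCI spambase data is available on OpenML under the name used below; note also that scikit-learn prunes with the training criterion rather than misclassification error, so the pruning step only approximates the slides', and ccp_alpha is an illustrative value that would be chosen by cross-validation as on slide 10.

  from sklearn.datasets import fetch_openml
  from sklearn.model_selection import train_test_split
  from sklearn.tree import DecisionTreeClassifier

  # Assumption: spambase (4601 emails, word/character-frequency
  # features, spam label) is fetched from OpenML.
  X, y = fetch_openml("spambase", version=1, return_X_y=True, as_frame=False)
  X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

  # Grow by cross-entropy, then cost-complexity prune.
  tree = DecisionTreeClassifier(criterion="entropy", ccp_alpha=1e-3,
                                random_state=0).fit(X_train, y_train)
  print("test error:", 1 - tree.score(X_test, y_test))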
15 [Figure: no transcript available]
16 Spam Example - Results
- Confusion matrix of the final tree (percentages of all observations):

                 Predicted
  True           email    spam
  email          57.3%    4.0%
  spam            5.3%   33.4%

- Overall error rate is 8.7%.
17 References
- 1. T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer.
- 2. B. D. Ripley, Pattern Recognition and Neural Networks. Cambridge University Press.
- 3. L. Breiman, J. Friedman, R. Olshen, and C. Stone, Classification and Regression Trees. Wadsworth.