Sparse, Flexible and Efficient Modeling using L1-Regularization

1
Sparse, Flexible and Efficient Modeling using
L1-Regularization
  • Saharon Rosset and Ji Zhu

2
Contents
  1. Idea
  2. Algorithm
  3. Results

3
  • Part 1: Idea

4
Introduction
  • Setting: training data (x_i, y_i), i = 1..n; the fitted model depends implicitly on the training data
  • Linear model (using basis functions φ_j): f(x) = Σ_j w_j φ_j(x)
  • Model fit: ŵ(λ) = argmin_w L(w) + λJ(w), with loss L(w) = Σ_i l(y_i, f(x_i)) and penalty J(w)

5
Introduction
  • Problem: how to choose the weight λ of the regularization?
  • Answer: find ŵ(λ) for all λ ∈ [0, ∞)
  • Can this be done efficiently (time, memory)?
  • Yes, if we impose restrictions on the solution path ŵ(λ) (see the sketch below)
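
As a concrete illustration, here is a minimal sketch on synthetic data using scikit-learn's lasso_path; it approximates the full path on a grid of λ values, whereas the method presented here traces the path exactly:

```python
# Minimal sketch (synthetic data): compute the entire L1 path in one call.
import numpy as np
from sklearn.linear_model import lasso_path

rng = np.random.default_rng(0)
X = rng.standard_normal((50, 300))        # n = 50 samples, 300 features
w_true = np.zeros(300)
w_true[:3] = 2.0                          # sparse ground truth
y = X @ w_true + 0.5 * rng.standard_normal(50)

alphas, coefs, _ = lasso_path(X, y)       # coefs has shape (n_features, n_alphas)
nnz = (np.abs(coefs) > 1e-10).sum(axis=0) # sparsity along the path
for a, k in list(zip(alphas, nnz))[::20]:
    print(f"lambda = {a:8.4f}   non-zero components = {k}")
```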

6
Restrictions
  • The solution path ŵ(λ) shall be piecewise linear in λ
  • What impact does this have on L(w) and J(w)?
  • Can we still solve real-world problems?

7
Restrictions
  • The path derivative ∂ŵ(λ)/∂λ must be piecewise constant in λ
  • This holds when L(w) is quadratic in w
  • and J(w) is linear in w

8
Quadratic Loss Functions
  • squared-error loss in regression
  • (Huberized) hinge loss for classification (SVM); both are sketched below
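
A minimal sketch of both losses as functions of the residual/margin r. The Huberized form of the hinge (quadratic near the margin, linear in the tail) is the one used later on the NIPS slides; the knot t = -1 is an assumption, not fixed by the slides:

```python
import numpy as np

def squared_loss(r):
    """Squared-error loss of the residual r = y - f(x): quadratic everywhere."""
    return r ** 2

def huberized_hinge(r, t=-1.0):
    """Huberized hinge loss of the margin r = y * f(x): zero for r > 1,
    quadratic on [t, 1], linear below the knot t (continuously differentiable)."""
    r = np.asarray(r, dtype=float)
    out = np.where(r > 1.0, 0.0, (1.0 - r) ** 2)
    return np.where(r < t, (1.0 - t) ** 2 + 2.0 * (1.0 - t) * (t - r), out)
```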

9
Linear Penalty Functions
  • L1 penalty J(w) = ||w||_1 = Σ_j |w_j| is piecewise linear in w
  • Sparseness property: many components of ŵ(λ) are exactly zero (one-dimensional sketch below)
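
The sparseness property is easiest to see in one dimension: the proximal operator of the L1 penalty is the soft-threshold, which sets every coefficient with |z| ≤ λ exactly to zero. A minimal sketch:

```python
import numpy as np

def soft_threshold(z, lam):
    """Solution of min_w 0.5*(w - z)**2 + lam*|w| (the prox of the L1 penalty).
    Every |z| <= lam is mapped exactly to zero -- the source of sparsity."""
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)

print(soft_threshold(np.array([-2.0, -0.3, 0.1, 1.5]), lam=0.5))
# -> [-1.5 -0.   0.   1. ]
```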

10
Bet on Sparseness
  • 50 samples with 300 independent Gaussian
    variables
  • Row 1: 3 non-zero variables
  • Row 2: 30 non-zero variables
  • Row 3: 300 non-zero variables
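
A hedged sketch of this experiment's setup; the coefficient values and noise level are assumptions, and LassoCV/RidgeCV stand in for the L1 and L2 fits compared in the original figure:

```python
# 50 samples, 300 independent Gaussian variables, and 3 / 30 / 300
# non-zero coefficients per row of the figure.
import numpy as np
from sklearn.linear_model import LassoCV, RidgeCV

rng = np.random.default_rng(1)
n, p = 50, 300
for k in (3, 30, 300):
    w = np.zeros(p)
    w[:k] = rng.standard_normal(k)        # assumed coefficient values
    X = rng.standard_normal((n, p))
    y = X @ w + rng.standard_normal(n)    # assumed noise level
    X_test = rng.standard_normal((1000, p))
    y_test = X_test @ w
    for name, model in (("lasso (L1)", LassoCV(cv=5)), ("ridge (L2)", RidgeCV())):
        mse = np.mean((model.fit(X, y).predict(X_test) - y_test) ** 2)
        print(f"k = {k:3d}   {name}: test MSE = {mse:7.2f}")
```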

11
  • Part 2: Algorithm

12
Linear Toolbox
  • Write the loss per observation as l(r) = a(r)·r² + b(r)·r + c(r)
  • a(r), b(r) and c(r): piecewise constant coefficients
  • Regression: r = y − w^T φ(x) (residual)
  • Classification: r = y · w^T φ(x) (margin); worked coefficients below
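
To make the representation concrete, the piecewise-constant coefficients of the Huberized hinge loss (knot t = -1 assumed, as before) can be read off by expanding each piece:

```python
def huberized_hinge_abc(r, t=-1.0):
    """Piecewise-constant coefficients (a, b, c) with l(r) = a*r**2 + b*r + c,
    obtained by expanding the Huberized hinge loss on each of its three pieces."""
    if r > 1.0:
        return 0.0, 0.0, 0.0                    # flat region: zero loss
    if r >= t:
        return 1.0, -2.0, 1.0                   # (1 - r)**2 = r**2 - 2r + 1
    return 0.0, -2.0 * (1.0 - t), 1.0 - t * t   # linear tail below the knot

# Sanity check at r = -2 (with t = -1 the direct formula gives 8.0):
a, b, c = huberized_hinge_abc(-2.0)
assert abs(a * 4.0 + b * (-2.0) + c - 8.0) < 1e-12
```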

13
Optimization Problem
  • ŵ(λ) = argmin_w Σ_i l(r_i) + λ||w||_1

14
Algorithm Initialization
  • start at t = 0 with w = 0
  • determine the initial set of non-zero components (the active set)
  • compute the starting direction

15
Algorithm Loop
  • follow the direction until one of the following happens:
  • a new component is added to the active set
  • a non-zero component vanishes (hits zero)
  • a knot is hit (a discontinuity of a(r), b(r), c(r))

16
Algorithm Loop
  • update the direction after each event
  • check the stopping criterion (a homotopy sketch follows below)
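
For squared loss, the loop above is exactly the LARS-lasso homotopy; a sketch using scikit-learn's lars_path, whose returned alphas are the knots between the linear pieces:

```python
import numpy as np
from sklearn.linear_model import lars_path

rng = np.random.default_rng(2)
X = rng.standard_normal((50, 300))
w_true = np.zeros(300)
w_true[:3] = 2.0
y = X @ w_true + 0.1 * rng.standard_normal(50)

# method="lasso" lets variables enter *and* leave the active set --
# the two event types above; the alphas are the knots between pieces.
alphas, active, coefs = lars_path(X, y, method="lasso")
print(f"{len(alphas) - 1} linear pieces, final active set size {len(active)}")
```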

17
  • Part 3: Results

18
NIPS Results
  • General procedure:
  • pre-selection (univariate t-statistic; sketch below)
  • algorithm loss function: Huberized hinge loss
  • find the best λ based on a validation dataset
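
A sketch of the pre-selection step; t_statistic_preselect is a hypothetical helper (not from the slides) that ranks features by a per-feature Welch t-statistic, assuming binary labels in {-1, +1}:

```python
import numpy as np

def t_statistic_preselect(X, y, k):
    """Keep the k features with the largest absolute two-sample (Welch)
    t-statistic; assumes binary labels y in {-1, +1}."""
    pos, neg = X[y == 1], X[y == -1]
    mean_diff = pos.mean(axis=0) - neg.mean(axis=0)
    se = np.sqrt(pos.var(axis=0, ddof=1) / len(pos)
                 + neg.var(axis=0, ddof=1) / len(neg) + 1e-12)
    return np.argsort(-np.abs(mean_diff / se))[:k]

rng = np.random.default_rng(3)
X = rng.standard_normal((300, 20000))        # Dexter-sized toy data
y = rng.choice([-1, 1], size=300)
keep = t_statistic_preselect(X, y, k=1152)   # 1152 as on the Dexter slide
print(X[:, keep].shape)                      # (300, 1152)
```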

19
NIPS Results
  • Dexter dataset
  • m = 300 samples, n = 20,000 features; pre-selection reduces this to n = 1152
  • the solution path consists of 452 linear pieces
  • the optimum has 120 non-zero components

20
NIPS Results
  • Not very happy with the results
  • working with the original variables: simple linear model with L1 regularization for feature selection

21
Conclusion
  • theory and practice
  • limited to linear classifiers
  • other extensions: Regularization Path for the SVM (L2)