Title: MURI Meeting July 2002
1Convex Optimization in Machine Learning
- MURI MeetingJuly 2002
- Gert Lanckriet (gert_at_eecs.berkeley.edu)
- L. El Ghaoui, M. Jordan, C. Bhattacharrya, N.
Cristianini, P. Bartlett - U.C. Berkeley
2Convex Optimization in Machine Learning
3Advanced Convex Optimization in Machine Learning
SDP
SOCP
QCQP
QP
LP
4Advanced Convex Optimization in Machine Learning
5Linear Programming (LP)
6Quadratic Programming (QP)
7Quadratic Constrained Quadratic Programming (QCQP)
8Second Order Cone Programming (SOCP)
9Semi-Definite Programming
10Advanced Convex Optimization in Machine Learning
11MPM Problem Sketch (1)
aT z b decision hyperplane
12MPM Problem Sketch (2)
13MPM Problem Sketch (3)
14MPM Main Result (1)
?
?
Marshall Olkin / Popescu Bertsimas
15MPM Main Result (2)
16MPM Main Result (3)
Lemma
17MPM Main Result (4)
Probabilistic Constraint
Lemma
Deterministic Constraint
18MPM Main Result (5)
19MPM Geometric Interpretation
20MPM Link with FDA (1)
21MPM Link with FDA (2)
22MPM Link with FDA (3)
23Robustness to Estimation Errors Robust MPM
(R-MPM)
24Robust MPM (R-MPM)
25Robust MPM (R-MPM)
26MPM Convex Optimization to solve the problem
Lemma
Linear Classifier
Convex Optimization Second Order Cone Program
(SOCP)
Kernelizing
Nonlinear Classifier
) competitive with Quadratic Program (QP) SVMs
27MPM Empirical results
a1b and TSA (test-set accuracy) of the MPM,
compared to BPB (best performance in Breiman's
report (Arcing classifiers, 1996)) and SVMs.
(averages for 50 random partitions into 90
training and 10 test sets)
- Comparable with existing literature, SVMs
- a1-b is indeed smaller than the test-set
accuracy in all cases (consistent with b as
worst-case bound on probability of
misclassification) - Kernelizing leads to more powerfull decision
boundaries (alinear decision boundary lt
anonlinear decision boundary (Gaussian kernel))
28Conclusions
29Future directions
30Advanced Convex Optimization in Machine Learning
31The idea (1)
32The idea (2)
33The idea (3)
34The idea (4)
35The idea (5)
36Hard margin SVM classifiers (1)
37Hard margin SVM classifiers (2)
38Hard margin SVM classifiers (3)
39Hard margin SVM classifiers (4)
40Hard margin SVM classifiers (5)
SDP !
41Hard margin SVM classifiers (6)
Optimization
Learning the kernel matrix !
42Hard margin SVM classifiers (7)
training set (labelled)
test set (unlabelled)
Learning the kernel matrix !
43Hard margin SVM classifiers (8)
?
44Hard margin SVM classifiers (9)
45Hard margin SVM classifiers (9)
46Hard margin SVM classifiers (9)
47Hard margin SVM classifiers (10)
48Hard margin SVM classifiers (11)
Learning Kernel Matrix with SDP !
49Empirical results hard margin SVMs
50Conclusions and future directions
51Conclusions and future directions
52See also