Title: Principal Components
1Principal Components
- Shyh-Kang Jeng
- Department of Electrical Engineering/
- Graduate Institute of Communication/
- Graduate Institute of Networking and Multimedia
2Concept of Principal Components
x2
x1
3Principal Component Analysis
- Explain the variance-covariance structure of a
set of variables through a few linear
combinations of these variables - Objectives
- Data reduction
- Interpretation
- Does not need normality assumption in general
4Principal Components
5Result 8.1
6Proof of Result 8.1
7Result 8.2
8Proof of Result 8.2
9Proportion of Total Variance due to the kth
Principal Component
10Result 8.3
11Proof of Result 8.3
12Example 8.1
13Example 8.1
14Example 8.1
15Geometrical Interpretation
16Geometric Interpretation
17Standardized Variables
18Result 8.4
19Proportion of Total Variance due to the kth
Principal Component
20Example 8.2
21Example 8.2
22Principal Components for Diagonal Covariance
Matrix
23Principal Components for a Special Covariance
Matrix
24Principal Components for a Special Covariance
Matrix
25Sample Principal Components
26Sample Principal Components
27Example 8.3
28Example 8.3
29Scree Plot to Determine Number of Principal
Components
30Example 8.4 Pained Turtles
31Example 8.4
32Example 8.4 Scree Plot
33Example 8.4 Principal Component
- One dominant principal component
- Explains 96 of the total variance
- Interpretation
34Geometric Interpretation
35Standardized Variables
36Principal Components
37Proportion of Total Variance due to the kth
Principal Component
38Example 8.5 Stocks Data
- Weekly rates of return for five stocks
- X1 Allied Chemical
- X2 du Pont
- X3 Union Carbide
- X4 Exxon
- X5 Texaco
39Example 8.5
40Example 8.5
41Example 8.6
- Body weight (in grams) for n150 female mice were
obtained after the birth of their first 4 litters
42Example 8.6
43Comment
- An unusually small value for the last eigenvalue
from either the sample covariance or correlation
matrix can indicate an unnoticed linear
dependency of the data set - One or more of the variables is redundant and
should be deleted - Example x4 x1 x2 x3
44Check Normality and Suspect Observations
- Construct scatter diagram for pairs of the first
few principal components - Make Q-Q plots from the sample values generated
by each principal component - Construct scatter diagram and Q-Q plots for the
last few principal components
45Example 8.7 Turtle Data
46Example 8.7
47Large Sample Distribution for Eigenvalues and
Eigenvectors
48Confidence Interval for li
49Approximate Distribution of Estimated Eigenvectors
50Example 8.8
51Testing for Equal Correlation
52Example 8.9
53Monitoring Stable Process Part 1
54Example 8.10Police Department Data
First two sample cmponents explain 82 of the
total variance
55Example 8.10Principal Components
56Example 8.1095 Control Ellipse
57Monitoring Stable Process Part 2
58Example 8.11T2 Chart for Unexplained Data
59Example 8.12 Control Ellipse for Future Values
Example 8.10 data after dropping out-of-control
case
60Example 8.12 99 Prediction Ellipse
61Avoiding Computation with Small Eigenvalues