Variance stabilization for microarray data - PowerPoint PPT Presentation

1 / 21
About This Presentation
Title:

Variance stabilization for microarray data

Description:

Normalization is the process of describing and removing such variation ... 'Spoil the data' Plate specific effect: Random effect. Labeling effect. Scanner effect ... – PowerPoint PPT presentation

Number of Views:22
Avg rating:3.0/5.0
Slides: 22
Provided by: WinC9
Category:

less

Transcript and Presenter's Notes

Title: Variance stabilization for microarray data


1
Variance stabilization for microarray data in
comparison with various other normalization
strategies part 1 Christine Steinhoff Microa
rrays/Hybridization/Northern Ulrike
Nuber Bettina Lipkowitz H.-Hilger
Ropers Max Planck Institute for Molecular
Genetics Berlin, Germany
2
Types of Arrays
Red/Green experiments
Affymetrix Chips
Radioactive filters
3
Data Analysis - Procedure
Starting with Image Processing Scanner output
information about spot intensities local
background pins PCR plates localization
standard deviation...
Quality Check Are there any effects due to
pins PCR plates local effects ...
4
What is Normalization ?
Systematic Variation in Microarray Experiments
- Saturation (Scanner Labeling) -
Nonlinearity of Cy5, Cy3 Labeling - Efficiencies
of Cy5, Cy3 Labeling - Variation of
Low-Intensities - Pins - PCR Plates - Local
Effects ...
Normalization is the process of describing and
removing such variation
5
Some Examples
6
Unequal Variance Across Intensity Range
Logratio
Product intensity (logscale)
7
Normalization Methods
User Defined Sets Housekeeping (?!) Controls
etc useful for Most Genes Changed- Settings
Entire Dataset useful for Most Genes
Unchanged- Settings
8
Normalization Strategies
Local Regression determine regression lines
locally
9
Normalization Strategies
10
Comparison of Normalization Strategies
1 maximal differential genes (red, 138 genes)
discarding 5 lowest expressed genes (green, 691
genes) before log product vs. log ratio of
normalized intensities
No Normalization
Linear Regression
Local Regression
Overall Median
Zscore
ANOVA
Variance Stabilization
11
Comparison of Normalization Strategies
Goal Detection of differentially expressed
genes Set of 30 maximal differential genes out
of 13824
12
Comparison of Normalization Strategies
Goal Detection of Differentially expressed Genes
Var Stab ANOVA Lin Regr Least Med Local
Regr Mean Median Shorth Zscore Raw
distance d(i,j) N - shared genes
13
Comparison of Normalization Strategies
Goal Detection of Differentially expressed Genes
Var Stab ANOVA Lin Regr Least Med Local
Regr Mean Median Shorth Zscore Raw
d(i,j) 1 - 6/(N(N2-1)) ?k,l1...N d(i,j)k,l
genes ordered by abs(logratio) d(i,j)k,l
rank(genek)-rank(genel) if exists
N1 else
14
Comparison of Normalization Strategies
Dataset three repetitions of one dye swap
experiment (6) followed by Northern blot
verification
Normalization strategies
Biological Evaluation (a) Northern Blotting
(b) quant. RT PCR (c) SAGE library (d)
quantifiable controls
15
Goal Biological Evaluation
Microarray Ratios
quantifiable method different from microarray
16
Three Dye Swap experiments
experiment 1
experiment 2
experiment 3
swap 1
genes empties housekeping plant
swap 2
17
Biological Evaluation
RAW DATA
LogRatio
product of intensity (logScale)
18
Comparison of Normalization Strategies
(1) 0.845
(2) 0.845
(7) 0.854
(3) 0.859
(4) 0.851
(1) Raw data (2) median (3) ZScore (4) Overall
(linear) Regression (5) Local Regression (6)
Variance Stabilization (7) ANOVA
(5) 0.851
(6) 0.853
19
Spoil the data
Plate specific effect Random
effect Labeling effect Scanner effect
20
Normalize again
21
Compare to Northern Results
no/mean ZScore Local R
Lin R ANOVA VarStab
raw random effect on 5 spots labeling
scatter in low intensity
correlation coefficient
Write a Comment
User Comments (0)
About PowerShow.com