Two%20Color%20Microarrays - PowerPoint PPT Presentation

About This Presentation

Title:

Two%20Color%20Microarrays

Description:

Two Color Microarrays – PowerPoint PPT presentation

Number of Views:33

Avg rating:3.0/5.0

Slides: 28

Provided by: davidm133

Learn more at: https://dmrocke.ucdavis.edu

Category:

more less

Transcript and Presenter's Notes

Title: Two%20Color%20Microarrays

1
Two Color Microarrays

EPP 245
Statistical Analysis of
Laboratory Data

2
Two-Color Arrays

Two-color arrays are designed to account for
variability in slides and spots by using two
samples on each slide, each labeled with a
different dye.
If a spot is too large, for example, both signals
will be too big, and the difference or ratio will
eliminate that source of variability

3
Dyes

The most common dye sets are Cy3 (green) and Cy5
(red), which fluoresce at approximately 550 nm
and 649 nm respectively (red light 700 nm,
green light 550 nm)
The dyes are excited with lasers at 532 nm (Cy3
green) and 635 nm (Cy5 red)
The emissions are read via filters using a CCD
device

4
(No Transcript)
5
(No Transcript)
6
(No Transcript)
7
File Format

A slide scanned with Axon GenePix produces a file
with extension .gpr that contains the
resultshttp//www.axon.com/gn_GenePix_File_Forma
ts.html
This contains 29 rows of headers followed by 43
columns of data (in our example files)
For full analysis one may also need a .gal file
that describes the layout of the arrays

8
"Block" "Column" "Row" "Name" "ID" "X" "Y" "Dia." "F635 Median" "F635 Mean" "F635 SD" "B635 Median" "B635 Mean" "B635 SD" " gt B6351SD" " gt B6352SD" "F635 Sat." "F532 Median" "F532 Mean" "F532 SD" "B532 Median" "B532 Mean" "B532 SD" " gt B5321SD" " gt B5322SD" "F532 Sat." "Ratio of Medians (635/532)" "Ratio of Means (635/532)" "Median of Ratios (635/532)" "Mean of Ratios (635/532)" "Ratios SD (635/532)" "Rgn Ratio (635/532)" "Rgn R² (635/532)" "F Pixels" "B Pixels" "Sum of Medians" "Sum of Means" "Log Ratio (635/532)" "F635 Median - B635" "F532 Median - B532" "F635 Mean - B635" "F532 Mean - B532" "Flags"
9
Analysis Choices

Mean or median foreground intensity
Background corrected or not
Log transform (base 2, e, or 10) or glog
transform
Log is compatible only with no background
correction
Glog is best with background correction

10
Array normalization

Array normalization is meant to increase the
precision of comparisons by adjusting for
variations that cover entire arrays
Without normalization, the analysis would be
valid, but possibly less sensitive
However, a poor normalization method will be
worse than none at all.

11
Possible normalization methods

We can equalize the mean or median intensity by
adding or multiplying a correction term
We can use different normalizations at different
intensity levels (intensity-based normalization)
for example by lowess or quantiles
We can normalize for other things such as print
tips

12
Example for Normalization
Group 1 Group 1 Group 2 Group 2
Array 1 Array 2 Array 3 Array 4
Gene 1 1100 900 425 550
Gene 2 110 95 85 110
Gene 3 80 65 55 80
13
gt normex lt- matrix(c(1100,110,80,900,95,65,425,85,
55,550,110,80),ncol4) gt normex ,1 ,2
,3 ,4 1, 1100 900 425 550 2, 110 95
85 110 3, 80 65 55 80 gt group lt-
as.factor(c(1,1,2,2)) gt anova(lm(normex1,
group)) Analysis of Variance Table Response
normex1, Df Sum Sq Mean Sq F value
Pr(gtF) group 1 262656 262656 18.888
0.04908 Residuals 2 27812 13906
--- Signif. codes 0 ' 0.001 ' 0.01
' 0.05 .' 0.1 ' 1
14
gt anova(lm(normex2, group)) Analysis of
Variance Table Response normex2,
Df Sum Sq Mean Sq F value Pr(gtF) group 1
25.0 25.0 0.1176 0.7643 Residuals 2 425.0
212.5 gt anova(lm(normex3,
group)) Analysis of Variance Table Response
normex3, Df Sum Sq Mean Sq F value
Pr(gtF) group 1 25.0 25.0 0.1176
0.7643 Residuals 2 425.0 212.5
15
Additive Normalization by Means
Group 1 Group 1 Group 2 Group 2
Array 1 Array 2 Array 3 Array 4
Gene 1 975 851 541 608
Gene 2 -15 46 201 168
Gene 3 -45 16 171 138
16
gt cmn lt- apply(normex,2,mean) gt cmn 1 430.0000
353.3333 188.3333 246.6667 gt mn lt- mean(cmn) gt
normex - rbind(cmn,cmn,cmn)mn ,1
,2 ,3 ,4 cmn 974.58333 851.25 541.25
607.9167 cmn -15.41667 46.25 201.25 167.9167 cmn
-45.41667 16.25 171.25 137.9167 gt normex.1 lt-
normex - rbind(cmn,cmn,cmn)mn
17
gt anova(lm(normex.11, group)) Analysis of
Variance Table Response normex.11,
Df Sum Sq Mean Sq F value Pr(gtF) group 1
114469 114469 23.295 0.04035 Residuals 2
9828 4914 gt
anova(lm(normex.12, group)) Analysis of
Variance Table Response normex.12,
Df Sum Sq Mean Sq F value Pr(gtF) group 1
28617.4 28617.4 23.295 0.04035 Residuals 2
2456.9 1228.5 gt
anova(lm(normex.13, group)) Analysis of
Variance Table Response normex.13,
Df Sum Sq Mean Sq F value Pr(gtF) group 1
28617.4 28617.4 23.295 0.04035 Residuals 2
2456.9 1228.5
18
Multiplicative Normalization by Means
Group 1 Group 1 Group 2 Group 2
Array 1 Array 2 Array 3 Array 4
Gene 1 779 776 687 679
Gene 2 78 82 137 136
Gene 3 57 56 89 99
19
gt normexmn/rbind(cmn,cmn,cmn) ,1
,2 ,3 ,4 cmn 779.16667 775.82547
687.33407 679.13851 cmn 77.91667 81.89269
137.46681 135.82770 cmn 56.66667 56.03184
88.94912 98.78378 gt normex.2 lt-
normexmn/rbind(cmn,cmn,cmn) gt anova(lm(normex.21
, group)) Response normex.21,
Df Sum Sq Mean Sq F value Pr(gtF) group
1 8884.9 8884.9 453.71 0.002197 Residuals 2
39.2 19.6 gt
anova(lm(normex.22, group)) Response
normex.22, Df Sum Sq Mean Sq F value
Pr(gtF) group 1 3219.7 3219.7 696.33
0.001433 Residuals 2 9.2 4.6
gt anova(lm(normex.23,
group)) Response normex.23, Df
Sum Sq Mean Sq F value Pr(gtF) group 1
1407.54 1407.54 57.969 0.01682 Residuals 2
48.56 24.28
20
Multiplicative Normalization by Medians
Group 1 Group 1 Group 2 Group 2
Array 1 Array 2 Array 3 Array 4
Gene 1 1000 947 500 500
Gene 2 100 100 100 100
Gene 3 73 68 65 73
21
gt cmd lt- apply(normex,2,median) gt cmd 1 110 95
85 110 gt normex.3 lt- normexmd/rbind(cmd,cmd,cmd)
gt normex.3 ,1 ,2 ,3
,4 cmd 1000.00000 947.36842 500.00000
500.00000 cmd 100.00000 100.00000 100.00000
100.00000 cmd 72.72727 68.42105 64.70588
72.72727 gt anova(lm(normex.31,
group)) Response normex.31, Df Sum
Sq Mean Sq F value Pr(gtF) group 1
224377 224377 324 0.003072 Residuals 2
1385 693 gt
anova(lm(normex.32, group)) Response
normex.32, Df Sum Sq Mean Sq F value
Pr(gtF) group 1 0 0
Residuals 2 0 0 gt
anova(lm(normex.33, group)) Response
normex.33, Df Sum Sq Mean Sq F value
Pr(gtF) group 1 3.451 3.451 0.1665
0.7228 Residuals 2 41.443 20.722
22
Intensity-based normalization

Normalize by means, medians, etc., but do so only
in groups of genes with similar expression
levels.
lowess is a procedure that produces a running
estimate of the middle, like a robustified mean
If we subtract the lowess of each array and add
the average of the lowesss, we get the lowess
normalization

23
norm lt- function(mat1) mat2 lt-
as.matrix(mat1) p lt- dim(mat2)1 n lt-
dim(mat2)2 cmean lt- apply(mat2,2,mean)
cmean lt- cmean - mean(cmean) mnmat lt-
matrix(rep(cmean,p),byrowT,ncoln)
return(mat2-mnmat)
24
lnorm lt- function(mat1,span.1) mat2 lt-
as.matrix(mat1) p lt- dim(mat2)1 n lt-
dim(mat2)2 rmeans lt- apply(mat2,1,mean)
rranks lt- rank(rmeans,ties.method"first")
matsort lt- mat2order(rranks), r0 lt- 1p
lcol lt- function(x) lx lt-
lowess(r0,x,fspan)y lmeans lt-
apply(matsort,2,lcol) lgrand lt-
apply(lmeans,1,mean) lgrand lt-
matrix(rep(lgrand,n),byrowF,ncoln) matnorm0
lt- matsort-lmeanslgrand matnorm1 lt-
matnorm0rranks, return(matnorm1)
25
(No Transcript)
26
(No Transcript)
27
(No Transcript)

Write a Comment

User Comments (0)