Title: A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data
1A robust neural networks approach for spatial and
intensity-dependent normalization of cDNA
microarray data
- A.L. Tarca, J.E.K. Cooke and J. MacKay
Presented by Dana Mohamed
2Microarrays
3Importance of Microarrays (and that the
data is correct)
- Assumption that microarray data linearly reflects
amount of mRNA present in cell - In turn, reflects gene expression levels
- If the data is incorrect,
- So is our interpretation of gene expression
- And therefore all the science built on that
interpretation is also incorrect
4Where error is
- Intensity of Fluorescence
- Overall imbalance of dye intensity
- 2 dyes Cy5 (R) and Cy3 (G)
- If R G expressed at equal levels, R/G 1
- Space
- Intensities variable on coordinates
- Can be dirty on sides of microarray
5Previous Methods
- Many address intensity bias
- Few address spatial bias
- Most rely on M M m
- M is the normalized values
- M is the raw log-ratio (M log2R/G)
- m is the estimate of the bias
6Important Variables
- M log2(R/G)
- Log ratio converts multiplicative error to
additive error - A (1/2)0.5log2RG
- Average of the log-intensities
- Minus-add plots
- M vs. A
- Useful for assessing systematic bias
7Calculating m in other methods
- gMed global median normalization
- m median(Mi)
- Mi are all the values of M
- pLo print tip loess
- m ci (A)
- pLoGS
- found in GeneSight biodiscovery.com
- Local group median (3x3 square regions)
- print tip loess
- cPLo2D - print tip loess pure 2D normalization
- BioConductor bioconductor.org
- m a ci (A) ß ci (SpotRow,SpotCol)
- ci (SpotRow,SpotCol) is the loess estimate of M
using spot row and column coordinates inside the
ith print tip - gLoMedF
- global loess normalization
- spatial median filter
8Robust Neural Networks Technique
- pNN2DA print tip robust neural nets 2D and A
- Attempt to find the best fit of M using A and the
2-D space coordinates of the spots - m ci (A,X,Y)
- Instead of using individual print tips use 3x3
bins of them X and Y - Accounts for spatial bias
9Neural Nets Terminology
- Uses multi-layer feedforward network
- Sigmoid Function
10Neural Networks
- Uses multi-layer feedforward network
- x is the vector (X,Y,A,1),
- I 3,
- w are the weights,
- sigma one represents the hidden neurons and they
are sigmoid functions, - sigma two is the single neuron in the output
layer, which is also sigmoid, - Sigma one J1 accounts for the second layer bias,
- J represents the number of neurons in the hidden
layer of the network
11Multi-layered Feedforward
Usually, J 3 to take care of outliers but also
so as to avoid over-fitting
12Criteria Datasets
- Criteria
- a) reduce variability of log-ratios between
replicated slides and within slides - b) ability to distinguish truly regulated
genes from the other genes - Datasets
- Apo AI a,b
- Swirl Zebra Fish a
- Poplar experiment a
- Perturbed Apo AI b
13Classic Neural Nets vs. Robust NNets
14Criteria refresher
- The ability to reduce the variability of
log-ratios between replicated slides and within
slides - The ability to distinguish truly regulated genes
from the other genes
15Impact on Variability
16Cont. 3 Data Sets
17Downregulated Gene Sorting Apo AI set
18DRGS Perturbed Apo AI set
19Spatial Uniformity of M values distribution
20Results Table
21Strengths/Weaknesses
- Seems promising
- Uses multiple tests to determine efficacy
- Doesnt use enough datasets
- Uses patterned perturbed dataset
- But no real perturbed dataset
22Future Work
- More datasets
- When should this normalization technique be used
over other techniques? - Should this technique be combined with elements
of other techniques to further improve it?
23References
- Tarca, A.L., J.E.K. Cooke, and J. Mackay. A
robust neural networks approach for spatial and
intensity-dependent normalization of cDNA
microarray data." Bioinformatics Jun 2005 21
2674 - 2683 - Haykin, Simon. Neural Networks A Comprehensive
Foundation. New Jersey Prentice Hall, 1999. - Mount, David W. Bioinformatics sequence and
genome analysis. New York Cold Spring Harbor
Laboratory Press, 2001.