A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data

About This Presentation

Title:

A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data

Description:

A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data A.L. Tarca, J.E.K. Cooke and J. MacKay – PowerPoint PPT presentation

Number of Views:128

Avg rating:3.0/5.0

Slides: 24

Provided by: Dana145

Learn more at: https://www.calstatela.edu

Category:

more less

Transcript and Presenter's Notes

Title: A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data

1
A robust neural networks approach for spatial and
intensity-dependent normalization of cDNA
microarray data

A.L. Tarca, J.E.K. Cooke and J. MacKay

Presented by Dana Mohamed
2
Microarrays
3
Importance of Microarrays (and that the
data is correct)

Assumption that microarray data linearly reflects
amount of mRNA present in cell
In turn, reflects gene expression levels
If the data is incorrect,
So is our interpretation of gene expression
And therefore all the science built on that
interpretation is also incorrect

4
Where error is

Intensity of Fluorescence
Overall imbalance of dye intensity
2 dyes Cy5 (R) and Cy3 (G)
If R G expressed at equal levels, R/G 1
Space
Intensities variable on coordinates
Can be dirty on sides of microarray

5
Previous Methods

Many address intensity bias
Few address spatial bias
Most rely on M M m
M is the normalized values
M is the raw log-ratio (M log2R/G)
m is the estimate of the bias

6
Important Variables

M log2(R/G)
Log ratio converts multiplicative error to
additive error
A (1/2)0.5log2RG
Average of the log-intensities
Minus-add plots
M vs. A
Useful for assessing systematic bias

7
Calculating m in other methods

gMed global median normalization
m median(Mi)
Mi are all the values of M
pLo print tip loess
m ci (A)
pLoGS
found in GeneSight biodiscovery.com
Local group median (3x3 square regions)
print tip loess
cPLo2D - print tip loess pure 2D normalization
BioConductor bioconductor.org
m a ci (A) ß ci (SpotRow,SpotCol)
ci (SpotRow,SpotCol) is the loess estimate of M
using spot row and column coordinates inside the
ith print tip
gLoMedF
global loess normalization
spatial median filter

8
Robust Neural Networks Technique

pNN2DA print tip robust neural nets 2D and A
Attempt to find the best fit of M using A and the
2-D space coordinates of the spots
m ci (A,X,Y)
Instead of using individual print tips use 3x3
bins of them X and Y
Accounts for spatial bias

9
Neural Nets Terminology

Uses multi-layer feedforward network
Sigmoid Function

10
Neural Networks

Uses multi-layer feedforward network
x is the vector (X,Y,A,1),
I 3,
w are the weights,
sigma one represents the hidden neurons and they
are sigmoid functions,
sigma two is the single neuron in the output
layer, which is also sigmoid,
Sigma one J1 accounts for the second layer bias,
J represents the number of neurons in the hidden
layer of the network

11
Multi-layered Feedforward
Usually, J 3 to take care of outliers but also
so as to avoid over-fitting
12
Criteria Datasets

Criteria
a) reduce variability of log-ratios between
replicated slides and within slides
b) ability to distinguish truly regulated
genes from the other genes
Datasets
Apo AI a,b
Swirl Zebra Fish a
Poplar experiment a
Perturbed Apo AI b

13
Classic Neural Nets vs. Robust NNets
14
Criteria refresher

The ability to reduce the variability of
log-ratios between replicated slides and within
slides
The ability to distinguish truly regulated genes
from the other genes

15
Impact on Variability
16
Cont. 3 Data Sets
17
Downregulated Gene Sorting Apo AI set
18
DRGS Perturbed Apo AI set
19
Spatial Uniformity of M values distribution
20
Results Table
21
Strengths/Weaknesses

Seems promising
Uses multiple tests to determine efficacy
Doesnt use enough datasets
Uses patterned perturbed dataset
But no real perturbed dataset

22
Future Work

More datasets
When should this normalization technique be used
over other techniques?
Should this technique be combined with elements
of other techniques to further improve it?

23
References

Tarca, A.L., J.E.K. Cooke, and J. Mackay. A
robust neural networks approach for spatial and
intensity-dependent normalization of cDNA
microarray data." Bioinformatics Jun 2005 21
2674 - 2683
Haykin, Simon. Neural Networks A Comprehensive
Foundation. New Jersey Prentice Hall, 1999.
Mount, David W. Bioinformatics sequence and
genome analysis. New York Cold Spring Harbor
Laboratory Press, 2001.