A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data - PowerPoint PPT Presentation

1 / 23
About This Presentation
Title:

A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data

Description:

A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data A.L. Tarca, J.E.K. Cooke and J. MacKay – PowerPoint PPT presentation

Number of Views:128
Avg rating:3.0/5.0
Slides: 24
Provided by: Dana145
Category:

less

Transcript and Presenter's Notes

Title: A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data


1
A robust neural networks approach for spatial and
intensity-dependent normalization of cDNA
microarray data
  • A.L. Tarca, J.E.K. Cooke and J. MacKay

Presented by Dana Mohamed
2
Microarrays
3
Importance of Microarrays (and that the
data is correct)
  • Assumption that microarray data linearly reflects
    amount of mRNA present in cell
  • In turn, reflects gene expression levels
  • If the data is incorrect,
  • So is our interpretation of gene expression
  • And therefore all the science built on that
    interpretation is also incorrect

4
Where error is
  • Intensity of Fluorescence
  • Overall imbalance of dye intensity
  • 2 dyes Cy5 (R) and Cy3 (G)
  • If R G expressed at equal levels, R/G 1
  • Space
  • Intensities variable on coordinates
  • Can be dirty on sides of microarray

5
Previous Methods
  • Many address intensity bias
  • Few address spatial bias
  • Most rely on M M m
  • M is the normalized values
  • M is the raw log-ratio (M log2R/G)
  • m is the estimate of the bias

6
Important Variables
  • M log2(R/G)
  • Log ratio converts multiplicative error to
    additive error
  • A (1/2)0.5log2RG
  • Average of the log-intensities
  • Minus-add plots
  • M vs. A
  • Useful for assessing systematic bias

7
Calculating m in other methods
  • gMed global median normalization
  • m median(Mi)
  • Mi are all the values of M
  • pLo print tip loess
  • m ci (A)
  • pLoGS
  • found in GeneSight biodiscovery.com
  • Local group median (3x3 square regions)
  • print tip loess
  • cPLo2D - print tip loess pure 2D normalization
  • BioConductor bioconductor.org
  • m a ci (A) ß ci (SpotRow,SpotCol)
  • ci (SpotRow,SpotCol) is the loess estimate of M
    using spot row and column coordinates inside the
    ith print tip
  • gLoMedF
  • global loess normalization
  • spatial median filter

8
Robust Neural Networks Technique
  • pNN2DA print tip robust neural nets 2D and A
  • Attempt to find the best fit of M using A and the
    2-D space coordinates of the spots
  • m ci (A,X,Y)
  • Instead of using individual print tips use 3x3
    bins of them X and Y
  • Accounts for spatial bias

9
Neural Nets Terminology
  • Uses multi-layer feedforward network
  • Sigmoid Function

10
Neural Networks
  • Uses multi-layer feedforward network
  • x is the vector (X,Y,A,1),
  • I 3,
  • w are the weights,
  • sigma one represents the hidden neurons and they
    are sigmoid functions,
  • sigma two is the single neuron in the output
    layer, which is also sigmoid,
  • Sigma one J1 accounts for the second layer bias,
  • J represents the number of neurons in the hidden
    layer of the network

11
Multi-layered Feedforward
Usually, J 3 to take care of outliers but also
so as to avoid over-fitting
12
Criteria Datasets
  • Criteria
  • a) reduce variability of log-ratios between
    replicated slides and within slides
  • b) ability to distinguish truly regulated
    genes from the other genes
  • Datasets
  • Apo AI a,b
  • Swirl Zebra Fish a
  • Poplar experiment a
  • Perturbed Apo AI b

13
Classic Neural Nets vs. Robust NNets
14
Criteria refresher
  • The ability to reduce the variability of
    log-ratios between replicated slides and within
    slides
  • The ability to distinguish truly regulated genes
    from the other genes

15
Impact on Variability
16
Cont. 3 Data Sets
17
Downregulated Gene Sorting Apo AI set
18
DRGS Perturbed Apo AI set
19
Spatial Uniformity of M values distribution
20
Results Table
21
Strengths/Weaknesses
  • Seems promising
  • Uses multiple tests to determine efficacy
  • Doesnt use enough datasets
  • Uses patterned perturbed dataset
  • But no real perturbed dataset

22
Future Work
  • More datasets
  • When should this normalization technique be used
    over other techniques?
  • Should this technique be combined with elements
    of other techniques to further improve it?

23
References
  • Tarca, A.L., J.E.K. Cooke, and J. Mackay. A
    robust neural networks approach for spatial and
    intensity-dependent normalization of cDNA
    microarray data." Bioinformatics Jun 2005 21
    2674 - 2683
  • Haykin, Simon. Neural Networks A Comprehensive
    Foundation. New Jersey Prentice Hall, 1999.
  • Mount, David W. Bioinformatics sequence and
    genome analysis. New York Cold Spring Harbor
    Laboratory Press, 2001.
Write a Comment
User Comments (0)
About PowerShow.com