Title: Similarities and differences in genomewide expression programs of six organisms Sven Bergmann
1Similarities and differences in genome-wide
expression programs of six organismsSven Bergmann
2Large-scale expression data
E.coli
S. cerevisiae
A. thaliana
H. sapiens
C.elegans
D. melanogaster
3How to extract biological insight from these data?
4How to integrate sequence information?
5Comparative analysis
- Practical tools
- decomposing data into transcription modules
- integration of sequence information provides
hypotheses - Global analysis
- study global network properties
- reveal design principles
6Signature Algorithm
7Signature Algorithm Output
Co-regulated genes
8Iterative Signature Algorithm
OUTPUT
9Identification of transcription modules using
many random seeds
10Threshold parameter allows for modular
decomposition at different resolutions
11Representation as Module trees
Cell-cycle
Amino-acid metabolism
TM
Stress
Mating
Protein synthesis
12Color-code visualizes module properties
13Segregation between modules with and without
homologues
Number of modules
Random control
data
Fraction of homologues
14Do the six organisms share the same basic
architecture structure?
Distinct modular organization may reflect
different adaptation requirements
15Comparative analysis
- Practical tools
- decomposing data into transcription modules
- integration of sequence information provides
hypotheses - Global analysis
- study global network properties
- reveal design principles
16Sequence similarity allows for gene mapping
But which homologue has similar function?
17Mapping Transcription Modules
18Average correlation Ribosomal genes
- highly correlated
- statistically significant
19Average correlation
Mapped transcription modules exhibit significant
correlations!
20Gene Refinement
BLAST
21Refined modules contain only co-regulated genes
22Available annotation indicates that added genes
are functionally related
Worm heat-shock
23Higher order correlations
correlated
anti-correlated
24Correlation patterns are distinct for each
organism
25Comparative analysis
- Practical tools
- decomposing data into transcription modules
- integration of sequence information provides
hypotheses - Global analysis
- study global network properties
- reveal design principles
26Constructing an Expression network
27Constructing an Expression network
28Connectivity
k number of edges per node
29Connectivity distribution
30Connectivity distribution for yeast expression
network
31Connectivity Distributions for Expression Networks
- highly connected yeast genes
- related to protein synthesis
- rRNA processing
32Other networks with power-law distributions
actors
WWW
power-grid
n(k)
k
k
k
33Comparative analysis
- Practical tools
- decomposing data into transcription modules
- integration of sequence information provides
hypotheses - Global analysis
- study global network properties
- reveal design principles
34Theoretical Approaches
- Dynamically evolving networks with preferential
attachment (rich get richer) Barabasi
Albert 1999 - Systems with Highly Optimized Tolerance
(HOT)Carlson Doyle 2000
35Centrality Homology
Fraction of yeast genes with human homologue
connectivity
36Centrality Homology
37Centrality Lethality
38Clustering-coefficient
Friendships between friends
C
Possible friendships
How many of your friends are also friends amongst
each other?
6/15
39From global to modular Where do the stripes come
from?
Transcription Modules (TM) (co-regulated sets
of genes) appear as stripes!
C
k
40C against k diagrams for all expression networks
41Take-home Messages
- Accumulation of large scale expression data makes
comparative analysis of transcriptomes possible - Practical tools
- decomposing data into transcription modules
- gene-mapping provides hypotheses
- Expression networks of six organisms shareglobal
properties (power-law clustered)