Title: Biomimicry
1Genetic and epigenetic control of cis and trans regulatory variation
Justin Borevitz
Ecology and Evolution, University of Chicago
http://borevitzlab.uchicago.edu
2Universal Whole Genome Array
DNA
RNA
Gene/Exon Discovery, Gene model correction, Non-coding/micro-RNA
Chromatin Immunoprecipitation (ChIP-chip)
Alternative Splicing
Methylation
Antisense transcription
Polymorphism (SFPs): Discovery/Genotyping
Transcriptome Atlas: Expression levels, Tissue specificity
Comparative Genome Hybridization (CGH): Insertions/Deletions, Copy Number Polymorphisms
RNA Immunoprecipitation (RIP-chip)
Allele Specific Expression
Control for hybridization/genetic polymorphisms to understand TRUE expression variation
3Which arrays should be used?
cDNA array
Long oligo array
4Which arrays should be used?
Gene array
Exon array
Tiling array
35bp tile, 25mers 10bp gaps
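The tile geometry above can be made concrete with a short sketch. It assumes each 25-mer probe is followed by a 10 bp gap, so that probes start every 35 bp; the function name and the 0-based, end-exclusive coordinates are illustrative choices, not taken from the slides.

```python
def tile_probes(region_start, region_end, probe_len=25, gap=10):
    """Return (start, end) coordinates of probes tiled across a region.

    Coordinates are 0-based and end-exclusive (an assumption for this sketch).
    The tile step is probe_len + gap, i.e. 25 + 10 = 35 bp by default.
    """
    step = probe_len + gap
    probes = []
    start = region_start
    # Keep only probes that fit entirely inside the region.
    while start + probe_len <= region_end:
        probes.append((start, start + probe_len))
        start += step
    return probes


probes = tile_probes(0, 140)
print(probes)
```

With these defaults, consecutive probes are separated by exactly `gap` uncovered bases.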
5Which arrays should be used?
SNP array
How about multiple species? Microbial communities?
Pst, Psm, Psy, Psx, Agro, Xanthomonas, H. parasitica, 15 viruses
Resequencing array
Tiling/SNP array (2007): 250k SNPs, 1.6M tiling probes
6Genomic profile of cellular systems responding to
the environment
[Figure: schematic of a chromosome region (x-axis: Chromosome (bp)) showing gene models ORFa and ORFb (start, poly-A tail AAAAA), a Transcriptome Atlas track, a deletion, M marks, SFPs, SNPs, and a conservation track]
7Talk Outline
Van x Col
- Copy Number Variation (Indels)
- Single Feature Polymorphisms
- Methylation Polymorphisms
- Expression Polymorphisms: gene, exon; additive, dominant
Haplotype Diversity (if time permits)
Dr Xu Zhang (borevitzlab.uchicago.edu)
8The experiment
Col♀ x Col♂
Van♀ x Van♂
Col♀ x Van♂
Van♀ x Col♂
- parental strains and reciprocal F1 hybrids
- mRNA from total RNA, and genomic DNA
9Potential Deletions
10Deletions vs duplications
11Natural Variation on Tiling Arrays
12Distribution of indels along chromosomes
13Potential Deletions
>500 potential deletions
45 confirmed by Ler sequence
23 (of 114) transposons
Disease Resistance (R) gene clusters
Single R gene deletions
Genes involved in secondary metabolism
Unknown genes
14Potential Deletions Suggest Candidate Genes
FLOWERING1 QTL
Chr1 (bp)
Flowering Time QTL caused by a natural deletion in FLM (Werner et al., PNAS 2005)
15SFPs and CCGG Methylome
Col genomic DNA -> HpaII digestion -> random labeling
Col genomic DNA -> MspI digestion -> random labeling
Van genomic DNA -> HpaII digestion -> random labeling
Van genomic DNA -> MspI digestion -> random labeling
Full model: Intensity ~ genotype + enzyme + genotype x enzyme
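As a minimal sketch, the full model above could be fitted per array feature by ordinary least squares. The 0/1 coding of genotype and enzyme, the function name, and the toy intensities are assumptions made for illustration; the slides do not say how the model was actually fitted or tested.

```python
import numpy as np


def fit_full_model(intensity, genotype, enzyme):
    """Least-squares fit of intensity ~ genotype + enzyme + genotype x enzyme.

    Assumed coding: genotype 0 = Col, 1 = Van; enzyme 0 = MspI, 1 = HpaII.
    Returns coefficients [intercept, genotype, enzyme, interaction].
    """
    genotype = np.asarray(genotype, dtype=float)
    enzyme = np.asarray(enzyme, dtype=float)
    design = np.column_stack(
        [np.ones_like(genotype), genotype, enzyme, genotype * enzyme]
    )
    coef, *_ = np.linalg.lstsq(design, np.asarray(intensity, dtype=float), rcond=None)
    return coef


# Toy feature: two replicates per genotype/enzyme combination (made-up values).
genotype = [0, 0, 0, 0, 1, 1, 1, 1]
enzyme = [0, 0, 1, 1, 0, 0, 1, 1]
intensity = [8, 8, 10, 10, 8, 8, 8, 8]
coef = fit_full_model(intensity, genotype, enzyme)
print(coef)
```

In this toy feature the enzyme term is non-zero only in Col, so it shows up as a genotype x enzyme interaction, the pattern one would expect for a genotype-specific difference at a CCGG site.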
16SFP detection on tiling arrays
Xu Zhang
17Methylation polymorphisms are extensive
a: Features of constitutive CG methylation
b, c: Features of Col- or Van-specific methylation
d, f: cDNAs or promoters with feature(s) of enzyme effect (p < 0.1) or genotype x enzyme interaction (p < 0.05)
e, g: cDNAs or promoters containing CCGG feature(s)
h: Intergenic features (excluding cDNAs or promoters) of enzyme effect (p < 0.1) or genotype x enzyme interaction (p < 0.05)
i: Intergenic (excluding cDNAs or promoters) CCGG-containing features
18Verification of methylation polymorphisms
19Verification of methylation polymorphisms
20epiTyper
Samples: Col, Col, Col, Van, Van, Van, Col♀ x Van♂, Col♀ x Van♂,
Van♀ x Col♂, Van♀ x Col♂, Van♀ x Col♂
CCGG
chromomethylase 2 (CMT2) exon19
21Co-methylation of pericentromere regions
22Genic distribution of constitutive and
polymorphic methylation sites
23Correlation between gene size and constitutive CG
methylation
24Double-stranded random labeling
mRNA (AAAAA) -> random reverse transcription -> double-stranded cDNA -> random priming
25Whole genome tiling array
- High density and resolution: 1.6M unique probes at 35bp spacing
- Without bias toward known transcripts
Genetic hybridization polymorphisms could affect the estimation of gene expression
26Transcription and splicing
[Figure: chromosomal DNA (Exon 1, Intron 1, Exon 2, Intron 2, Exon 3) is transcribed into nuclear RNA; RNA splicing yields messenger RNA containing either Exons 1, 2 and 3 or Exons 1 and 3]
27The linear model
Gene probe Intensity ~ additive + dominant + maternal + e
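One way to read the linear model above is as a regression of probe intensity on three contrasts across the four genotypes. The sketch below is only an illustration: the contrast coding, the interpretation of F1c and F1v as hybrids with a Col or Van mother, and the toy intensities are all assumptions, not details given in the slides.

```python
import numpy as np

# Assumed contrast coding per genotype: (additive, dominant, maternal).
CODING = {
    "Col": (-1.0, 0.0, 0.0),
    "Van": (1.0, 0.0, 0.0),
    "F1c": (0.0, 1.0, -0.5),  # assumed: F1 hybrid with a Col mother
    "F1v": (0.0, 1.0, 0.5),   # assumed: F1 hybrid with a Van mother
}


def fit_inheritance(intensity, genotypes):
    """Least-squares fit of intensity ~ additive + dominant + maternal."""
    design = np.array([(1.0,) + CODING[g] for g in genotypes])
    coef, *_ = np.linalg.lstsq(design, np.asarray(intensity, dtype=float), rcond=None)
    return dict(zip(["mean", "additive", "dominant", "maternal"], coef))


# Toy gene with two replicates per genotype (made-up values).
genotypes = ["Col", "Col", "Van", "Van", "F1c", "F1c", "F1v", "F1v"]
intensity = [6, 6, 10, 10, 9, 9, 9, 9]
effects = fit_inheritance(intensity, genotypes)
print(effects)
```

Under this coding the intercept is the mid-parent value, the additive term is half the Van minus Col difference, the dominant term is the deviation of the F1 mean from the mid-parent, and the maternal term is the difference between the reciprocal F1s.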
28The pattern of gene expression inheritance
[Figure: mean gene intensity for Col, Van, F1v and F1c, illustrating inheritance classes: paternal, maternal, Col dominant, Van dominant, F1v dominant, F1c dominant, over dominant]
29Gene expression variation between genotypes
30Additive, dominant and maternal effects of gene
expression
31The pattern of gene expression inheritance
32Correlation of constitutive CG methylation and
absolute gene expression
33Correlation of polymorphic CG methylation and
gene expression variation
34Default expression status of exon and intron
- Exons: correction for gene expression
  - corrected by gene mean
  - corrected by gene median
  - splicing index (Mean_exon / Mean_gene)
- Introns: direct comparison
Exon/intron probe Intensity ~ additive + dominant + maternal + e
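The three exon corrections listed above can be sketched as follows. The splicing index follows the ratio given on the slide; treating the mean and median corrections as subtractions (as one would on log-scale intensities) is an assumption, as are the function name and the toy values.

```python
import numpy as np


def exon_corrections(exon_probes, gene_probes):
    """Correct an exon's signal for the expression level of its gene.

    exon_probes: intensities of probes inside one exon
    gene_probes: intensities of all probes in the gene
    """
    exon_mean = float(np.mean(exon_probes))
    gene_mean = float(np.mean(gene_probes))
    gene_median = float(np.median(gene_probes))
    return {
        # Subtraction is an assumption suited to log-scale intensities.
        "minus_gene_mean": exon_mean - gene_mean,
        "minus_gene_median": exon_mean - gene_median,
        # Ratio as written on the slide: Mean_exon / Mean_gene.
        "splicing_index": exon_mean / gene_mean,
    }


result = exon_corrections([2.0, 4.0], [2.0, 4.0, 6.0, 8.0, 10.0])
print(result)
```

A splicing index well below 1 in one genotype but not another would flag the exon as a candidate for differential splicing, independent of the gene's overall expression.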
35Differential exon splicing
Exon probe Intensity ~ additive + dominant + maternal + e
36Differential intron splicing
Intron probe Intensity ~ additive + dominant + maternal + e
37Some dominant effect in differential intron
splicing in F1 hybrids
38Comparison for enrichment in known alternatively
spliced exons
39Experimentally determined FDR for differential
splicing
40Photosynthesis related genes
Differentially spliced genes located in the chloroplast thylakoid
AT5G38660 APE1 (Acclimation of Photosynthesis to Environment): mutant has altered acclimation responses
41Generalized tiling array HMM
(by Jake Byrnes)
- 3-state HMM
- Discrete distribution for emission probability
- Transition probability accounts for probe spacing
- Baum-Welch parameter estimation
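The ingredients listed above can be illustrated with a small decoding sketch. This is not the implementation referenced on the slide: it uses hand-set parameters instead of Baum-Welch estimates, Viterbi decoding, and an assumed way of handling probe spacing (raising a per-35bp transition matrix to the number of steps between neighbouring probes). The state meanings (0 = lower, 1 = unchanged, 2 = higher) and all names are hypothetical.

```python
import numpy as np


def viterbi_spaced(obs, positions, base_trans, emit, init, unit=35):
    """Most likely state path for a discrete-emission HMM over spaced probes.

    obs: emission symbols (ints), one per probe
    positions: probe coordinates in bp, ascending
    base_trans: K x K transition matrix for one `unit`-bp step
    emit: K x M emission probabilities; init: length-K start probabilities
    """
    obs = np.asarray(obs)
    n_states = base_trans.shape[0]
    n = len(obs)
    logv = np.log(init) + np.log(emit[:, obs[0]])
    back = np.zeros((n, n_states), dtype=int)
    for t in range(1, n):
        # Assumption: a wider gap counts as several unit steps.
        steps = max(1, int(round((positions[t] - positions[t - 1]) / unit)))
        trans = np.linalg.matrix_power(base_trans, steps)
        scores = logv[:, None] + np.log(trans)  # scores[i, j]: state i -> j
        back[t] = np.argmax(scores, axis=0)
        logv = scores[back[t], np.arange(n_states)] + np.log(emit[:, obs[t]])
    path = [int(np.argmax(logv))]
    for t in range(n - 1, 0, -1):
        path.append(int(back[t][path[-1]]))
    return path[::-1]


# Hand-set illustrative parameters: 0.9 on the diagonal, 0.05 elsewhere.
base_trans = np.full((3, 3), 0.05) + np.eye(3) * 0.85
emit = np.full((3, 3), 0.05) + np.eye(3) * 0.85
init = np.full(3, 1.0 / 3.0)
obs = [1, 1, 2, 2, 2, 2, 1, 1]
positions = [0, 35, 70, 105, 140, 175, 245, 280]  # one 70 bp gap
path = viterbi_spaced(obs, positions, base_trans, emit, init)
print(path)
```

Raising the transition matrix to a power makes a state change more plausible across a long gap than between adjacent probes, which is one simple way to let the transition probability depend on probe spacing.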
42de novo transcriptome variation
Xu Zhang
43Comparison of annotation-based analysis and HMM
44Comparison of annotation-based analysis and HMM
45Array Haplotyping
- What about Diversity/selection across the genome?
- A genome wide estimate of population genetics parameters: θw, π, Tajima's D, ρ
- LD decay, Haplotype block size
- Deep population structure?
- Col, Lz, Bur, Ler, Bay, Shah, Cvi, Kas, C24, Est, Kin, Mt, Nd, Sorbo, Van, Ws2
- Fl-1, Ita-0, Mr-0, St-0, Sah-0
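For reference, the standard estimators behind some of the parameters named above (Watterson's theta, nucleotide diversity pi, and Tajima's D) can be computed from a matrix of biallelic markers. The sketch assumes each inbred accession contributes one haplotype coded 0/1 per marker; it returns totals for the region rather than per-bp values and does not estimate recombination or LD decay. The function name and toy data are illustrative.

```python
import math

import numpy as np


def diversity_stats(haplotypes):
    """Return (theta_w, pi, tajima_d) for an n x L matrix of 0/1 alleles."""
    hap = np.asarray(haplotypes)
    n = hap.shape[0]
    counts = hap.sum(axis=0)
    k = counts[(counts > 0) & (counts < n)].astype(float)  # segregating sites
    s = len(k)
    # Pi: expected number of pairwise differences, summed over sites.
    pi = float(np.sum(2.0 * k * (n - k) / (n * (n - 1))))
    a1 = sum(1.0 / i for i in range(1, n))
    theta_w = s / a1
    if s == 0:
        return theta_w, pi, float("nan")
    # Constants from Tajima (1989) for the variance of pi - theta_w.
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    d = (pi - theta_w) / math.sqrt(e1 * s + e2 * s * (s - 1))
    return theta_w, pi, d


# Toy data: 4 accessions x 3 markers (made-up values).
toy = [[1, 1, 0], [0, 1, 0], [0, 0, 1], [0, 0, 0]]
theta_w, pi, tajima_d = diversity_stats(toy)
print(theta_w, pi, tajima_d)
```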
46Array Haplotyping
Inbred lines
Low effective recombination due to partial selfing
Extensive LD blocks
47SFPs for reverse genetics
14 Accessions, 30,950 SFPs
http://naturalvariation.org/sfp
48Chromosome Wide Diversity
49Diversity 50kb windows
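A windowed diversity scan of this kind can be sketched by binning marker positions into fixed 50 kb windows and summing per-site diversity within each window. The input layout (positions plus counts of the non-reference allele among n accessions), the function name, and the toy values are assumptions for illustration.

```python
from collections import defaultdict


def windowed_pi(positions, allele_counts, n, window=50_000):
    """Sum per-site nucleotide diversity within fixed-size windows.

    positions: marker coordinates in bp
    allele_counts: count of the non-reference allele per marker among n accessions
    Returns {window_start: summed pi}.
    """
    totals = defaultdict(float)
    for pos, k in zip(positions, allele_counts):
        start = (pos // window) * window
        totals[start] += 2.0 * k * (n - k) / (n * (n - 1))
    return dict(totals)


# Toy data: three markers scored in 4 accessions (made-up values).
windows = windowed_pi([1_000, 20_000, 60_000], [1, 2, 1], n=4)
print(windows)
```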
50Tajima's D-like statistic, 50kb windows
RPS4
unknown
51R genes vs bHLH
52http://borevitzlab.uchicago.edu
Arabidopsis: Yan Li, Megan Dunning, Joy Bergelson, Magnus Nordborg, Paul Marjoram
Aquilegia: Christos Noutsos, Scott Hodges
Tall Grass Prairie: Geoff Morris, Mike Miller
Arrays: Xu Zhang, Shinhan Shiu, Ivan Baxter