Setting Up a Replica Exchange Approach to Motif Discovery in DNA PowerPoint PPT Presentation

presentation player overlay
1 / 25
About This Presentation
Transcript and Presenter's Notes

Title: Setting Up a Replica Exchange Approach to Motif Discovery in DNA


1
Setting Up a Replica Exchange Approach to Motif
Discovery in DNA
  • Jeffrey Goett
  • Advisor
  • Professor Sengupta

2
Protein Synthesis from DNA
3
Binding Sites
4
Discovering New Binding Motifs
Motif GCTCAG
ATCG GCTCAG CTAG
CACT GATCAG AGTA
TTCC GCTCTG TAAC
GCTA GCTCAA ATCG
Motif Probability Model
5
Modeling Motifs in Sequences
Assume Break into N sequences Each sequence has
one instance of motif embedded in random
background Variations of motif by point mutation,
but not insertion or deletion
ATATCCGTA AATCGAGAC TCGATGTGT CCACCTGCA
6
Modeling Motifs in Sequences
The Alignment Starting position of motif in
each sequence
AT ATC CGTA A ATC GAGAC TCG ATG TGT CC ACC TGCA
The Motif Probability Distribution Probability
of each letter occurring at each motif position
7
Scoring a Model
Log-likelihood score
8
Example Models
A TAT CCGTA AAT CGA GAC TCGATG TGT CC ACC TGCA
AT ATC CGTA A ATC GAGAC TCG ATG TGT CC ACC TGCA
9
The Gibbs Sampler
that maximizes
We want to find
10
The Gibbs Sampler
11
The Gibbs Sampler
Times visited
Over time, the frequency distribution approaches
12
Optimization Technique
If we assume areas of local maximization
contribute the most during integration to the
local maximizations of
Biasing our search to these areas may discover
the pj,ro values which maximize faster.
13
Multiple Gibbs Samplers
By combining results from Gibbs Samplers begun at
random positions, find maximizing sooner
14
Replica Exchange/Parallel Tempering
Low-sensitivity samplers which scout out area
periodically swap with high-sensitivity
samplers good at focused searches if swap appears
promising.
15
Controlling Sensitivity
Adjust the relative probability of sampling an xi
by adjusting a new parameter in distribution
Large
Small
Search breadth of space
Focused search of region
16
Testing the Sensitivity
Running on randomly generated sequences to see
motifs found, different sensitivity samplers
converge to different scores.
17
Predicting Convergence Score
Measure of Similarity magnetization
Ex m.5
Configuration Score energy
m0
m0
m1
m0
m.5
E2J
E2J
E-6J
E2J
E0
18
Alignment Analogue
m1
E-9J
A
B
m.77
E-5J
C
m.77
m.77
E-5J
E-5J
19
Test Results
L lt alphabetw
20
Test Results
L gt alphabetw
21
Test Results
22
Test Results
23
Hidden Motifs Gibbs Sampler
Beta .1
Beta .5
Beta .9
Beta 1.3
Beta 1.7
Beta 2
W5, l500
24
Hidden Motifs Replica Exchange
25
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com