Parametric Alignment - PowerPoint PPT Presentation

Slides: 20
Provided by: colin130

Transcript and Presenter's Notes

1
Parametric Alignment
  • Colin Dewey and Lior Pachter

2
Biological Sequence Alignment
  • What are we really doing?
  • Establishing homology relationships
  • Given two biological strings σ1, σ2, define a
    homology relation Rhom on σ1 × σ2

-CGATT
GCC-TA
Homologous positions: nucleotides evolved from a
common ancestral nucleotide
3
Alignment Parameters
  • Global Alignment (Needleman-Wunsch)
  • Special case of alignment pair HMM
  • Parameters (combinations of emission and
    transition probabilities):
      • Match (m)
      • Mismatch (x)
      • Gap Start (g)
      • Gap Extend (e)

4
Example Global Alignment
σ1 = GCCTA, σ2 = CGATT; log(match) = 1, log(mismatch) = -1
[Figure: alignment grid with axes labeled CGATT and GCCTA]
Number of possible alignments: D(5,5) = 1,683
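
The count D(5,5) = 1,683 is a Delannoy number: every alignment path reaches a grid cell from the cell above, the cell to the left, or the diagonal cell. A minimal sketch of that recurrence follows. The function name `delannoy` is made up for this illustration and is not from the slides.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Number of global alignments of two sequences with lengths m and n:
// D(i, j) = D(i-1, j) + D(i, j-1) + D(i-1, j-1), with D(i, 0) = D(0, j) = 1.
// (Hypothetical helper, not part of the original deck.)
std::uint64_t delannoy(std::size_t m, std::size_t n) {
    std::vector<std::uint64_t> row(n + 1, 1);  // row 0: D(0, j) = 1
    for (std::size_t i = 1; i <= m; ++i) {
        std::uint64_t diag = row[0];  // D(i-1, 0)
        row[0] = 1;                   // D(i, 0)
        for (std::size_t j = 1; j <= n; ++j) {
            const std::uint64_t up = row[j];  // D(i-1, j)
            row[j] = up + row[j - 1] + diag;  // up + left + diagonal
            diag = up;
        }
    }
    return row[n];
}
```

Calling `delannoy(5, 5)` gives the count for two sequences of length five. Only one row is kept in memory, the same trick the deck's implementation slides use.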
5
Example Global Alignment
σ1 = GCCTA, σ2 = CGATT; log(match) = 1, log(mismatch) = -1
[Figure: alignment grid with axes labeled CGATT and GCCTA]

-GCCTA   -GCCTA   -GCCTA   GC-CTA   GCC-TA
CG-ATT   CGA-TT   CGATT-   -CGATT   -CGATT
6
Parametric Alignment
  • An alignment corresponds to a monomial in the
    alignment parameters
  • The monomial is represented by a point (its
    exponent vector)
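
One way to make the monomial concrete is to count the column types in an alignment: the counts are the exponents, and the log-score is linear in them. The sketch below assumes a single gap parameter rather than separate gap-start and gap-extend terms. The names `Exponents`, `alignmentExponents`, and `logScore` are hypothetical and not from the slides.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Exponents of the monomial match^matches * mismatch^mismatches * gap^gaps.
struct Exponents {
    int matches;
    int mismatches;
    int gaps;
};

// Count column types in a gapped alignment given as two equal-length rows,
// with '-' marking a gap. (Illustrative helper, assumes one gap parameter.)
Exponents alignmentExponents(const std::string& top, const std::string& bottom) {
    assert(top.size() == bottom.size());
    Exponents e = {0, 0, 0};
    for (std::size_t k = 0; k < top.size(); ++k) {
        if (top[k] == '-' || bottom[k] == '-') {
            ++e.gaps;
        } else if (top[k] == bottom[k]) {
            ++e.matches;
        } else {
            ++e.mismatches;
        }
    }
    return e;
}

// The logarithm of the monomial is a linear function of the exponents.
double logScore(const Exponents& e, double logMatch, double logMismatch,
                double logGap) {
    return e.matches * logMatch + e.mismatches * logMismatch + e.gaps * logGap;
}
```

For the alignment with rows `-GCCTA` and `CG-ATT`, this counts two matches, two mismatches, and two gaps, so that alignment is the point (2, 2, 2) in exponent space.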

7
Input/Output
  • Input: σ1, σ2
  • Parametric Aligner
  • Output: Newton Polytope, Normal Fan, Alignments

-GCCTA   -GCCTA
CG-ATT   CGA-TT
8
Example Polytope
VERTICES
1 1 4
1 2 4
1 0 6
1 2 6
1 0 10

FACETS
-4 0 1
0 1 0
-6 2 1
10 -2 -1
2 -1 0
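
The vertex and facet lists can be cross-checked against each other. The sketch below assumes the usual homogeneous-coordinate convention for such listings: a vertex row is (1, a, b), and a facet row (f0, f1, f2) encodes the inequality f0 + f1·a + f2·b ≥ 0. That convention is an assumption here, since the slide does not state it. The names `Row`, `slack`, and `countTightFacets` are hypothetical.

```cpp
#include <cstddef>
#include <vector>

// One row of a VERTICES or FACETS listing: three integer coordinates.
struct Row {
    int c0, c1, c2;
};

// Assumed convention: vertex rows are (1, a, b) and a facet row (f0, f1, f2)
// stands for the inequality f0 + f1*a + f2*b >= 0.
int slack(const Row& facet, const Row& vertex) {
    return facet.c0 * vertex.c0 + facet.c1 * vertex.c1 + facet.c2 * vertex.c2;
}

// Number of facets the point lies on, or -1 if it violates any inequality.
int countTightFacets(const std::vector<Row>& facets, const Row& vertex) {
    int tight = 0;
    for (std::size_t k = 0; k < facets.size(); ++k) {
        const int s = slack(facets[k], vertex);
        if (s < 0) return -1;
        if (s == 0) ++tight;
    }
    return tight;
}

// The example polytope from the slide, one row per vertex or facet.
std::vector<Row> exampleVertices() {
    return {{1, 1, 4}, {1, 2, 4}, {1, 0, 6}, {1, 2, 6}, {1, 0, 10}};
}

std::vector<Row> exampleFacets() {
    return {{-4, 0, 1}, {0, 1, 0}, {-6, 2, 1}, {10, -2, -1}, {2, -1, 0}};
}
```

Under this reading, each of the five listed vertices satisfies all five inequalities and lies on exactly two facets, which is what one expects of a vertex of a polygon.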
9
Normal Fan
[Figure: normal fan of the example polytope; axes log(match) and log(mismatch); cones labeled 0 through 4]
10
Constrained Normal Fan
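Constraints: log(mismatch) < 0 and log(mismatch) < log(match)
[Figure: normal fan restricted by these constraints; axes log(match) and log(mismatch); remaining cones labeled 0 and 1]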
11
Output
Number of polytope vertices: 5

Inequalities for cone 0:
  -1m + 0x > 0
  1m + -1x > 0
match = -0.5, mismatch = -1, gap = -1
Number of optimal alignments for cone: 1

Inequalities for cone 1:
  1m + 0x > 0
  0m + -1x > 0
match = 0.5, mismatch = -0.5, gap = -0.5
Number of optimal alignments for cone: 5

Total number of optimal alignments: 6
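
The sample parameter values printed for each cone can be checked against the example polytope's vertices (1,4), (2,4), (0,6), (2,6), (0,10). The sketch below assumes each vertex reads as (number of matches, number of mismatches plus gaps), with the gap score set equal to the mismatch score as in the output above. That reading is an inference from the example alignments, not something the slides state. The names `Vertex`, `bestVertex`, and `slideVertices` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// A polytope vertex, read as (matches, mismatches + gaps). This reading is
// an assumption made for illustration.
struct Vertex {
    int matches;
    int penalties;
};

// Index of the vertex maximizing matches * m + penalties * x, where m and x
// are the log-scores for a match and for a mismatch or gap.
std::size_t bestVertex(const std::vector<Vertex>& vertices, double m, double x) {
    assert(!vertices.empty());
    std::size_t best = 0;
    double bestScore = vertices[0].matches * m + vertices[0].penalties * x;
    for (std::size_t k = 1; k < vertices.size(); ++k) {
        const double score = vertices[k].matches * m + vertices[k].penalties * x;
        if (score > bestScore) {
            bestScore = score;
            best = k;
        }
    }
    return best;
}

// Vertices of the example polytope, in the order listed on the slide.
std::vector<Vertex> slideVertices() {
    return {{1, 4}, {2, 4}, {0, 6}, {2, 6}, {0, 10}};
}
```

With the cone 0 sample (m = -0.5, x = -1) the maximizer is vertex (1, 4), the ungapped alignment. With the cone 1 sample (m = 0.5, x = -0.5) it is vertex (2, 4), matching the five alignments that each have two matches, two mismatches, and two gaps.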
12
Alignments
Cone 0:
GCCTA
CGATT

Cone 1:
-GCCTA   -GCCTA   -GCCTA   GC-CTA   GCC-TA
CG-ATT   CGA-TT   CGATT-   -CGATT   -CGATT
13
Random vs. Related Sequences
14
Polytope vertices for different models
15
Theorem
  • Theorem: There exist two homologous biological
    sequences σ1, σ2 such that no choice of parameters
    for the 2-parameter Needleman-Wunsch algorithm
    gives the biologically correct alignment.

16
Proof of Theorem
dr_3   AGTGATTT----TTGCATAACAGGTCTACTT----
hs_34  AGGCATCAGAAGTTGAGAGACAACTCTCCATGCAG

       TACATGAC----------ATTTTCGAGAAAAAAA
       TCCACGCCCTCAGAGAAGACTTTCGGGAGAAAAA

VERTICES
1 38 33
1 37 33
1 38 45
1 0 70
1 0 121
17
Implementation
template<typename SemiRing>
void alignGlobalLastRow(const string& seq1,
                        const string& seq2,
                        const typename SemiRing::Element& match,
                        const typename SemiRing::Element& mismatch,
                        const typename SemiRing::Element& gap,
                        vector<typename SemiRing::Element>& row)
18
Implementation
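{
    // Assumed alias: the slide uses Element without showing its declaration
    typedef typename SemiRing::Element Element;
    const Element one = SemiRing::multiplicativeIdentity;
    // Initialize row
    row.resize(seq2.size() + 1);
    row[0] = one;
    // Calculate first row
    for (size_t j = 1; j <= seq2.size(); ++j)
        row[j] = gap * row[j - 1];
    // Calculate remaining rows
    Element up, diag;
    for (size_t i = 1; i <= seq1.size(); ++i) {
        diag = row[0];
        row[0] *= gap;
        for (size_t j = 1; j <= seq2.size(); ++j) {
            up = row[j];
            if (seq1[i - 1] == seq2[j - 1])
                row[j] = match * diag + gap * (up + row[j - 1]);
            else
                row[j] = mismatch * diag + gap * (up + row[j - 1]);
            diag = up;
        }
    }
}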
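
The point of templating on a semiring is that one recurrence answers different questions depending on what "+" and "*" mean. The sketch below pairs a reconstruction of `alignGlobalLastRow` with two semirings. `MaxPlus`, `Counting`, `optimalScore`, and `countAlignments` are hypothetical names, and the only interface assumed is an `Element` type plus a `multiplicativeIdentity` member, as used on the slides.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <vector>

// Tropical semiring (assumed example): "+" is max, "*" is ordinary addition.
struct MaxPlus {
    struct Element { int value; };
    static const Element multiplicativeIdentity;
};
const MaxPlus::Element MaxPlus::multiplicativeIdentity = {0};

inline MaxPlus::Element operator+(MaxPlus::Element a, MaxPlus::Element b) {
    return MaxPlus::Element{std::max(a.value, b.value)};
}
inline MaxPlus::Element operator*(MaxPlus::Element a, MaxPlus::Element b) {
    return MaxPlus::Element{a.value + b.value};
}
inline MaxPlus::Element& operator*=(MaxPlus::Element& a, MaxPlus::Element b) {
    a.value += b.value;
    return a;
}

// Ordinary arithmetic on non-negative integers (assumed example).
struct Counting {
    typedef unsigned long long Element;
    static const Element multiplicativeIdentity;
};
const Counting::Element Counting::multiplicativeIdentity = 1;

// Reconstruction of the slides' routine: fills `row` with the last row of
// the dynamic-programming table, using the semiring's "+" and "*".
template <typename SemiRing>
void alignGlobalLastRow(const std::string& seq1, const std::string& seq2,
                        const typename SemiRing::Element& match,
                        const typename SemiRing::Element& mismatch,
                        const typename SemiRing::Element& gap,
                        std::vector<typename SemiRing::Element>& row) {
    typedef typename SemiRing::Element Element;
    const Element one = SemiRing::multiplicativeIdentity;
    row.resize(seq2.size() + 1);
    row[0] = one;
    for (std::size_t j = 1; j <= seq2.size(); ++j) row[j] = gap * row[j - 1];
    Element up, diag;
    for (std::size_t i = 1; i <= seq1.size(); ++i) {
        diag = row[0];
        row[0] *= gap;
        for (std::size_t j = 1; j <= seq2.size(); ++j) {
            up = row[j];
            const Element& sub = (seq1[i - 1] == seq2[j - 1]) ? match : mismatch;
            row[j] = sub * diag + gap * (up + row[j - 1]);
            diag = up;
        }
    }
}

// Best alignment score under the max-plus semiring.
int optimalScore(const std::string& s1, const std::string& s2,
                 int match, int mismatch, int gap) {
    const MaxPlus::Element m = {match}, x = {mismatch}, g = {gap};
    std::vector<MaxPlus::Element> row;
    alignGlobalLastRow<MaxPlus>(s1, s2, m, x, g, row);
    return row.back().value;
}

// With every weight set to 1, sums of products count the alignments.
unsigned long long countAlignments(const std::string& s1, const std::string& s2) {
    std::vector<Counting::Element> row;
    alignGlobalLastRow<Counting>(s1, s2, 1, 1, 1, row);
    return row.back();
}
```

For GCCTA and CGATT, `optimalScore` with scores 1, -1, -1 returns -2, and `countAlignments` returns 1,683, the D(5,5) figure quoted earlier in the deck. A polytope-valued semiring would presumably plug into the same routine, but that is not shown on these slides.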
19
Next Steps
  • Bayesian Alignment
  • Prior probabilities on parameters
  • Find best alignment given priors
  • More easily