Bioinformatic PhD. course - PowerPoint PPT Presentation

1 / 6
About This Presentation
Title:

Bioinformatic PhD. course

Description:

Bioinformatic PhD. course Bioinformatics Xavier Messeguer Peypoch (http://www.lsi.upc.es/~alggen) LSI Dep. de Llenguatges i Sistemes Inform tics – PowerPoint PPT presentation

Number of Views:35
Avg rating:3.0/5.0
Slides: 7
Provided by: LCL6
Category:

less

Transcript and Presenter's Notes

Title: Bioinformatic PhD. course


1
Bioinformatic PhD. course
Bioinformatics Xavier Messeguer Peypoch
(http//www.lsi.upc.es/alggen) LSI Dep. de
Llenguatges i Sistemes Informàtics BSC Barcelona
Supercomputing Center Universitat Politècnica de
Catalunya
2
Contents
1. Biological introduction
2. Comparison of short sequences ( up to
10.000bps)
Dot Matrix Pairwise align.
Multiple align. Hash alg.
3. Comparison of large sequences ( more that
10.000bps)
Data structures Suffix trees MUMs
4. String matching
Exact Extended Approximate
5. Sequence assembly
6. Projects PROMO, MREPATT,
3
Pairwise alignment
  • Recall that with two strings of length n

S2
C A -1
__
S1
O(n2)
22-1
1
And with 3 strings?
4
Multiple alignment
  • What happens with three strings?

Let n be their length, then the cost becomes
A C A -1
__
O(n3)
23-1
And with k strings?
O(nk 2k k2)
5
Multiple alignment programs
  • Multi-alignment programs
  • Malig (Progressive alignment)
  • http//alggen.lsi.upc.edu
  • Clustal (Progressive alignment)
  • http//www.ebi.ac.uk/clustalw
  • TCoffee (Progressive alignment data bases)
  • http//igs-server.cnr-mrs.fr/Tcoffee_cgi/index.c
    gi
  • HMM (Hidden Markov Models)

6
Multiple progressive alignment
  • Run alggen-program

Run Malig (Progressive alignment) http//alggen
.lsi.upc.edu
Run Clustal (Progressive alignment) http//www.
ebi.ac.uk/clustalw
Write a Comment
User Comments (0)
About PowerShow.com