Algorithms FDK Center of Excellence - PowerPoint PPT Presentation

Provided by: csHel
Transcript and Presenter's Notes

1
Algorithms FDK Center of Excellence
  • Esko Ukkonen

Department of Computer Science
2
Mission
  • Sequences Are Everywhere
  • combinatorial pattern matching
  • pattern discovery in sequences
  • dynamic programming, automata theory, advanced
    data structures, probabilistic modeling
  • algorithms on strings and biological sequence
    analysis studied since 1980; many of our results
    appear in textbooks

3
J Kärkkäinen, P Sanders & S Burkhardt: Linear
work suffix array construction. J ACM 53 (2006),
918-936
  • direct construction of a suffix array in linear
    time
  • immediately included in teaching materials
    internationally

[Figure: the suffixes of abaab (abaab, baab, aab, ab, b), SuffixTree(abaab), and the same suffixes after sorting (aab, ab, abaab, b, baab)]
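
To make the suffix array of the example string abaab concrete, here is a minimal sketch that sorts the suffix start positions directly. It is a naive, sort-based construction for illustration only, not the linear-work algorithm of the cited paper; the function name `naive_suffix_array` is our own.

```python
def naive_suffix_array(s: str) -> list[int]:
    """Return the start positions of the suffixes of s in lexicographic order.

    Naive approach: compare whole suffixes while sorting. This is simple
    but much slower than linear time on long inputs.
    """
    return sorted(range(len(s)), key=lambda i: s[i:])


if __name__ == "__main__":
    text = "abaab"
    sa = naive_suffix_array(text)
    print(sa)                       # [2, 3, 0, 4, 1]
    print([text[i:] for i in sa])   # ['aab', 'ab', 'abaab', 'b', 'baab']
```

The sorted suffixes match the order shown on the slide: aab, ab, abaab, b, baab.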
4
V Mäkinen, G Navarro & E Ukkonen: Transposition
invariant string matching. J Algorithms 56
(2005)
E Ukkonen, K Lemström & V Mäkinen: Sweepline
the music! LNCS 2598 (2003), 330-342.
  • Transposition invariant variants of string
    matching algorithms
  • Music retrieval

Transposition by -2
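
One simple way to see what transposition invariance means for exact matching is to compare successive pitch differences instead of the pitches themselves. The sketch below assumes melodies are given as lists of integer pitches (for example MIDI note numbers); the function names are hypothetical. It covers only the exact-matching case and is not the algorithms of the cited papers.

```python
def intervals(pitches: list[int]) -> list[int]:
    """Successive differences; unchanged when every pitch is shifted equally."""
    return [b - a for a, b in zip(pitches, pitches[1:])]


def find_transposed(text: list[int], pattern: list[int]) -> list[tuple[int, int]]:
    """Return (position, transposition) for every exact occurrence of
    pattern in text up to a constant pitch shift."""
    if not pattern or len(pattern) > len(text):
        return []
    t, p = intervals(text), intervals(pattern)
    m = len(p)
    hits = []
    for i in range(len(t) - m + 1):
        if t[i:i + m] == p:
            # The shift is the difference between the first matched pitches.
            hits.append((i, text[i] - pattern[0]))
    return hits


if __name__ == "__main__":
    melody = [62, 64, 65, 67]
    score = [70, 60, 62, 63, 65, 50]
    print(find_transposed(score, melody))  # [(1, -2)]
```

In the usage example the melody occurs at position 1 of the score, transposed by -2.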
5
P Rastas, M Koivisto, H Mannila & E Ukkonen: A
hidden Markov technique for haplotype
reconstruction. WABI 2005, 140-151
[Figure labels: founder, SNP]
6
O Hallikas et al.: Genome-wide prediction of
mammalian enhancers based on analysis of
transcription-factor binding affinity. Cell 124
(2006), 47-59.
[Figure labels: enhancer module; gene1, gene2, gene3, gene4; DNA, transcription (transcription factors), RNA, translation, proteins]
7
Computational identification of enhancer elements
  • Preserved in evolution
  • Affinities of functional cis-elements.
  • Spatial arrangement of elements within a module.

[Figure labels: Human, Mouse]
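
As a rough illustration of the binding-affinity criterion alone, the sketch below scores every window of a DNA sequence against a position weight matrix and reports windows above a threshold. The count matrix, the threshold, and all names are hypothetical toy values chosen for the example. Conservation between species and the spatial arrangement of sites are not modelled, and this is not the method of the cited paper.

```python
import math

# Hypothetical toy counts for a 4-bp motif (one dict per motif position).
TOY_COUNTS = [
    {"A": 9, "C": 0, "G": 1, "T": 0},
    {"A": 0, "C": 9, "G": 0, "T": 1},
    {"A": 1, "C": 0, "G": 9, "T": 0},
    {"A": 0, "C": 1, "G": 0, "T": 9},
]


def log_odds(counts, pseudocount=1.0, background=0.25):
    """Turn base counts into log2-odds scores against a uniform background."""
    matrix = []
    for column in counts:
        total = sum(column.values()) + 4 * pseudocount
        matrix.append({
            base: math.log2(((column[base] + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return matrix


def scan(sequence, matrix, threshold):
    """Return (position, window, score) for windows scoring >= threshold."""
    width = len(matrix)
    hits = []
    for i in range(len(sequence) - width + 1):
        window = sequence[i:i + width]
        if any(base not in "ACGT" for base in window):
            continue  # skip windows containing ambiguous symbols such as N
        score = sum(matrix[j][base] for j, base in enumerate(window))
        if score >= threshold:
            hits.append((i, window, score))
    return hits


if __name__ == "__main__":
    for position, window, score in scan("TTACGTTT", log_odds(TOY_COUNTS), 5.0):
        print(position, window, round(score, 2))
```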
8
Enhancer prediction for N-myc (Cell 2006, Nat.
Protocols 2006)
200 kb Mouse N-Myc genomic region
200 kb Human N-Myc genomic region
Conserved GLI binding sites in two predicted
enhancer elements, CM5 and CM7
9
A Rantanen: Algorithms for 13C Metabolic Flux
Analysis. PhD Thesis, 2006.
10
Future goals
  • Indexing sequential data for approximate searches
  • Distance functions for sequences
  • Theoretical framework, efficient evaluation,
    complexity bounds, relations between distances
  • Application-specific distances (XML, images,
    music, ...)
  • Finding structure and signals in sequences
  • Supervised and unsupervised learning of signals
  • Statistical significance of the findings
  • Systems biology: how does the program encoded in
    genomes work?
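
As one familiar example of a distance function for sequences, here is a minimal sketch of the unit-cost edit (Levenshtein) distance computed by dynamic programming. It is a textbook illustration, not one of the new distances the slide has in mind, and the function name is our own.

```python
def edit_distance(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn a into b (unit costs)."""
    # previous[j] holds the distance between the current prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # match or substitute
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    print(edit_distance("kitten", "sitting"))  # 3
```

The table is filled row by row and only two rows are kept, so the sketch runs in O(|a|·|b|) time and O(|b|) space.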