Forces and Prediction of Protein Structure - PowerPoint PPT Presentation

1 / 57
About This Presentation
Title:

Forces and Prediction of Protein Structure

Description:

Forces and Prediction of Protein Structure Ming-Jing Hwang ( ) Institute of Biomedical Sciences Academia Sinica http://gln.ibms.sinica.edu.tw/ – PowerPoint PPT presentation

Number of Views:207
Avg rating:3.0/5.0
Slides: 58
Provided by: shin151
Category:

less

Transcript and Presenter's Notes

Title: Forces and Prediction of Protein Structure


1
Forces and Prediction of Protein Structure
  • Ming-Jing Hwang (???)
  • Institute of Biomedical Sciences
  • Academia Sinica

http//gln.ibms.sinica.edu.tw/
2
Science 2005
3
Sequence - Structure - Function
MADWVTGKVTKVQNWTDALFSLTVHAPVLPFTAGQFTKLGLEIDGERVQR
AYSYVNSPDNPDLEFYLVTVPDGKLSPRLAALKPGDEVQVVSEAAGFFVL
DEVPHCETLWMLATGTAIGPYLSILR
4
Sequence/Structure Gap
  • Current (May 15, 2007) entries in protein
    sequence and structure database
  • SWISS-PROT/TREMBL 267,354/4,361,897
  • PDB 43,459

Sequence
Structure
5
Structural Bioinformatics Sequence/Structure
Relationship
Percent Identity
100 90 80 70 60 50 40 30 20 10 0
All possible sequences of amino acids
Protein structures observed in nature
Twilight zone
Midnight zone
Protein sequences observed in nature
6
Structure Prediction Methods
Homology modeling
Fold recognition
ab initio
0 10 20 30 40 50 60
70 80 90 100
sequence identity
7
Levinthals paradox (1969)
  • If we assume three possible states for every
    flexible dihedral angle in the backbone of a
    100-residue protein, the number of possible
    backbone configurations is 3200. Even an
    incredibly fast computational or physical
    sampling in 10-15 s would mean that a complete
    sampling would take 1080 s, which exceeds the age
    of the universe by more than 60 orders of
    magnitude.
  • Yet proteins fold in seconds or less!

Berendsen
8
Energy landscapes of protein folding
Borman, CE News, 1998
9
Levitts lecture for S
10
Levitt
11
Levitt
12
Other factors
  • Formation of 2nd elements
  • Packing of 2nd elements
  • Topologies of fold
  • Metal/co-factor binding
  • Disulfide bond

13
Ab initio/new fold prediction
  • Physics-based (laws of physics)
  • Knowledge-based (rules of evolution)

14
Levitt
15
Levitt
16
Levitt
17
Levitt
18
Levitt
19
Levitt
20
Levitt
21
Levitt
22
Levitt
23
Levitt
24
Levitt
25
Levitt
26
Levitt
27
Molecular Mechanics (Force Field)
28
Levitt
29
(No Transcript)
30
1-microsecond MD simulation
980ns
  • villin headpiece
  • 36 a.a.
  • 3000 H2O
  • 12,000 atoms
  • 256 CPUs (CRAY)
  • 4 months
  • single trajectory

Duan Kollman, 1998
31
Protein folding by MD
PROTEIN FOLDINGA Glimpse of the Holy
Grail? Herman J. C. Berendsen "The Grail had
many different manifestations throughout its long
history, and many have claimed to possess it or
its like". We might have seen a glimpse of it,
but the brave knights must prepare for a long
pursuit.
32
Massively distributed computing
  • SETI_at_home
  • Folding_at_home
  • Distributed folding
  • Sengents drug design
  • FightAIDS_at_home

33
Massively distributed computing
Letters to nature (2002)
  • engineered protein (BBA5)
  • zinc finger fold (w/o metal)
  • 23 a.a.
  • solvation model
  • thousands of trajectories each of 5-20 ns,
    totaling 700 ms
  • Folding_at_home
  • 30,000 internet volunteers
  • several months, or a million CPU days of
    simulation

34
Energy landscapes of protein folding
Borman, CE News, 1998
35
Protein-folding prediction technique
CGU Convex Global Underestimation - K. Dills
group
36
Challenges of physics-based methods
  • Simulation time scale
  • Computing power
  • Sampling
  • Accuracy of energy functions

37
Structure Prediction Methods
Homology modeling
Fold recognition
ab initio
0 10 20 30 40 50 60
70 80 90 100
sequence identity
38
Flowchart of homology (comparative) modeling
From Marti-Renom et al.
39
Fold recognition
Find, from a library of folds, the 3D
template that accommodates the target sequence
best. Also known as threading or inverse
folding Useful for twilight-zone sequences
40
Fold recognition (aligning sequence to structure)
(David Shortle, 2000)
41
3D-gt1D score
42
On X-ray, NMR, and computed models
43
(Rost, 1996)
44
Reliability and uses of comparative models
Marti-Renom et al. (2000)
45
Pitfalls of comparative modeling
  • Cannot correct alignment errors
  • More similar to template than to true structure
  • Cannot predict novel folds

46
Ab initio/new fold prediction
  • Physics-based (laws of physics)
  • Knowledge-based (rules of evolution)

47
From 1D ? 2D ? 3D
Primary
LGINCRGSSQCGLSGGNLMVRIRDQACGNQGQTWCPGERRAKVCGTGNSI
SAYVQSTNNCISGTEACRHLTNLVNHGCRVCGSDPLYAGNDVSRGQLTVN
YVNSC
seq. to str. mapping
Secondary(fragment)
Tertiary
fragment assembly
48
CASP Experiments
49
One lab dominated in CASP4
One group dominates the ab initio
(knowledge-based) prediction
50
Some CASP4 successes
Bakers group
51
Ab initio structure prediction server
52
Toward High-Resolution de Novo Structure
Prediction for Small Proteins --Philip
Bradley, Kira M. S. Misura, David Baker (Science
2005)
The prediction of protein structure from amino
acid sequence is a grand challenge of
computational molecular biology. By using a
combination of improved low- and high-resolution
conformational sampling methods, improved
atomically detailed potential functions that
capture the jigsaw puzzlelike packing of protein
cores, and high-performance computing,
high-resolution structure prediction (lt1.5
angstroms) can be achieved for small protein
domains (lt85 residues). The primary bottleneck to
consistent high-resolution prediction appears to
be conformational sampling.
53
3D to 1D?
Science 2003
54
A computer-designed protein (93 aa) with 1.2 A
resolution
55
Structure prediction servers
http//bioinfo.pl/cafasp/list.html
56
Hybrid approach for solving macromolecular
complex structures
57
Thank You!
Write a Comment
User Comments (0)
About PowerShow.com