Title: Untangling Knots in Lattices and Proteins
1Untangling Knotsin Lattices and Proteins
A Computational Study
- By Rhonald Lua
- Adviser Alexander Yu. Grosberg
2Human Hemoglobin(oxygen transport protein)
(Structure by G. FERMI and M.F. PERUTZ)
Globular proteins have dense, crystal-like
packing density. Proteins are small biomolecular
machines responsible for carrying out many life
processes.
3Hemoglobin Protein Backbone (string of a-carbon
units)
One chain
ball of yarn
44x4x4 Compact Lattice Loop
Possible cube dimensions 2x2x2,4x4x4,6x6x6,,LxLx
L,
No. of distinct conformations
(Flory)
NL3, z 6 in 3D
514x14x14 Compact Lattice Loop
What kind of knot is (in) this?
6Subchains
contour length l lt N2/3
What are the statistics of the end-to-end
distance?
7In this talk
- Knots and their relevance to physics
- Virtual tools to study knots
- Results for lattices
- Results for proteins
- Knotting probabilities
- Subchain statistics
- Knot closure methods
- Knotting probabilities
- Subchain statistics
8Knot a closed curve in space that does not
intersect itself.
The first few knots
Trivial knot (Unknot) 0-1
3-1 (Trefoil)
4-1 (Figure-8)
5-1 (Cinquefoil, Pentafoil Solomons seal)
5-2
9Knots in Physics
- Lord Kelvin (1867) Atoms are knots (vortices) of
some medium (ether). - Knots appear in Quantum Field Theory and
Statistical Physics. - Knots in biomolecules. Example The more
complicated the knot in circular DNA the faster
it moves in gel-electrophoresis experiments.
10Protein Folding
denatured state
collapsed native state (functioning protein)
- Knots present a barrier to folding.
- If knots are rare, is protein folding ergodic?
(Mansfield 1994) - Are proteins selected by evolution to preclude
knots?
11A Little Knot Math
12Reidemeister Moves
Reidemeisters Theorem Two knots are equivalent
if and only if any diagram of one may be
transformed into the other via a sequence
of Reidemeister moves.
13Compounded Reidemeister Moves
14Knot Invariants -Mathematical signatures of a
knot.
D(-1)1 v20 v30
Examples
Trivial knot 0-1
D(-1)3 v21 v31
Trefoil knot 3-1
15Prime and Composite Knots
Composite knot, K
K1
K2
Alexander
Vassiliev
16Method to Determine Type of Knot
Project 3D object into 2D diagram.
Inflation/tightening for large knots.
Preprocess and simplify diagram using
Reidemeister moves.
Compute knot invariants.
Give object a knot-type based on its signatures.
17A. Projection
2D knot projection
3D conformation
projection process
Projected nodes and links
18B. Preprocessing
Using Reidemeister moves
19C. Knot Signature Computation
20Caveat!
- Knot invariants cannot unambiguously classify a
knot. - However
- For prime knots with 10 crossings or fewer (249
knots in all), knots invariants for knots
0-1,3-1,4-1,5-1,5-2 are unique with the exception
(5-1 and 10-132) - Reidemeister moves and knot inflation can
considerably reduce the number of possibilities.
21Knot Inflation
Monte Carlo. Points remain on a lattice.
22Knot Tightening
Shrink-On-No-Overlaps (SONO) method of Piotr
Pieranski. Scale all coordinates slt1, keep bead
radius fixed.
23Results
24Knotting Probabilities for Compact Lattice Loops
25Chance of getting an unknot
for several cube sizes
Mansfield slope -1/270
slope -1/196
26Chance of getting the first few simple knots for
different cube sizes
27Subchain statistics
2814x14x14 Compact Lattice LoopMean square
end-to-end distance versus length of subchain
Subchain (fragment)
Fragments of trivial knots are more crumpled
compared to fragments of all knots.
29Contrast with Non-compact, Unrestricted
LoopAverage gyration radius (squared) versus
length
Closed random walk with fixed step length
(N. Moore)
Trivial knots swell compared to all knots for
non-compact chains. This topologically-driven
swelling is the same as that driven
by self-avoidance (Flory exponent 3/5 versus
gaussian exponent 1/2).
30Compact Lattice Loops
General scaling of subchains (mean-square
end-to-end) versus length
Over all knots
i.e. Gaussian Florys result for chains in a
polymer melt.
Trivial knots
?
(A. Borovinskiy)
31Knots in Proteins
32Previous work
1. M.L. Mansfield (1994) Approx. 400 proteins,
with random bridging of terminals, using
Alexander polynomial. Found at most 3 knots. 2.
W.R. Taylor (2000) 3440 proteins, fixing the
terminals and smoothing (shrinking) the segments
in between. Found 6 trefoils and 2
figure-eights. 3. J.R. Banavar, T.X. Huang, A.
Maritan (2005) (Not about knots) Flory theorem
holds for proteins. 4. K. Millet, A. Dobay, A.
Stasiak (2005) (Not about proteins) A study of
linear random knots and their scaling behaviour.
33Finding knots in proteins
Protein Data Bank (PDB/PAPIA)
4716 protein chains
Extract coordinates of backbone protein chain
Close the knot with DIRECT,CENTER,RANDOM
Project, simplify and compute knot
invariants/signatures
34Protein gyration radius versus length
35Distribution of the distance of the protein
terminals from the center-of-mass
36DIRECT closure method
T1, T2 protein terminals
37CENTER closure method
C center of mass S1, S2 - located on surface of
sphere surrounding the protein F- point at some
large distance away from C
38RANDOM closure method
(random)
(random)
Study statistics of knot closures after
generating 1000 pairs of points (S1 and S2).
Determine the dominant knot-type.
39Knot probabilities in RANDOM closures for protein
1ejg chain A
N46
next dominant
dominant
40Knot probabilities in RANDOM closures for protein
1xd3 chain A
N229
next dominant
dominant
41Distribution of the of RANDOM closures giving
the dominant knot-type
42Knot counts of the 4716 protein chains in the
three closure methods
- RANDOM and CENTER methods gave the same
predictions for all but 5 chains. - DIRECT method gave significantly more knots.
43Unknotting probabilities versus length for
proteins and for compact lattice loops
Total of 19 non-trivial knots in the RANDOM
method. Knots in proteins occur much less often
than in compact lattice loops.
44Scaling of subchains in proteins and in compact
lattice loops
45Degree of interpenetration of subchains
46Secondary and Tertiary structures
b strand
a helix
Basic units
aa
bab
bb
Supersecondary
bbbb (Greek key)
aaaa (4-helix Bundle)
babab (Rossman fold)
Domain folds (examples)
Based on http//www.med.unibs.it/marchesi/pps97/c
ourse/section9/9_term.html and http//swissmodel.e
xpasy.org/course/text/chapter4.htm
47Summary of scales in proteins
48Summary of Results
- Unknotting probability drops exponentially with
chain length. - Compact lattice subchains satisfy Florys
theorem. Subchains of trivial knots are
consistently smaller than subchains of
non-trivial knots. For noncompact conformations,
the opposite is observed. The fragments seem to
be aware of the knottedness of the whole thing
(AYG). - Knots in proteins are rare, compatible with the
compactness of protein subchains.
49Acknowledgments
A. Yu. Grosberg. A. Borovinskiy, N. Moore. P.
Pieranski and associates for SONO
animation. Minnesota Supercomputer Institute,
DTC. DDF support, UMN Graduate School. Knot
Mathematicians. Biologists, Chemists and other
researchers for making protein structures
available.