Title: DNA: Structure, Dynamics and Recognition
1DNA Structure, Dynamics and Recognition
L3 DNA dynamics
Les Houches 2004
2STRUCTURAL DATABASES
3RSCB-PDB
http//www.rscb.org
H.M. Berman, J. Westbrook, Z. Feng, G.
Gilliland, T.N. Bhat, H. Weissig, I.N.
Shindyalov, P.E. Bourne Nucleic Acids Research,
28 pp. 235-242 (2000)
4RSCB
5Full search
6Custom forms
7An example - chymotrypsin
8Display
9PDB file header
10NDB -http//ndbserver.rutgers.edu/
11NDB - atlas
12NDB - atlas
13CCDC - 1965
http//www.ccdc.cam.ac.uk/
14CSD small molecules
15CCDC products
16TRANSFAC - http//www.gene-regulation.com/
17HELICAL PARAMETERS
18Helical symmetry
- move between residues by translation along and
- rotation around the helical axis
19Finding a helical axis
(2) Find plane defined by vectors placed at origin
(1) Build vectors between helically
equivalent atoms
(3) Draw perpendiculars to chords between heads
of vectors
20Curved helical axis ?
21Cambridge convention
3
5
5
3
Dickerson et al. J. Mol. Biol. 205, 1989, 787
22Helical parameters
Translation
Rotation
23Helicoidal parameters
Local
Global
24Extreme global solutions
Keep linear helical axis
Keep monomer orientations
25CURVES
R. Lavery and H. Sklenar J. Biomol. Struct. Dyn.
6, 1989, 655
26Base reference system
27Axis reference system
28Curves function
A) Bases (X, E) in the same place with respect
to the local axis (U, P) ? Â A1) Rotation S
(Ui.Xi - Ui-1.Xi-1)2 - where X J, K, L Â A2)
Translation S (Pi - Ei).Xi - (Pi-1 -
Ei-1).Xi-1)2 Â B) Axis straight and continuous
? Â B1) Rotation S (Ui - Ui-1)2 Â B2)
Translation S (Qi - Qi-1)2 - where Qi (Pi -
Pi-1) - ltUgt.ltUgt.(Pi - Pi-1) Â Final formula
F (X,Y,I,T) 10 (A1 B1) (A2 B2)
29Curves helical axis
30Helical parametersfor B- and A-DNA
Parameter B-DNA A-DNA Xdisp 0.0 -5.28 Y
disp 0.0 0.0 Inclination 1.5 20.7Â Propeller -13.3
-7.5 G-Slide 0.0 0.0 G-Rise 3.38 2.56 G-Roll 0.0
0.0 G-Twist 36.0 32.7 Â L-Slide 0.08 -1.92 L-Rise
3.38 3.44 L-Roll 0.9 11.4 L-Twist 35.6 30.7 Â Pha
se 155 18 Amplitude 40 42
31B-DNA structural variation
Hartmann and Lavery Q. Rev. Biophys. 29, 1996, 309
Value Min. Max. Mean Value Min. Max. Mean
Xdisp -0.6 1.5 0.5 Shift -1.1 1.1 0.0
Inc 11 11 -1 Slide -1.1 1.1 -0.1
Tip -11 11 0 Rise 3.2 4.0 3.4
Buck -11 16 1 Tilt -8 8 0
Prop -24 5 -11 Roll -12 16 2
Open -8 8 1 Twist 22 51 36
a -109 -23 -64 d 90 173 132
b 112 -159 166 e 141 -61 -179
g -16 104 50 z 148 -68 -94
Pha 19 206 150 c -62 -160 -104
Amp 15 58 39
32B-DNA - 2ns dynamic trajectory
33"LONG" MD SIMULATIONS
34Molecular dynamics
Time integration of Newton's equation of
motion F ma -dE/dr m dr2/dt2 Taylor
expansion r(t dt) r(t) dt dr(t)/dt dt2/2
d2r(t)/dt2 Fastest movements O(10-15 s)
r
t
35Periodic boundary conditions
36Equilibration
Temp (K)
Equilibrate
Production
300
NPT ensemble
Reassign or rescale velocities
200
Heat
100
Initially constrain solute
0
Minimize
Time (ns)
1
2
3
37MD snapshots
38MD time series- sugar phase- groove width
39MD time series- base pair Hbonds
40Ascona B-DNA Consortium
F. Lankas, Herovsky Inst. Czech Republic ?
EPFL R. Lavery, IBPC France J. Maddocks,
EPFL Switzerland H. Sklenar, MDC Germany
D. Beveridge, Wesleyan U. D. Case, Scripps
Institute T. Cheatham, U. Utah R. Osman, Mount
Sinai, NY M. Young, Berkeley
41136 unique tetramers
AAAA AAAC AAAG AAAT AAGA AAGC AAGG AAGT AATA AATC
AATG AATT ACGA ACGC ACGG ACGT AGAA AGAC AGAG AGAT
AGCA AGCC AGCG AGCT AGGA AGGC AGGG AGGT AGTA AGTC
AGTG AGTT ATAA ATAC ATAG ATAT ATGA ATGC ATGG ATGT
CAAA CAAC CAAG CAAT CAGA CAGC CAGG CAGT CATA CATG
CCGA CCGG CGAA CGAC CGAG CGAT CGCA CGCG CGGA CGGC
CGGG CGGT CGTA CGTC CGTG CGTT CTAA CTAG CTGA CTGC
CTGG CTGT GAAA GAAC GAAG GAAT GAGA GAGC GAGG GAGT
GATA GATC GATG GCGA GCGC GCGG GGAA GGAC GGAG GGAT
GGCA GGCC GGCC GGGA GGGC GGGG GGGT GGTA GGTC GGTG
GGTT GTAA GTAC GTAG GTGA GTGC GTGG GTGT TAAA TAAC
TAAG TAAT TAGA TAGC TAGG TAGT TATA TCGA TGAA TGAC
TGAG TGAT TGCA TGGA TGGC TGGG TGGT TGTA TGTC TGTG
TGTT TTAA TTGA TTGC TTGG TTGT
42ABC oligomers - construction
G-D-ABCD-ABCD-ABCD-G
- 15 base pairs
- Central tetranucleotide repeats
- GC terminal base pairs for stability
- No sampling for ilt3 or igt13
- Two copies of each tetranucleotide
4339 oligomer database
GGGGGGGGGGGGG AAAAAAAAAAAAA CGCGCGCGCGCGC TATATATA
TATAT AGAGAGAGAGAGA TGTGTGTGTGTGT AGGGAGGGAGGGA CG
GGCGGGCGGGC TGGGTGGGTGGGT GAAAGAAAGAAAG CAAACAAACA
AAC TAAATAAATAAAT CGGCCGGCCGGCC AGGAAGGAAGGAA TGGT
TGGTTGGTT TAATTAATTAATT CGGACGGACGGAC AGGCAGGCAGGC
A AGGTAGGTAGGTA TGGATGGATGGAT CGGTCGGTCGGTC TGGCTG
GCTGGCT CAAGCAAGCAAGC GAACGAACGAACG TAACTAACTAACT
CAATCAATCAATC TAAGTAAGTAAGT GAATGAATGAATG TGAGTGAG
TGAGT CGAGCGAGCGAGC TGCGTGCGTGCGT TAGATAGATAGAT GA
CAGACAGACAG TACATACATACAT AGCTAGCTAGCTA TGCATGCATG
CAT CGATCGATCGATC TGACTGACTGACT CGTACGTACGTAC
44Simulation protocol
- AMBER program
- PARM94 parameters
- Truncated octahedral box (7600 waters)
- Neutralising K counterions
- Particle Mesh Ewald electrostatics
- 2 fs timestep (SHAKE on X-H)
- Careful equilibration, NVT?NPT
- Save configuration every 1ps
- 15 ns trajectories (for Phase I)
45ABC dataset Phase I ? Finished 5/03
- 150 months of CPU time
- 0.6 ms of simulation
- (2.2x Vilin folding simulation)
- 600,000 coordinate sets
- 272 tetranucleotide steps
- 400 Gb of data
46ACGT trajectory
B-DNA A-DNA Last ns
47Helical parameters
Translation
Rotation
48ACGT helical parameters - instantaneous
49SYMMETRY?
50Each oligomer contains 2 "identical" tetramers
51ACGT helical parameters - histograms
C6pG7 C10pG11
52GCGC helical parameters - histograms
C4pG5 C6pG7 C8pG9 C10pG11
53GCGC helical parameters
54Backbone torsion angles
- d C5 C4 C3 O3
- O5 C5 C4 C3
- b P O5 C5 C4
- a O3 P O5 C5
- z C3 O3 P O5
- e C4 C3 O3 P
55CGCG backbone parameters
G7pC8 G11pC12
C6pG7 C10pG11
g-/g
g/t
56ag transition
a - G11pC12 g - G11pC12
g-
t
g
g
57ag impact on twist
58SEQUENCE DEPENDENT STRUCTURE?
59CpG translational parameters
60CpG rotational parameters
61IONS AROUND DNA
62Diffusion coefficients(10-9 m2sec-1 )
lt(xi(t0dt)-xi(t0))2gt 6D dt
K D 2.85 Exp1.96
lt(xi(t0dt)-xi(t0))2gt
Na D 1.72 Exp1.33
63Volume sampled by ions during 50ns simulation time
K
Na
64Most frequently visited zones
K
Na
65Tight binding
phosphate 2 2 3 2 2 9 5 3 13 2 3
strand 1 C1 C2 A3 T4 G5 C6 G7 C8 T9 G10 A11 C12
groove 1 3 7 12 2 1 5
groove 2 1 2 1 3 8 8,1 6 3
strand 2 G24 G23 T22 A21 C20 G19 C18 G17 A16 C15 T14 G13
phosphate 2 6 3 17 2 4 12 3 8 3 2
Na
phosphate 1 2 2 3 2 5 2 4 3 2 3
strand 1 C1 C2 A3 T4 G5 C6 G7 C8 T9 G10 A11 C12
groove 1 2 7 4 13,3 2 1
groove 2 2 2 1,1 1 3,8 3 11 1,6 1,1 1 2
strand 2 G24 G23 T22 A21 C20 G19 C18 G17 A16 C15 T14 G13
phosphate 2 3 2 3 2 4 3 3 6 1 1
K
66Minor groove width at C8 level
67a/g transitions in Na dynamics
red g green a
68a/g transitions in K dynamics
red g green a