Title: Application of Reduced Order Modeling to Time Parallelization
1Application of Reduced Order Modeling to Time
Parallelization
- Ashok Srinivasan, Yanan Yu, and Namas Chandra
- Florida State University
- http//www.cs.fsu.edu/asriniva
Aim Simulate for long time scales Solution
features Use data from prior simulations and
experiments to parallelize the time domain
Acknowledgements NSF, ORNL, NCSA
2Outline
- Limitations of Conventional Parallelization
- Example Application Carbon Nanotube Tensile Test
- A Drawback of Molecular Dynamics Simulations
- Small Time Step Size
- Data-Driven Time Parallelization
- Reduced order modeling is used for prediction
- Experimental Results
- Scaled efficiently to 400 processors, for a
problem where conventional parallelization scales
to just 2-3 processors - Conclusions
3Limitations of Conventional Parallelization
- Conventional parallelization decomposes the state
space across processors - It is effective for large state space
- It is not effective when computational effort
arises from a large number of time steps - or when granularity becomes very fine due to a
large number of processors
4Example Application Carbon Nanotube Tensile Test
- Pull the CNT at a constant velocity
- Determine stress-strain response and yield strain
(when CNT starts breaking) using MD - Strain rate dependent, in reality
- MD uses unrealistically large strain-rates
5A Drawback of Molecular Dynamics Simulations
- Molecular dynamics
- In each time step, forces of atoms on each other
modeled using some potential - After force is computed, update positions
- Repeat for desired number of time steps
- Time steps size 10 15 seconds, due to physical
and numerical considerations - Desired time range is much larger
- A million time steps are required to reach 10-9 s
- Around a day of computing for a 3000-atom CNT
6Data-Driven Time Parallelization
- Each processor simulates a different time
interval -
- Initial state is obtained by prediction, except
for processor 0 - Verify if prediction for end state is close to
that computed by MD - Prediction is based on dynamically determining a
relationship between the current simulation and
those in a database of prior results
If time interval is sufficiently large, then
communication overhead is small
7Dimensionality Reduction
- Movement of atoms in a 1000-atom CNT is the
motion of a point in 3000-dimensional space - Find a lower dimensional subspace close to which
the points lie - We use principal orthogonal decomposition
- Find a low dimensional affine subspace
- Motion may be complex in this subspace
- Use results for different strain rates
- Velocity 10m/s, 5m/s, and 1 m/s
- At five different time points
- U, S, V svd(Shifted Data)
- Shifted Data USVT
- States of CNT expressed as
- m c1 u1 c2 u2
??
??
m
8Basis Vectors from POD
- CNT of 100 A with 1000 atoms at 300 K
u1 (blue) and u2 (red) for z u1 (green) for x is
not significant
Blue z Green, Red x, y
9Relate strain rate and time
- Coefficients of u1
- Blue 1m/s
- Red 5 m/s
- Green 10m/s
- Dotted line same strain
- Suggests that behavior is similar at similar
strains - In general, clustering similar coefficients can
give parameter-time relationships
10Prediction
- Direct Predictor
- Independently predict change in each coordinate
- Use precomputed results for 40 different time
points each for three different velocities - To predict for (t v) not in the database
- Determine coefficients for nearby v at nearby
strains - Fit a linear surface and interpolate/extrapolate
to get coefficients c1 and c2 for (t v) - Get state as m c1 u1 c2 u2
Green 10 m/s, Red 5 m/s, Blue 1 m/s, Magenta
0.1 m/s, Black 0.1m/s through direct prediction
- Dynamic Prediction
- Correct the above coefficients, by determining
the error between the previously predicted and
computed states
11Stress-strain response at 0.1 m/s
- Blue Exact result
- Green Direct prediction with interpolation /
extrapolation - Points close to yield involve extrapolation in
velocity and strain - Red Time parallel results
12Speedup
- Red line Ideal speedup
- Blue v 0.1m/s
- Green An earlier basis
- v 1m/s, using v 10m/s
13Problems with multiple time-scales
- A common difficulty in problems with multiple
time scales - Finer scale models (such as MD) are more
accurate, but more time consuming - Much of the details at the finer scale are
unimportant, but some are - Larger scale models are faster, but may miss
important behavior observed at the finer scale - So a finer time-scale model is used, but limits
the temporal scale that can be reached in a
realistic simulations
A simple schematic of multiple time scales
14Solution
- Use results of related finer scale simulations to
model the significant effects on the larger scale - Example (long time, high temperature/strain
rate) -gt (short time, low temperature/strain
rate) - Technique Reduced order modeling
- Identify important modes of behavior
- Relationships between simulation parameters
- Technique clustering
- Interpolate from existing simulation results, to
predict behavior when possible - Parallelization of time when unexpected modes
might be significant - Technique Learning
15Conclusions
- Time parallelization shows significant
improvement in speed, without sacrificing
accuracy significantly - This suggests that time can be considered,
effectively, as a parallelizable domain - Direct prediction can yield several orders of
magnitude improvement in performance when
applicable
16Future Work
- More complex problems
- Better prediction
- POD is good for representing data, but not
necessarily for identifying patterns - Use better dimensionality reduction / reduced
order modeling techniques - Better learning in the predictor will also be
useful - Simulations with multiple parameters
- Example Predict based on simulations that differ
in temperature and strain rate - Such simulations may differ significantly from
those in the database