Title: Parallel Processing Techniques for the Nonlinear Dynamic Analysis of Structures
1Parallel Processing Techniques for the Nonlinear
Dynamic Analysis of Structures
Elisa D. Sotelino
School of Civil Engineering Purdue University
2Statement of the Problem
u(t)
fext(t)
3Newmark Method (1959)
Finite difference formulas
Unconditional Stability
4Newmark Family of Methods
5General Solution Scheme
- Estimate displacements, velocities and
accelerations using Newmarks finite difference
approximations
- Compute the displacement increment as
6General Solution Scheme (Cont.)
- Update the displacements, velocities and
accelerations using the obtained corrections as
- Increment time nn1, tth and go to the second
step if not converged
7Central Difference Method
Time-step size for stability
8Parallel Solution Algorithms
- Substructuring-Based Methods
- Domain-Splitting Methods
- Parallel Central Difference Method
- FETI Algorithm
- GI Algorithm
- MIGI Algorithm
9Domain Partitioning
- Automatic Domain Partitioning
- Usually based on Graph Theory
- Important especially for large 3-D meshes
- Manual Domain Partitioning
- Fine tuning partitioning
10Substructuring-Based Methods
- Divide the structure into substructures
- Condense out interior DOF (in parallel)
- Solve the reduced system for the interface DOF
(in parallel) - Recover interior DOF solution from the interface
DOF
However communication is necessary since the
reduced stiffness matrix and load vector have
overlapping coefficients
11Domain-Splitting Methods
Parallel Central Difference Scheme (Hajjar
Abel 1989)
12Finite Element Tearing and Interconnecting (FETI)
Algorithm (Farhat Roux 1991)
is the subdomain connectivity matrix ? are
the Lagrange multipliers s 1, 2, , Ns
13Performance of the FETI Algorithm
3-D Example
14Group Implicit (GI) Algorithm (Sotelino et al.
1990)
Partition the mesh into
subdomains
1.
For each
subdomain
Solve using an implicit scheme and
accept solution for the interior
DOFs
15GI Algorithm (Cont.)
2.
Enforce compatibility of interface DOFS
3.
Accept solution
assemble displacement vector
16GI Algorithm - Characteristics
- Same stability characteristics as underlying
implicit procedure
Consistent
mass averaging rule
For accuracy time
-
step size is limited by
a Courant type condition
17GI Algorithm - Characteristics
Strengths
Highly modular
l
ow communication overhead
Weakness
Deterioration of accuracy (unreliable for
practical h)
Reason for inaccuracies
Mass averaging rule introduces unknown amount
of interface reactive forces
equilibrium is not satisfied
18Modified Iterative Group Implicit (MIGI)
Algorithm (Modak Sotelino 1997, Dere
Sotelino 2002)
Partition the mesh into subdomains
- Solve each subdomain estimate interface
reactive forces.
19MIGI Algorithm (Cont.)
2. Iterate until convergence
2.1 Solve each subdomain for the computed
corrective forces
2.2 Enforce compatibility
20MIGI Algorithm (Cont.)
2.3 Re-evaluate interface reactive forces from
residual interface displacements
2.4 Assemble interface reactive forces
21MIGI Algorithm (Cont.)
2.5 IF THEN convergence is achieved ELSE
distribute interface residual forces among
DOFs
2.6 Update interface force vector
22Nonlinear MIGI Algorithm
- Two possible schemes
- Combined iterations nonlinear and
- MIGI iterations are combined
- Separate iterations nonlinear and MIGI
- iterations are isolated
23Iterative Integration Schemes
Separate Scheme
Sequential Solution
Load
Rn1
Rn
Combined Scheme
Displac.
24Combined Scheme
Separate Scheme
25Performance Comparison
Mass density 2.26e-2 lb-sec2/in Cross-section
W12x14 standard section
26Accuracy Verification
27Nonlinear Iterations
28Total Analysis Time
29Total time comparison
30Average times per main iteration
31Total time comparison
32Average times per main iteration
Note Total time (sequential) 188 sec
33Demonstration
Nonlinear Transient Analysis of a 20-story SAC
Building
- 266 Nodes, 391 Elements
- Fiber beam-column elements
- SAC building model data Gupta,
- A. and Krawinkler, H. (1999)
- Northridge Newhall Earthquake
- Lumped mass, Rayleigh damping
- Material and Geometric
- nonlinearity
- IBM SP multicomputer
34Domain partitioning (MPE)
35Top Story Displacement
36Speed-up
37Asymptotic iteration performance
Time step size Dt 0.01 sec
Number of Processors
Total analysis time (sec)
CPU time/Dt (sec)
Number of nonlinear iterations
1 2 3 4 5 6
15,031 8,659 5,762 4,414 3,956 3,458
7.50 4.32 2.87 2.20 1.96 1.72
4,323 4,321 4,321 4,312 4,315 4,307
38Demonstration 3D
39Domain Partitioning (MPE)
40Top Story Displacement
41Speed-up
42Future Research Directions
- Improved Automatic Domain Decomposition
Algorithms - Dynamic Load Balancing
- New algorithms for Grid Computing (communication
is major handicap)