Title: Earth System Modeling Infrastructure
1Earth System Modeling Infrastructure
Cecelia DeLuca/ESMF-NCAR March 31-April 1,
2009 CHyMP Meeting
2Outline
- Elements of interoperability platforms
- Integrating across elements
- Summary
3Elements of interoperability platforms
- Tight coupling tools and interfaces
- - hierarchical and peer component relationships
- - frequent, high volume transfers on high
performance computers - Loose coupling tools and interfaces- generally
peer-peer component relationships- lower volume
and infrequent transfers on desktop and
distributed systems - Science gateways- browse, search, and
distribution of model components, models, and
datasets- visualization and analysis services-
workspaces and management tools for
collaboration - Metadata conventions and ontologies- ideally,
with automated production of metadata from models - Governance- coordinated and controlled
evolution of systems
4Tight coupling tools and interfaces
- Examples
- Earth System Modeling Framework (ESMF) - NASA,
NOAA, Department of Defense, community weather
and climate models, U.S. operational numerical
weather prediction centers (HPC focus) - http//www.esmf.ucar.edu
- Flexible Modeling System (FMS) NOAA precursor
to ESMF, still used at the Geophysical Fluid
Dynamics Laboratory for climate modeling - http//www.gfdl.noaa.gov/fms/
- Space Weather Modeling Framework (SWMF)
NASA-funded, used at the University of Michigan
for space weather prediction
5How coupling tools work
- Users wrap their native data in framework data
structures - Users adopt standard calling interfaces for a set
of methods that enable data exchange between
components - Development toolkits help users with routine
functions (regridding, time management, etc.)
6ESMF Standard interfaces
- Three ESMF component methods Initialize, Run,
and Finalize (I/R/F) - Each can have multiple phases
- Users register their native I/R/F methods with an
ESMF Component - Small set of arguments
call ESMF_GridCompRun (myComp, importState,
exportState,clock, phase, blockingFlag, rc)
7ESMF Distributed data representation
- 1. Representation in index space (Arrays)
- Simple, flexible multi-dimensionalarray
structure - Regridding via sparse matrix multiply
withuser-supplied interpolation weights - Scalable to 10K processors - no
globalinformation held locally - 2. Representation in physical space (Fields)
- Built on Arrays some form of Grid
- Grids are logically rectangular, unstructured
mesh, or observational data streams - Regridding via parallel on-line interpolation
weight generation, bilinear or higher order
options - Intrinsically holds significant amounts of
metadata - - dynamic, usable for multiple purposes, limited
annotation required
Supported Array distributions
8ESMF Coupling options
- Generally single executable for simpler
deployment - Push mode of data communication is very efficient
- Coupling communications can be set up and called
in a coupler, or called directly from within
components (for I/O, data assimilation)
- Hierarchical components for organization into
sub-processes - Recursive components for nesting higher
resolution regions - Coupling across C/Cand Fortran
- Ensemble management
ESMF-based hierarchical structure of GEOS-5
atmospheric GCM
9ESMF Performance portability
- ESMF is highly performance portable, low (lt5)
overhead - 3000 regression tests run on 30
platform/compiler combinations nightlySee
http//www.esmf.ucar.edu/download/platforms - Newer ports include native Windows, Solaris
- Using TeraGrid Build and Test Service to simplify
regression testing
Performance at the petascale Scaling of the
ESMF sparse matrix multiply, used in regridding
transformations, out to 16K processors. (ESMF
v3.1.0rp2) Plot from Peggy Li, NASA/JPL Tested
on ORNL XT4, -N1 means 1 core per node.
ASMM Run-Time Comparison
msec
10ESMF Higher order interpolation techniques in
CCSM
- ESMF higher order interpolation weights were used
to map from a 2-degree Community Atmospheric
Model (CAM) grid to a POP ocean grid (384x320,
irregularly spaced) - 33 reduction in noise globally in quantity
critical for ocean circulation compared to
previous bilinear interpolation approach - ESMF weights are now the CCSM default
Interpolation noise in the derivative of the
zonal wind stress
Interp. noise
grid index in latitudinal direction
Black bilinear Red higher-orderESMF
v3.1.1 Green higher order ESMF v4.0.0
11HAF
ESMF Model map
SWMF
Legend Ovals show ESMF components and models that
are at the working prototype level orbeyond.
GAIM
CCSM4
Dead atm
Data atm
POP Ocean
NOAA Department of Defense University NASA Departm
ent of Energy National Science Foundation ESMF
coupling complete ESMF coupling in
progress Component (thin lines) Model (thick
lines)
Dead ocean
Data ocean
Stub ocean
CICE ice
Dead ice
Data ice
Stub ice
FIM
CLM
Dead land
Data land
Stub land
Ice sheet
Strat Chem
GEOS-5
Param Chem
GEOS-5 Atm Dynamics
GOCART
FV Cub Sph Dycore
GEOS-5 GWD
GEOS-5 FV Dycore
Tracer Advection
Land Info System
GSI
GEOS-5 Atm Physics
GEOS-5 Hiistory
GEOS-5 Atm Chem
GEOS-5 Aeros Chem
GEOS-5 Surface
GEOS-5 Topology
GEOS-5 Moist Proc
GEOS-5 Lake
GEOS-5 Turbulence
GEOS-5 Land Ice
WRF
POP
ROMS
MOM4
UCLA AGCM
SWAN
CICE
COAMPS
MITgcm
MITgcm Atm
MITgcm Ocean
pWASH123
ADCIRC
NCOM
HYCOM
12Loose coupling tools and interfaces
- Examples
- OpenMI
- http//www.openmi.org
- Web service approaches
- Coupling options
- Generally multiple executable
- Pull mode of data communication simple but not
efficient(ask for a data point based on
coordinates) - Generally peer-peer component relationships
- Coupling across multiple computer languages
(Python, Java, C, etc.)
13Science gateways access centers
- Examples
- Earth System Grid (ESG) DOE, NCAR, NOAA
support, used to distribute Intergovernmental
Panel on Climate Change data and for climate
research - http//www.earthsystemgrid.org
- Hydrologic Information System (HIS) - NSF
funded, used to enhance access to data for
hydrologic analysishttp//his.cuahsi.org - Object Modeling System (OMS) - USDA effort, used
for agricultural modeling and analysishttp//java
forge.com/project/1781
14Metadata conventions and ontologies
- Examples
- Climate and Forecast (CF) conventions - spatial
and temporal properties of fields used in weather
and climate - http//cf-pcmdi.llnl.gov
- METAFOR Common Information Model (CIM) large
EU-funded project, climate model component
structure and properties (including technical and
scientific properties) http//metaforclimate.eu - WaterML Schema for hydrologic data developed by
the Consortium of Universities for the
Advancement of Hydrologic Science
(CUAHSI)http//his.cuahsi.org/wofws.html -
-
15Governance
- Pervasive issue in community modeling
- Divergent effects of
- Multiple institutions
- Geographic dispersion
- Multiple domains of interest (working groups)
- Must be balanced by strong integration body -
strategies - Meets frequently enough to affect routine
development (quarterly) - Meets virtually to get sufficient representation
- Includes user and other stakeholder
representatives - Authorized to prioritize and set development
schedule - Supported by web-based management tools
16Integrating across interoperability elements
- Examples from the Curator project (NSF and NASA)
- Automated output of CF and CIM XML schema from
ESMF (tight coupling ontology) - Ingest of ESMF-generated schema into ESG,
propagation into tools for search, browse,
inter-comparison and distribution of model
components and models(tight coupling ontology
science gateway) - Implementation of dataset trackback in ESG that
connects datasets with detailed information about
the models used to create the data - (tight coupling ontology science gateway)
- Implementation of personal and group workspaces
in ESG - (science gateway governance)
17Integrating across interoperability
elements(cont.)
- Translation of ESMF interfaces into web services
to enable invocation of ESMF applications from a
science gateway, and enable data and metadata
from the run to be stored back to the
gateway(tight coupling loose coupling
science gateway ontology, new TeraGrid funding)
Web service interface
Loosely coupled components
ESMF interface
Tightly coupled HPC components
Issue of switch from push to pull data
interactions
18Screenshot Component trackback
19Screenshot Faceted search
20Summary
- Cross-domain interoperability platforms have
multiple elements - Many of these elements already exist
- Integration activities (such as Earth System
Curator) are the next focus
Image courtesy of Rocky Dunlap, Georgia Institute
of technology