Title: Community Earth Science Informatics Initiatives
1Community Earth Science Informatics Initiatives
Their Impacts
Lee Allison, Arizona Geological
Survey Association of American State Geologists
2200 million websites if you dont have a
website, you dont exist.
Prediction
In 5-10 years, if your data are not online in an
integrated, interoperable network, you wont
exist.
31000s of National and Regional Databases
- topographic, orthoimagery, hydrography
- mineral resources
- water
- geochemistry
- geophysics (aeromag, gravity, aerorad)
- earthquake catalogs
- biological surveys
- vegetation/speciation maps
Tower of Babel
4Conclusions Growing Consensus for an NGS
- Goals interoperable, distributed, Web-service
based, synoptic 4-D system - Challenges
- Technical adapting-adopting existing
capabilities - Cultural organizational controls, recognition
- How do we get there?
- Agreement on standards, protocols, architecture
- Geological Surveys as data archives, providers
- Parallel community efforts are linking
- Implementation is underway
- Sustainability is an issue
5Current electronic delivery
6The Goal
7Most of the technology exists Challenges are
cultural and organizational
8With apologies to JRR Tolkien
One system to rule them all, one system to find
them, one system to bring them all, and in the
darkness bind them.
9How do we get there?
NSF to the Solid Earth Sciences how do you
build a sustainable community system? - 2-year
community engagement process underway
10Earth science cyberinfrastructure
Early paradigm Central databases for each topic
Distributed Web-based Interoperable
11Goal is making data interoperable
Ian Jackson, BGS
12interoperability
- "The capability to communicate, execute programs,
or transfer data among various functional units
in a manner that requires the user to have little
or no knowledge of the unique characteristics of
those units." - ISO/IEC 2382-01 (SC36 Secretariat, 2003)
13Example the electrical utility
- Simple interface put plug in wall, get
electricity
Afghanistan 220 V 50 Hz Andorra 230 V 50
Hz Anguilla 110 V 60 Hz Antigua 230 V 60 Hz
Cayman Islands 120 V 60 Hz Cyprus 240 V 50 Hz
Czech Republic 230 V 50 Hz
14Complexity
15Other complex things
16National Geoinformatics System
- Killer applications
- User cases best practices in meeting
stakeholder needs - Data discovery, catalogs, inventories, metadata
profiles, metadata aggregation service(s) 4D
search engines, Informatics specifications, data
model, interoperability, standards - Web portal Registry development and
implementation - Accessing licensing protocols, recognition
credit - Community of practice
- Communication, dissemination, awareness
- Ontologies, vocabularies
- Access to high-resolution spatial geological
applied datasets - Big Iron high performance computing
- Digitization of legacy data
- Liaison and integration with related groups
initiatives - Sustainability
17Computer printer services
- Advantages
- one driver (wrapper) per application
- Application need know nothing about
printerseparation of concerns
18Interoperability via web service
Use of standard markup language means schema
mapping only needs to be done once
Wrapper implements interface to service
formulate requests, interpret results
Participants implement one interface for each
service
Applications focus on application logic, not data
access.
19GeoSciML developers
Mark-up language wrapper translates your data
Cocoon Uppalla, Sweden
GeoServerKeyworth, UK
CocoonOttawa, Canada
Ionic Orleans, France
Mapserver Arizona
CocoonVirginia, USA
Tsukuba, Japan
GeoServer Canberra
GeoServer Melbourne, Australia
20Using a web service step 1
GeoSciML Web Services Request
21Web service request step 2
GeoSciML Web Services Request
22Web service response part 1
GeoSciML Web Services Response
23Web service response - part 2
GeoSciML Web Services Response
24ORGANIZATION Unique missions of geological
surveys - collect, archive, disseminate data
Geoscience Information Network (GIN)
Distributed Web-based Interoperable
2,000 3,000 databases 1000s of collections
80,000 geologic maps
25We agree on a data network that
- is distributed (vs centralized)
- is interoperable
- uses open source standards and common protocols
(OGC, GeoSciML) - respects and acknowledges data ownership
- fosters communities of practice to grow
- facilitates development of new web services and
clients
26System overview
GIN
27Geologic map service scenario
Catalog NGMDB? OneGeology? NDC? GEON? NGDS?
Registration
Survey map servers
OGC CSW
ArcMap
OGC WMS
ArcGIS
28- National Geologic Geophysical Data Preservation
Program - 1M per year
- National inventory
- Metadata catalogue
- National Digital Catalogue
29Data discovery -
- 79,000 maps, images, data, and products from
350 publishers - Lexicon of Geologic Names of the United States
30Defining GIN
- collections of service definitions, interchange
formats, and vocabularies - independent of hardware, operating system, or
lower-level network protocols - new technology will only require implementation
of network elements in a new environment - architecture allows for the use of multiple
conventions for different user groups
31WWW GIN
- http hypertext transfer protocol ( ftp, etc)
- html hypertext mark-up language
- url universal resource locator
- browser built by others
- Open source standards Open Geospatial
Consortium - data interchange tool GeoSciML
- distributed data catalogues (National Geologic
Map DB National Data Catalogue, etc) - Web services applications built by others
32Challenges to building community
Who sets the standards? Who controls the
system? Who makes the decisions?
33The network is voluntary, not imposed from above
Interoperability
34Don't panic!
- We wont take your data away they stay with you
- Your participation is voluntary
- Keep your formats, system, servers
35Will 3,000 interoperable data bases become an
800-lb gorilla?
36GIN is partnering with the global Earth science
community
AASG USGS National Geoinformatics System
OneGeology-Europe 21 nations Marine Metadata
Interoperability Initiative US DOE National
Geothermal Data System (NGDS) US DOE Geothermal
Technologies Program Energy Industry Metadata
Standards Working Group - Energistics
37PARTNERS COLLABORATORS
38MS SciScope geospatial data discovery
Welcome to SciScope!SciScope is a tool by
Microsoft Research to help geoscientists discover
data from numerous data repositories with ease
through a single, intuitive interface. Users
can display multiple map layers related to the
scope of their study and interact with
geographical features on the map including dams,
rivers, water bodies, geology, aquifer systems,
ecological regions and river basins.
39GIN DEMO PROGRAM
- NSF INTEROP GIN
- 3 year development of standards, services
- Demos in 6 SGSs 80K subcontracts
40Circuit Riders
- Part trainer, part management consultant, part
computer expert - Write GeoSciML wrappers
- Guide server configurations
- Training, short courses
- 80K for demos across AASG
41(No Transcript)
42ADOPTION DEPLOYMENT
- US Dept. of Energy (May, 2009)
- National Geothermal Data System (NGDS)
- GIN architecture, standards
- 5M, 5 years
- Adopted by US Geothermal Technologies Program
43National Geothermal Data System
Distributed data sources
NGDS
Legacy data repository
Desktop applications (GeoSciNet)
Ontologies, vocabularies
Discovery, access, exchange (GIN)
Portals (GeoSciNet, SciScope)
44National Geothermal Data System
- Data discovery, access, exchange GIN
- Distributed content geothermal community
- Legacy data repository NGDS
- Desktop applications (economic modeling tool,
etc) GeoSciNet - Portals GeoSciNet, SciScope
45NATIONAL DEPLOYMENT
- US DOE Geothermal Data Development, Collection,
and Maintenance - 20M, 1-5 awards
- AASG proposal submitted
46106 nations
47(No Transcript)
48(No Transcript)
4929 countries and European organizations are
committed to create a geological map at
11.000.000 scale, integrated with metadata
initially available in the following languages
English, French, Italian, Spanish, Swedish, Czech
and Norwegian.
50Network sustainability
- tipping point at which users and providers will
see the network as critical to their basic
functions - populating and using the network becomes a
necessary cost of doing business - how do we maintain network functions?
51How do we get there?
NSF to the Solid Earth Sciences how do you
build a community system? - 2-year community
engagement process underway
Geological Surveys as drivers? - USGS, 51
state surveys, 21 European surveys, 106 nations
Linkage with other communities and natural
science domains - MMI, OOS, CUAHSI-HIS,
Geoscience Australia, iPlant, GBIF, ESIP,
Energistics,..
52TIPPING POINT
- Energy Industry Metadata Standards Working Group
- End-to-end discovery, access, and exchange of
upstream petroleum data
53- 97 members
- American Geological Institute (AGI)
- Baker Hughes
- BP
- British Geological Survey (BGS)
- Chevron
- ConocoPhillips
- Department of Interior (U.S. DOI-BLM-MMS)
- Directorate General of Hydrocarbons (India) (DGH)
- ExxonMobil
- Ground Water Protection Council
- Halliburton
- IBM Corporation
- IFP - Institut Francais du Petrole
- Norwegian Petroleum Directorate (NPD)
- Open Geospatial Consortium (OGC)
- Pioneer Natural Resources
- SAIC-Science Applications Intl. Corp.
54Geoscience Information Network
http//usgin.org