Title: Open GeoArchives
1Open Geo-Archives
- Integrating earth science data centers into
research portals
Robert Huber Uwe Schindler
2PANGAEA WDC MARE
Database PANGAEA is a public data library for
science aimed at archiving, publishing and
distributing georeferenced data with special
emphasis on environmental, marine and geological
basic research Organisation The World Data
Center for Marine Environmental Sciences
(WDC-MARE ) uses PANGAEA as its data archive and
distribution system. Operating
Institutions The WDC-MARE is operated by Centre
for Marine Environmental Sciences (MARUM)
Research Center Ocean Margins RCOM Foundation
Alfred Wegener Institute for Polar and Marine
Research (AWI) Staff 3 permanent, 7-9
temporary
3PANGAEA Content Statistics (9/2006)
Interdisciplinary data / many contributing
communities
Total number of data sets 450.000
Data items 1.8 billions
4PANGAEA portal networks
5Why community portals
- Researchers need to share data
- Data is difficult to find / scattered (databases,
literature) - Technology / standards are available
- Provide community tailored data
- Funding EC demands networks (NoE)
- Many more good reasons..
6PANGAEA standard interfaces for metadata
data management longterm archiving
Frontends / portals
catalogues
protocols
catalogues
WS(SOAP/WSDL)
PangaVistaGE UNM
marshaller
Index
PANGAEA
GeoPortal.Bund
gml, kml
WFS(OGC)
XSLT
IODP
OGCcatalogue service
RDB
D-GRID
ISO19xxx
harvester
ISO19xxx
CARBOOCEAN
harvester
EUR-OCEANS
Dublin Core
DIF
GCMD
OAI-PMH
ScientificCommons
DublinCore
DIF
harvester
Google
HGF Fedora
Darwin Core
DiGIR
Darwin Core
OBIS
harvester
GBIF
DOI registration
WS(SOAP/WSDL)
STD-DOI
ISO690
TIB NationalLibrary
DOI registry
www.pangaea.de
7Community portals-Examples
- EUR-OCEANS (pelagic ecosystems)
- CARBOOCEAN (CO2)
- Task Build a networked database
- Initial Partners CDIAC, Ifremer, PANGAEA
- Grow! (COPEPOD, NODC, OceanPortal, BODC..)
8Community portals-Technology
- OAI-PMH
- DIF / ISO
- Lucene
- SOAP
- PHP
A
Lucene Index
PHP
Java
DIF XML
DIF
B
OAI-PMH
Portal
SOAP
DIF XML
DIF XML
C
9Community portals-demo
10Community portals-Lessons learned
- We still learn..
- Definition of dataset, granularity of data
- Technology less difficult than metadata
- Choose simple metadata formats
- Standards ? -gt Interpretation ! -gtCommunicate !
- Avoid user frustration, link to real data
- OAI good choice!
- Ontologies ? Yes but