Title: Publishing Scientific Data
1. Publishing Scientific Data: the Role of the Digital Object Identifier
- Michael Diepenbroek
- PANGAEA / WDC-MARE
2. DOI - Operational System
- International DOI Foundation (IDF) launched 1998.
- Currently used by c. 4,000 naming authorities
  - assigners, e.g. 3,000 publishers, data repositories, EU
  - Documents, science data sets, etc.
- 43 million DOI names assigned to date
- 60 million DOI resolutions per month
- Well established in professional information sector
  - best known applications are CrossRef (www.crossref.org) and DataCite (www.datacite.org)
- Draft International Standard (ISO TC46)
3. DOI Operational System
- Use of identifier syntax and network resolution mechanism (Handle System)
- Persistence ensured through combination of
  - improved handle infrastructure (registry database, proxy support)
  - social infrastructure (obligations by Registration Agencies - RA)
- Use of a semantically interoperable data model and grouping mechanisms
  - multiple resolution, data typing, Application Profiles
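The identifier syntax and proxy-based resolution named on this slide can be illustrated with a minimal Python sketch. A DOI name consists of a prefix beginning with "10." and a suffix, separated by the first slash, and it can be turned into a resolvable URL by appending it to a proxy address. The function names `split_doi` and `proxy_url`, the example DOI name, and the choice of https://doi.org/ as the proxy are assumptions made for illustration, not part of the original slides.

```python
from typing import Tuple
from urllib.parse import quote

# Assumption: the public DOI proxy server is used for resolution.
DOI_PROXY = "https://doi.org/"


def split_doi(doi_name: str) -> Tuple[str, str]:
    """Split a DOI name into (prefix, suffix) at the first slash."""
    prefix, sep, suffix = doi_name.strip().partition("/")
    if not sep or not suffix:
        raise ValueError("a DOI name needs the form prefix/suffix")
    if not prefix.startswith("10."):
        raise ValueError("a DOI prefix starts with '10.'")
    return prefix, suffix


def proxy_url(doi_name: str) -> str:
    """Build the URL a browser would follow to resolve the DOI name."""
    prefix, suffix = split_doi(doi_name)
    # Percent-encode suffix characters that are not safe in a URL path.
    return f"{DOI_PROXY}{prefix}/{quote(suffix, safe='/')}"


if __name__ == "__main__":
    # Hypothetical DOI name, used only to show the call.
    print(proxy_url("10.1234/example.dataset"))
```

The sketch only builds the URL. The actual lookup of the current location is done by the resolution infrastructure behind the proxy, which is what allows a data set to move without breaking its identifier.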
4. DOI - Organisation
[Organisation chart: Operating Federation; International DOI Foundation; members]
5. DOI Business Model
- IDF receives membership fees from RAs, contracts technical operator
- RAs also pay operational fees to IDF's technical operator for registering and maintaining DOI names (sliding scale per volume)
- Assigners are customers of RAs
- RAs might have their own existing numbering scheme
- RAs are autonomous independent bodies. They offer services to assigners using DOI names
- RAs' business model with their customers is entirely autonomous
- RAs' only obligation to IDF is a licence/operating agreement
6. DOI system added value
- DOI is a brand
- DOI resolving infrastructure
- Offer you the opportunity to build added-value services
- Strong linkage with academic publishing!
- DataCite as DOI registry for scientific data
  - Organised as an international association of libraries
  - Developed by ICSU World Data Centers and Services (German cluster) and the Technical Information Library in Hannover (TIB)
  - Will be adopted by ICSU World Data System (associated member of DataCite)
7. ICSU World Data Centers (WDC) - Geophysical Year 1957
- Meteorology
  - Asheville NC, USA
  - Beijing, China
  - Obninsk, Russia
- Marine Geology and Geophysics
  - Boulder CO, USA
  - Moscow, Russia
- Nuclear Radiation
  - Tokyo, Japan
WDC Co-ordination Offices: Washington DC, USA; Beijing, China
8. Initial position of WDS
- Contra
  - Insufficient funding (of course)
  - Organisation and quality of data services are not consistent
  - IT development is fast, no time for legacies
  - Fragmentation of efforts
- Pro
  - Long-standing experience, know-how, motivation
  - Good context with science
  - Open access for all data resources
  - As a whole, a very large global data management capacity
  - Trans-disciplinary!
9. ICSU WDS - Roles and relations in a federated system
- Scientific Communities and Other Stakeholders
- Related Networks and Programs: GEOSS, GMES, WMO-IS, IOC etc.
- Metadata and Data Services: web portals, catalogues
- Visualisation and Analysis: compute systems, virtual labs, GIS systems
- Publishers: commercial, open access (e.g. ESSD journal), cross-referencing
- Data Archiving and Publication Facilities: certified repositories
- Libraries and Service Providers: DOI registry, interdisciplinary catalogues
- Education and Outreach
- Data Collection and Processing Facilities: QA/QC, data products, also data rescue
- Research Institutions: universities, research institutes
- Research Projects / Programs: national, EU, international
- Research Facilities: satellites, vessels, observatories, alert systems etc.
10. WDS implementation
11. Why do we need publishing systems for scientific data?
- Good data availability fosters large-scale, complex science approaches.
- Data recycling is more effective than re-production.
- General data availability is low compared to data production.
- Available data are often not usable because the quality cannot be estimated.
- Prerequisite for the verification of scientific results.
- Benefit to data producers (publications are the currency of science)
12. Data publishing - prerequisites and current status
- DOI assigners (agents) need to be certified
- Certification agency (CA)
- following the OECD principles and guidelines for access to research data (2007)
- peer-review procedures
- citability
- persistent identifiers (DOI)
- ICSU SCOR CODATA working groups on data citation
- Science citation index -> Thomson Reuters Web of Knowledge
- Metadata/Data standards and protocols
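To make the link between citability and persistent identifiers concrete, the following minimal Python sketch assembles a citation string for a data set from a few metadata fields and its DOI. The record fields, the function name `format_citation`, the example values and the "Creator (Year): Title. Publisher. DOI" layout are all assumptions for illustration. The layout loosely follows the form recommended by DataCite, and none of the example values refer to a real data set.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DatasetRecord:
    """Hypothetical minimal metadata needed to cite a published data set."""
    creators: List[str]
    year: int
    title: str
    publisher: str
    doi: str


def format_citation(rec: DatasetRecord) -> str:
    """Return 'Creators (Year): Title. Publisher. DOI link' as one string."""
    authors = "; ".join(rec.creators)
    return (
        f"{authors} ({rec.year}): {rec.title}. "
        f"{rec.publisher}. https://doi.org/{rec.doi}"
    )


if __name__ == "__main__":
    # Invented example record; the DOI name is a placeholder.
    example = DatasetRecord(
        creators=["Doe, J.", "Roe, R."],
        year=2010,
        title="Example sediment core data",
        publisher="Example Data Archive",
        doi="10.1234/example.dataset",
    )
    print(format_citation(example))
```

Because the DOI is part of the citation itself, a reference list entry for a data set can be followed and counted in the same way as one for a journal article.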
13. Data publishing - metadata
14. Data publishing - prerequisites and current status
- Collaborations with science publishers (Elsevier, Springer, Wiley, Oxford, AGU etc.)
  - data journals (ESSD)
  - cross-referencing supplementary data with traditional publications
  - published data as embedded content for traditional publications
  - combined peer-review between data archive and journal
15. Data publishing - peer-review
16. Data publishing - cross-referencing
17. Thank you!