Title: doi
1Digital Object Identifier
Norman Paskin, International DOI Foundation
2Main themes
doigt
Previous ERPANET Cork meeting (June
2004) Principles of persistent
identifiers The development of persistent
identifiers. Keynote presentation made at
ERPANET Persistent Identifiers seminar, Cork.
June 2004 http//www.erpanet.org/events/2004/cork
/presentations/040617PaskinPIConcepts.pdf DOI
system as one solution Overview of Digital
Object Identifier (DOI) System. Presentation
made at ERPANET Persistent Identifiers seminar,
Cork. June 2004 http//www.erpanet.org/events/200
4/cork/presentations/040617PaskinDOIpresentation.p
df
- Now Update on key developments
- Internet infrastructure
- Standards
- Social infrastructure
- Value added
- Example DOI applications
- Responses to issues raised slides modified, more
in full version on web site
3doigt
Internet infrastructure
- The governance of the DNS will not
completely encompass future Internet addressing
and navigation, which is a good thing, not a
shortcoming. The system of domain names, IP
numbers, root servers and protocol identifiers is
not static but a technology capable of evolving
into a better form. As such, the current system
should not be treated as sacrosanct, but amenable
to innovation. The paradox of Internet governance
is that any institutional arrangement will by
nature be a collusion of political power and
financial interests that acts to freeze into
place the current technical design, and make new
and better approaches almost impossible to
emerge-much as the system of national telecom
operators dominated communications for a century
until the Internet emerged as the unlikely force
that upended it. We can already see that future
Internet navigation will not simply be addresses
linked to computers, but to billions of devices,
file-documents, real-time video and audio
streams, objects though RFID tags, and even
constantly changing instantiations of information
- all which will make today's DNS and its
governance seem anachronistic. Allowing for
alternative addressing and navigation across the
network, alongside a sanctioned 'legacy' DNS,
will be a balanced way to achieve diversity,
experimentation and progress, while also ensuring
stability and reliability. - Kenneth Neil Cukier (Technology
Correspondent, The Economist) at the OII Forum on
Internet Governance 6 May 2005
i.e. an architecture which sustains legacy and
supports innovation
4doigt
DOI as a packaged system minimum reinvention
Numbering scheme
Policies
doigt
Data Model
Internet Resolution
5DOI syntax can include any existing identifier
label formal or informal, of any entity
- An identifier container e.g.
- 10.1234/NP5678
- 10.5678/ISBN-0-7645-4889-4
- 10.2224/2004-10-ISO-DOI
- NISO Z39.84 DOI Syntax (2000)
- 5 year review in 2005 completing.
- Revision of case sensitivity, to be amended in
line with use - Approved subject to minor edits
6- Resolve from DOI to data
- initially to location (URL) persistence
- May be to multiple data
- Multiple locations
- Metadata
- Services
- Extensible
- Uses the Handle system
- - IETF RFCs 3650, 3651, 3652
- See Release 1.0, September 2003 "Online
Registries The DNS and Beyond...
doi10.1340/309registries - URI specification pending
- Possible URN spec
- May use http
- 10.123/456 as dx.doi.org/10.123/456
Internet resolution allows a DOI to link to any
multiple pieces of current data
7- DOI Data Model Metadata tools
- a data dictionary to define
- a grouping mechanism to relate
- The semantic web bit
- Necessary for interoperability
- Enabling information that originates in one
context to be used in another in ways that are as
highly automated as possible. - Able to use existing metadata
- Mapped using a standard dictionary
- Can describe any entity at any level of
granularity - indecsDD which incorporates ISO MPEG 21000 part 6
RDD - IDF is the MPEG21 RDD registration authority
ltindecsgt Data Dictionary DOI AP framework
8doigt
DOI and metadata standards
- RDD is part of the overall indecs Data Dictionary
approach - All the RDD terms are in the actual iDD as it now
stands - That is is not the DOI Metadata Standard scheme
you must use. - DOI can use ANY metadata the Dictionary simply
enables interoperability. - Using ONIX is best example so far
- CRM, FRBR are similar philosophy so map very well
- SCORM, DC, METS, etc can also be mapped
- The existence of the dictionary can help develop
a scheme by offering standard terms
91998-2004 Defining what is identified through
metadata
Development of indecs 1998-2004 Black what
Red who
10DOI policies allow any business model for
practical implementations
- Implementation through IDF
- Technical infrastructure resolution mechanism,
proxy servers, mirrors, back-up, central
dictionary, - Social infrastructure persistence commitments,
fall-back procedures, cost-recovery
(self-sustaining), shared use of ip - Not a standard but a Registration
Authority/maintenance agency (cf bar code and
EAN) - DOI System for standardisation in ISO TC46SC9
(ISBN etc) - Registration agencies
- Each can develop own applications
- Use in own brand ways appropriate for their
community
11doigt
Update on DOI system
- 2005 meeting presentations on web site
- http//www.doi.org/doi_presentations/members_meeti
ng_2005/agenda.html - See DOI News
- Key theme social infrastructure issues
-
- 17m DOIs, gt1300 naming authorities
- Modest growth in DOIs of most RAs
- One new sector (data TIB, N4L)
- Self funding model underlying income must
balance costs vulnerable to problems. Growth
rate? - Network effects communities more effective than
small or start-up RAs - Relation to handle implementations
-
12DOI Deposits by RA per year
doigt
10 RAs but one dominates numerically at present
(6 months)
13doigt
Cumulative DOI Prefixes by RA/year
Naming authorities (prefix) development presages
more growth
14doigt
Update on DOI system
- Key theme social infrastructure issues
-
- Completing main development work
- from initial implementation (single URL redirect)
- to full (Multiple resolution, metadata
management) - Continuing growth
- Good progress with standardising syntax, data
dictionary, core specification - IP patent policy
- Restructuring organisation
- Wider advisory board, change of some internal
aspects -
15doigt
Scope
- Digital Object Identifier
- not Identifier of Digital Object
- but Digital Identifier of Object
- Object any entity of interest in an
intellectual property transaction -
- Object precisely defined by metadata and
terminology from data dictionary (indecs
framework et al) to ensure that what you mean is
what I mean (interoperability) - Objects can be physical, digital, or abstract,
e.g. people, organisations, agreements, etc. - What is identified is not necessarily what DOI
resolves to (what is resolved to is about what
is identified)
16doigt
Value of the DOI system
- See DOI factsheet Value added by the DOI
system - Version 1.1 www.doi.org/factsheets
- The value of the DOI system is from three
functions - Persistent identification
- Network actionability
- Semantic interoperability
- Value is added by each of these three
- Each offers value
- Can have one or more for a DOI application
- The combination provides best functionality/advant
age
17doigt
Value added (1)
- Design
- A system for persistent, semantically
interoperable, identification of intellectual
property entities on any digital network. - Media type neutral applies to physical and
digital objects and abstract entities (resources,
parties, licences, etc) - Standards based
- Uses existing components in building an added
value application. - To be an ISO standard
- Syntax
- Standard NISO Z39.84 syntax
- Resolution (network actionability)
- Uses Handle System
- Technology neutral (only requires internet
connectivity)
18doigt
Value added (2)
- Data structure (semantic interoperability)
- Precisely specify exactly what is identified
- Uses indecs framework
- Can be used to embody relationships, e.g. a
parent-child. Handle technology allows this
DOI provides a framework to achieve it. -
- Technical Infrastructure
- IDF funds aspects of technical infrastructure of
the DOI System - Social infrastructure (persistence)
- IDF builds on top of Handle a social structure
guaranteeing persistence. Persistence is a
function of organizations, not of technology a
persistent identifier system requires a
persistent organization and defined processes - Business potential
- Registrants can incorporate persistent,
semantically interoperable, identification of
intellectual property entities as part of
services providing added value.
19Example 1 Crossref now 400 publishers, 16
million DOIs, Google, etc
20Example 2 OECD use of DOIs for tables etc
Adding value for both print and online readers
Printed under each table, chart and graph,
DOI-based links offer instant access to the
matching spreadsheet in Excel TM
21Examples of data citations
- Nozawa, Toru (2004) IPCC-DDC_CCSRNIES_SRES_B2
211 YEARS MONTHLY MEANS, National Institute for
Environmental Studies and Center for Climate
System Research Japan, WDCC. doi10.1594/WDCC/CCSR
NIES_SRES_B2 - Kamm,H Machon, L Donner, SÂ (2004) Gas
Chromatography (KTB Field Lab), GFZ Potsdam.
doi10.1594/GFZ/ICDP/KTB/ktb-geoch-gaschr-p
Example 3 TIB expecting 1.5m DOIs in year
one
22What is the Eleonet project
- Eleonet stands for European Learning Object
Network - Is a project co-funded by the European Commission
within the eTEN programme - It is described in a single phrase as follow
- Eleonet aims at creating a technological system,
based on appropriate DOI application profile(s),
to facilitate the exchange of educational content
within the eLearning value chain at European level
Example 4 Eleonet several DOI agencies
collaborating
23The consortium
- AIE (co-ordinator) and Cineca - the two parent
organisations of mEDRA - Editrain in Spain, a mEDRA partner since 2002
- Nielsen BookData in UK
- L3S / TIB in Germany, the most recently appointed
DOI-RA - INDIRE an Italian public organisation
specialised in metadata for educational content
Example 4 Eleonet several DOI agencies
collaborating
24The expected output
- A DOI registration system for European Learning
Objects (based on mEDRA technology) - including complete metadata schema (based on
existing ones, LOM in particular) - possibly mapped within the iDD system
- A database of metadata on LOs
- A search engine to explore and query the DB
- A resolution system, based on DOI technology, to
reach multiple repositories
Example 4 Eleonet several DOI agencies
collaborating
25Cools URIs dont meet the practical needs of
some users e.g. purchase of book list by a
rival publisher (example from Cork presentation)
26doigt
Some issues from the meeting (1)
- Technology
- http? Yes DOI 10.123/456 as http//dx.doi.org/10
.123.456 - but the world does not stop evolving sustain
legacy, support innovation multiple, secure
resolution, etc (whats the problem?) - Services not identifier mechanisms are important
for users - IDF Some technical infrastructure mirrors,
copies, backups, proxies, etc - But mainly social infrastructure
- Communities of self-interest
- Persistence of doi.org? dx.doi.org? (re e.g.
loc.gov). - Its communities, not technical design
stakeholders - gt1300 naming authorities - may not all care but
some will - Just as persistence of handle.net is through
CNRI, DOI, ADL, DTIC, Grid,Matrushka dolls - Persistence of data dictionaries other
stakeholders ONIX MPEG21, etc
27doigt
Some issues from the meeting (2)
- Practical rules
- e.g what happens, if RA goes away, to its
existing DOIs specific rules re prefixes, new
DOIs, edits, transfers, aliases, fees - e.g. what happens if IDF goes away, CNRI goes
away contractual bindings (RA-IDF-CNRI-Data
Dictionary) - Network effects
- Communities adopt DOI most easily if they have
- a reason (Crossref, TIB) and/or
- an ability to mandate (EC)
- The more stakeholders adopt DOI, the more
persistent DOI is - Building communities is hard
- People, politics, suspicion, NIH, FIG.
- Costs effort self-funding bootstrapping,
leadership - Herding cats different viewpoints
- Legal issues self-interest can mean selfish
interest e.g. intellectual property conflicts - But worth doing
28Further reading
doigt
- Paskin, Norman. "Two practical examples of issues
in internet governance". Paper presented 6 May
2005 at the Internet Governance Forum, Oxford
Internet Institute. - http//www.oii.ox.ac.uk/collaboration/specialevent
s/20050505_governance_position_papers.pdf - Paskin, Norman. "Digital Object Identifiers for
scientific data". Data Science Journal, Vol. 4,
18 March 2005. - http//www.doi.org/topics/050210CODATAarticleDSJ.p
df - The development of persistent identifiers.
Keynote presentation made at ERPANET Persistent
Identifiers seminar, Cork. June 2004 - http//www.erpanet.org/events/2004/cork/presentati
ons/040617PaskinPIConcepts.pdf - Overview of Digital Object Identifier (DOI)
System. Presentation made at ERPANET Persistent
Identifiers seminar, Cork. June 2004 - http//www.erpanet.org/events/2004/cork/presentati
ons/040617PaskinDOIpresentation.pdf - DOI factsheet Value added by the DOI system
http//www.doi.org/factsheets - Presentations from 2005 annual IDF meeting
- http//www.doi.org/doi_presentations/members_meeti
ng_2005/agenda.html
29n.paskin_at_doi.org www.doi.org