Title: Curation: the Future of the Past
1 Curation: the Future of the Past
- Chris Rusbridge
- (with thanks to Peter Burnhill, Philip Lord, Alison Macdonald, etc.)
2 What's the problem?
- Struggle to understand curation against preservation
3 What's this?
4 More like data
Images from Nigel Hambly, Edinburgh WFAU
one of the coolest and therefore oldest white dwarfs ever found ... a member of a hitherto unobserved and possibly large population of faint stars
5 E-Science example?
- Glass plate originals, being digitised, not born digital
- Science from quality of metadata (time, place, wavelength) and provenance
- Future examples, eg Sloan Digital Sky Survey, VISTA
- Virtual observatories, eg Astrogrid
- Continuity after project end?
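To make the metadata point concrete, here is a minimal sketch of a record for one digitised plate that carries the fields named above (time, place, wavelength, provenance) and reports which are missing. The class name, field names and the example values are all hypothetical, chosen only for illustration; they are not taken from the WFAU archive or any survey schema.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class PlateRecord:
    """Hypothetical metadata record for one digitised glass plate."""
    plate_id: str
    observed_at: Optional[str] = None   # time of the exposure
    sky_position: Optional[str] = None  # place on the sky
    waveband: Optional[str] = None      # wavelength / filter
    provenance: Optional[str] = None    # how the scan was produced


def missing_metadata(record: PlateRecord) -> List[str]:
    """Return the names of the fields that have not been filled in."""
    return [f.name for f in fields(record) if getattr(record, f.name) is None]


# Illustrative values only, not real observations.
record = PlateRecord("example-plate", observed_at="1990-01-01T00:00:00Z", waveband="R")
gaps = missing_metadata(record)
```

A record with gaps is still usable as an image, but the slide's argument is that the science depends on those gaps being filled.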
6 The archive is the sky
AstroGrid project
graphics from US NVO project
7 Data Curation Terminology
- management/enhancement over life-cycle of scholarly/scientific interest
- Digital Preservation
- Future technological/legal accessibility and usability
- Data Curation
- Data in use (huge, distributed, for long periods)
- Enhancement (eg annotation)
- Combination and re-combination
8 Digital preservation OAIS
9 Research/publication process
Lord, Macdonald
10 . . . Data archiving
Lord, Macdonald
11 . . . curation
[Diagram: Research Process, Publication Process and Curation Process. Actors: Scientist, Curator. Data labels: primary data, secondary (derived) data, tertiary data for publication, metadata, web content, patent data, research based on data, data repositories, archived data. Publication labels: pre-print, peer review, primary publication, secondary publication, tertiary publication, publication archives. Audiences: Library - Peers - Public - Industry.]
Lord, Macdonald
12 Provenance
- Digital Preservation
- Archive knows where the data comes from
- Data curation
- Data knows where it comes from
- Changes flow to and from derivatives
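One way to picture the contrast is a dataset object that holds its own link back to its source, rather than relying on an archive catalogue to record it. The sketch below is only an illustration under that assumption: the `Dataset` class, its fields and its methods are hypothetical and do not describe any DCC or AstroGrid software. A change to a source flags its derivatives as stale, and a correction found in a derivative is reported back up the chain.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)
class Dataset:
    """Hypothetical dataset that carries its own provenance."""
    name: str
    version: int = 1
    stale: bool = False  # True when an upstream source has changed
    source: Optional["Dataset"] = field(default=None, repr=False)
    derivatives: List["Dataset"] = field(default_factory=list, repr=False)
    corrections: List[str] = field(default_factory=list)

    def derive(self, name: str) -> "Dataset":
        """Create a derived dataset that records this one as its source."""
        child = Dataset(name=name, source=self)
        self.derivatives.append(child)
        return child

    def lineage(self) -> List[str]:
        """Walk back through the sources: the data knows where it comes from."""
        names, node = [], self
        while node is not None:
            names.append(node.name)
            node = node.source
        return names

    def update(self) -> None:
        """Record a change here and flag everything derived from it (flow to)."""
        self.version += 1
        self.stale = False
        for child in self.derivatives:
            child._mark_stale()

    def _mark_stale(self) -> None:
        self.stale = True
        for child in self.derivatives:
            child._mark_stale()

    def report(self, note: str) -> None:
        """Pass a correction found here back to every source (flow from)."""
        entry = f"{self.name}: {note}"
        node = self
        while node is not None:
            node.corrections.append(entry)
            node = node.source
```

For example, deriving a catalogue from a plate scan and a candidate list from the catalogue gives a lineage that can be read off the candidate list itself; calling `update()` on the plate scan then marks both derived datasets as stale.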
13 Annotation
- Adds huge value
- Poorly implemented
14 Lord/Macdonald 1: data producers
15 Lord/Macdonald 2
16 Lord/Macdonald 3: data producers re-visited
17 Lord/Macdonald 4: Librarians
18 UK Digital Curation Centre
- Twin drivers
- eScience - data deluge - continuing access
- Digital Preservation
- Jointly funded
- by JISC and the e-Science Core Programme
- Funding of outreach, services development
- Funding of research programme
- Challenge of ensuring fruitful linkage
19 DCC Consortium Partners
- Four Consortium partner institutions
- University of Edinburgh (lead, EDINA/NeSC/DCS)
- University of Glasgow (HATII, IS)
- UKOLN, at University of Bath
- CCLRC (Rutherford and Daresbury Laboratories)
- Unifying factor: National eScience Centre (NeSC)
- jointly managed by Universities of Edinburgh and Glasgow
- UKOLN and CCLRC have involvement
20 Overall Aim
- continuing quality improvement in data curation and digital preservation
- Main focus
- data as evidential base for scholarly conclusions
- role of data curation and preservation as keys to reproducibility and reuse
- Wider focus
- worlds of eLearning and scholarly communication
21 Objectives
- Vibrant research programme
- addressing the wider issues of digital curation
- Collaborative Associates Network
- strong links across existing community of practice
- engagement with curators (individuals and organisations)
- Services
- evaluate tools, methods, standards and policies
- a repository of tools and technical information
- provide advice
- Virtuous circle
- expertise, experience and requirements feed into the DCC research programme and back into development and service
22 Setting up the DCC
- Funding from the JISC began on 1 March 2004
- EPSRC Research funding begins on 1 September 2004
- Phase One Set-up
- from now until Launch of Centre in October 2004
- Early deliverables
- Website at www.dcc.ac.uk
- to learn of updates and progress
- Helpdesk at digitalcuration_at_dcc.ac.uk
- for queries and offers of collaboration
23 Matrix approach
24 Some Names and Responsibilities
- Peter Burnhill
- Acting Director (Phase One)
- Peter Buneman (Informatics, Edinburgh)
- Research Director and PI on EPSRC grant
- Robin Rice
- Phase One Project Co-ordinator
- Liz Lyon (UKOLN, Bath)
- Associate Director (Community Support)
- Seamus Ross (HATII, Glasgow)
- Associate Director (Services)
- David Giaretta (CCLRC)
- Associate Director (Development)
25 What does this mean for libraries and archives?