Developing practical approaches to active preservation

2
Developing practical approaches to active preservation
Adrian Brown, Head of Digital Preservation, The National Archives
3
  • Introduction
  • Active preservation
  • Data model
  • Characterisation
  • Preservation planning
  • Preservation action
  • PRONOM

4
Previous work
  • National Digital Archive of Datasets (1997)
  • PRONOM (2002)
  • TNA Digital Archive (2004)
  • Web archiving programme (2004)
  • Electronic Records Online (2005)

5
Current developments
  • Seamless Flow programme (2004-2007)
  • JISC projects (2004 onwards)
  • PLANETS (2006-2010)
  • Shared preservation services for UK Government
    (2006-?)

6
Preservation management
  • Passive preservation
    • Preserving the bits
    • Digital Object Store (enhanced Digital Archive)
  • Active preservation
    • Preserving the record
    • Active Preservation system (incorporating PRONOM)

9
Active preservation
  • Service-oriented architecture
  • All services exposed as web services (local or
    remote)
  • Developed using J2EE
  • Lightweight pluggable framework to use new and
    existing TNA and third-party tools
  • Orchestrated using workflow engine
  • Scheduled for completion by end of 2007
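
The pluggable, orchestrated design described on this slide can be illustrated with a minimal sketch. It is not the TNA implementation (which the slide says was built with J2EE and web services); it only shows the idea of tools registered behind a common interface and run in sequence by a simple workflow. All names here (`SERVICES`, `register`, `run_workflow`, and the two placeholder services) are hypothetical.

```python
from typing import Callable, Dict, List

# Hypothetical plug-in registry: every service takes a record dict
# and returns an updated copy of it.
Service = Callable[[dict], dict]
SERVICES: Dict[str, Service] = {}


def register(name: str) -> Callable[[Service], Service]:
    """Register a service under a name so a workflow can refer to it."""
    def decorator(func: Service) -> Service:
        SERVICES[name] = func
        return func
    return decorator


@register("identify")
def identify(record: dict) -> dict:
    # Placeholder logic; a real service would wrap an identification tool.
    fmt = "xml" if record.get("name", "").endswith(".xml") else "unknown"
    return {**record, "format": fmt}


@register("validate")
def validate(record: dict) -> dict:
    # Placeholder logic; a real service would wrap a validation tool.
    return {**record, "validated": record.get("format") != "unknown"}


def run_workflow(steps: List[str], record: dict) -> dict:
    """Run the named services in order, passing the record along."""
    for step in steps:
        record = SERVICES[step](record)
    return record


result = run_workflow(["identify", "validate"], {"name": "report.xml"})
```

A new tool is added by registering one more function; the workflow definition is just the list of step names.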

13
Characterisation
  • Identification
    • Format version expressed as PUID
  • Validation
    • Well-formed: syntactically correct
    • Valid: well-formed and semantically correct
  • Property extraction
    • Technical properties
    • Inherent properties
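
The three characterisation steps can be sketched for a small XML object. This is an illustration only, not how DROID or JHOVE work internally: the signature table, the identifiers in it (placeholders, not real PUIDs from the registry), and the "required elements" rule standing in for semantic validity are all assumptions.

```python
import xml.etree.ElementTree as ET
from typing import Optional, Set

# Hypothetical signature table: leading bytes -> placeholder identifier.
# Real PUIDs come from the PRONOM registry; these values are made up.
SIGNATURES = [
    (b"%PDF-", "example/pdf"),
    (b"<?xml", "example/xml"),
]


def identify(data: bytes) -> Optional[str]:
    """Identification: match leading bytes against known signatures."""
    for magic, identifier in SIGNATURES:
        if data.startswith(magic):
            return identifier
    return None


def is_well_formed(data: bytes) -> bool:
    """Well-formed: the object parses, i.e. it is syntactically correct."""
    try:
        ET.fromstring(data)
        return True
    except ET.ParseError:
        return False


def is_valid(data: bytes, required_tags: Set[str]) -> bool:
    """Valid: well-formed and also satisfies a semantic rule.

    The rule used here (required child elements are present) is an
    assumption chosen to keep the example short.
    """
    if not is_well_formed(data):
        return False
    present = {child.tag for child in ET.fromstring(data)}
    return required_tags <= present


def extract_properties(data: bytes) -> dict:
    """Property extraction: a few simple technical properties."""
    root = ET.fromstring(data)
    return {"size_bytes": len(data), "root_tag": root.tag, "child_count": len(root)}


sample = (b'<?xml version="1.0"?>'
          b"<record><title>Minutes</title><date>2007-01-01</date></record>")
```

With this sample, a truncated copy of the object fails the well-formedness check, while the full object is well-formed but only valid if every required element is present.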

14
Significant properties
  • Research and collaboration
    • InterPARES
    • InSPECT
    • PLANETS
    • NARA

15
Characterisation tools
  • Automated deployment via PRONOM
  • DROID 2.0: Identification
  • JHOVE: Validation and property extraction
  • Java POI library: Validation and property
    extraction (MS Office documents)
  • Java JAXP API: Validation (XML)
  • PLANETS characterisation tools

18
Preservation planning
  • Risk-based approach
  • Supports preservation and presentation
  • Two types of risk considered
    • Format risk: arising from generic properties of
      the format
    • Instance risk: arising from specific properties
      of the object

19
Preservation planning
  • Risk assessment
    • Identifies urgency of action, assessed against
      standard criteria stored in registry
  • Technology watch
    • Monitors technological change
    • Updates registry accordingly
    • Results in updated risk assessment criteria
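
As a rough sketch of risk assessment against stored criteria, the example below combines format risk and instance risk into an urgency level. The criteria names, weights, and thresholds are invented for illustration; the slides do not give the actual criteria held in the registry.

```python
from typing import Dict

# Hypothetical criteria and weights, standing in for registry content.
FORMAT_CRITERIA: Dict[str, int] = {
    "proprietary": 3,
    "undocumented": 4,
    "obsolete_software": 5,
}
INSTANCE_CRITERIA: Dict[str, int] = {
    "encrypted": 5,
    "embedded_macros": 2,
    "failed_validation": 4,
}


def risk_score(flags: Dict[str, bool], criteria: Dict[str, int]) -> int:
    """Sum the weights of every criterion that applies."""
    return sum(weight for name, weight in criteria.items() if flags.get(name))


def urgency(format_flags: Dict[str, bool], instance_flags: Dict[str, bool]) -> str:
    """Map the combined format and instance risk to an urgency level.

    The thresholds (8 and 4) are arbitrary values chosen for the example.
    """
    total = (risk_score(format_flags, FORMAT_CRITERIA)
             + risk_score(instance_flags, INSTANCE_CRITERIA))
    if total >= 8:
        return "high"
    if total >= 4:
        return "medium"
    return "low"
```

Under this model, technology watch amounts to editing the criteria tables: changing a weight changes the urgency of every affected object the next time it is assessed.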

20
Preservation planning
  • Impact assessment
    • Monitors ongoing risk
    • Quantifies impact on the collection
  • Migration pathway generation
    • Identifies possible migration pathways through
      analysis of registry
    • Tests and certifies pathways
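
Migration pathway generation can be pictured as a search over the conversions recorded in a registry. The sketch below uses a breadth-first search, which is one plausible technique and not necessarily the one used by TNA; the format names, tool names, and the `CONVERSIONS` table are hypothetical.

```python
from collections import deque
from typing import Dict, List, Optional, Tuple

# Hypothetical registry entries: (source format, target format, tool).
CONVERSIONS: List[Tuple[str, str, str]] = [
    ("doc", "odt", "tool-a"),
    ("odt", "pdfa", "tool-b"),
    ("doc", "html", "tool-c"),
]

Step = Tuple[str, str, str]


def find_pathway(source: str, target: str,
                 conversions: List[Step]) -> Optional[List[Step]]:
    """Return the shortest chain of conversions from source to target.

    Returns an empty list if source and target are the same format,
    and None if no pathway exists.
    """
    graph: Dict[str, List[Tuple[str, str]]] = {}
    for src, dst, tool in conversions:
        graph.setdefault(src, []).append((dst, tool))

    queue = deque([(source, [])])
    seen = {source}
    while queue:
        fmt, path = queue.popleft()
        if fmt == target:
            return path
        for nxt, tool in graph.get(fmt, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [(fmt, nxt, tool)]))
    return None
```

A pathway found this way is only a candidate; as the slide notes, it would still need to be tested and certified before use.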

22
Preservation action
  • Lightweight framework for automated deployment of
    migration tools via PRONOM
  • Execution: perform migration
  • Characterisation: characterise migrated objects
  • Validation: automated comparison of significant
    properties
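
The execute, characterise, validate sequence can be sketched end to end on a toy object. Everything here is an assumption made for illustration: the text-to-HTML "migration", the two properties treated as significant, and the function names. The point is only that the same characterisation is run before and after migration and the results are compared.

```python
import re
from typing import Dict, List, Tuple


def characterise(obj: dict) -> dict:
    """Extract properties from the textual content, ignoring any markup."""
    plain = re.sub(r"<[^>]+>", " ", obj["content"])
    words = plain.split()
    return {"word_count": len(words), "first_word": words[0] if words else None}


def migrate(obj: dict) -> dict:
    """Execution: a toy migration from plain text to HTML paragraphs."""
    paragraphs = [p for p in obj["content"].split("\n") if p]
    html = "".join(f"<p>{p}</p>" for p in paragraphs)
    return {"format": "html", "content": html}


def validate(before: dict, after: dict, significant: List[str]) -> Dict[str, bool]:
    """Validation: compare each significant property before and after."""
    return {name: before[name] == after[name] for name in significant}


def preserve(obj: dict, significant: List[str]) -> Tuple[dict, Dict[str, bool]]:
    """Run migration, characterise the result, and report on each property."""
    before = characterise(obj)
    migrated = migrate(obj)
    after = characterise(migrated)
    return migrated, validate(before, after, significant)


source = {"format": "txt", "content": "Minutes of meeting\nAgreed actions"}
migrated, report = preserve(source, ["word_count", "first_word"])
```

A `False` entry in the report would flag a significant property that did not survive the migration.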

23
Migration tools
  • Stellent Transformation Server
  • XML Export
  • HTML Export
  • PDF/A (forthcoming)
  • ODF (forthcoming)
  • TNA migration validation tool

26
PRONOM
  • PRONOM 6 (Spring 2007)
    • Enhanced characterisation support
    • Risk assessment service
    • Migration pathway development
    • SOAP/REST interfaces
    • PUID resolution service
  • Research: Preserv and ROAR
  • Collaboration: GDFR and DCC RIRR

27
Conclusions
  • Core components of active preservation system
    operational by end of 2007
  • Enhanced PRONOM public web services
  • Compatibility with PLANETS to allow future
    extensibility
