Title: Digital Library Architecture
1. Digital Library Architecture
Reagan Moore, Chaitan Baru, Amarnath Gupta, George Kremenek, Bertram Ludaescher, Richard Marciano, Arcot Rajasekar, Wayne Schroeder, Michael Wan, Ilya Zaslavsky, Bing Zhu
(http://www.npaci.edu/DICE/)
2. What Types of Management Systems are Required?
- Data management
  - Ability to access multiple types of storage systems, across separate administration domains
- Information management
  - Ability to migrate a collection onto a new information repository
- Knowledge management
  - Rule-based ontology mapping
  - Characterization of rules under which a collection is formed
  - Management of knowledge bases - Topic Maps
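The data management requirement above (one access path to several kinds of storage in different administration domains) can be illustrated with a small sketch. This is not the SRB API: the `StorageDriver`, `InMemoryStore`, and `Broker` names, and the resource names in the usage note, are hypothetical and exist only for illustration.

```python
from typing import Dict, Protocol


class StorageDriver(Protocol):
    """Hypothetical minimal interface every storage system must offer."""

    def read(self, path: str) -> bytes: ...

    def write(self, path: str, data: bytes) -> None: ...


class InMemoryStore:
    """Stand-in for one storage system (a file system, an archive, ...)."""

    def __init__(self) -> None:
        self._objects: Dict[str, bytes] = {}

    def read(self, path: str) -> bytes:
        return self._objects[path]

    def write(self, path: str, data: bytes) -> None:
        self._objects[path] = data


class Broker:
    """Routes requests to a named resource so callers never see the driver."""

    def __init__(self) -> None:
        self._resources: Dict[str, StorageDriver] = {}

    def register(self, resource: str, driver: StorageDriver) -> None:
        self._resources[resource] = driver

    def put(self, resource: str, path: str, data: bytes) -> None:
        self._resources[resource].write(path, data)

    def get(self, resource: str, path: str) -> bytes:
        return self._resources[resource].read(path)

    def copy(self, src: str, dst: str, path: str) -> None:
        # Copy between two resources through the same uniform interface.
        self.put(dst, path, self.get(src, path))
```

A caller would register one driver per storage system, for example `broker.register("disk-siteA", InMemoryStore())`, and then use `put`, `get`, and `copy` without knowing which kind of system sits behind each resource name.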
3. Information Management Hierarchy
- Persistent Archives
  - Storage of information model, data model, along with data
- Data Grid
  - Access to data in a different administration domain
- Digital Library
  - Presentation / Information Discovery - Interlib - ADEPT, UC Berkeley Digital Library
- Data Collection
  - Extensible Meta-data catalog - EMCAT
- Data handling
  - SDSC Storage Resource Broker - SRB
- Archival Storage
  - High performance storage system - HPSS
4. Digital Library Data Management
- Persistent identifiers
  - Ability to move a data set without the name changing
- Data set replicas
  - Management of multiple copies of a data set
- Archival backup of data sets
  - Integration of disk data caches with archival storage
- Persistent archives
  - Management of a collection through multiple cycles of technology evolution
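The persistent identifier and replica ideas above amount to a catalog that maps a stable logical name to one or more physical locations. The following is a minimal sketch under that assumption; `Replica`, `Catalog`, and the example names and paths are hypothetical and are not taken from MCAT or SRB.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Replica:
    resource: str  # hypothetical storage resource name
    path: str      # physical location on that resource


@dataclass
class Catalog:
    """Maps a persistent logical name to its physical replicas."""

    entries: Dict[str, List[Replica]] = field(default_factory=dict)

    def register(self, logical_name: str, replica: Replica) -> None:
        self.entries.setdefault(logical_name, []).append(replica)

    def replicas(self, logical_name: str) -> List[Replica]:
        return list(self.entries[logical_name])

    def move(self, logical_name: str, old: Replica, new: Replica) -> None:
        # Only the physical location changes; the logical name is untouched.
        replicas = self.entries[logical_name]
        replicas[replicas.index(old)] = new
```

In use, a data set registered as `"/collection/image-001"` with one replica on a disk cache and one in an archive can have its disk copy moved with `move(...)`, and every lookup by the logical name keeps working.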
5. SDSC Storage Resource Broker Meta-data Catalog
[Diagram; labels: Application, User, Resource, Third-party copy, Remote Proxies, MCAT, Dublin Core, DataCutter, Application Meta-data]
6. Common Information Model
- eXtensible Markup Language (XML)
  - Use tags to define semantic context for components of the data set
- Document Type Definition (DTD)
  - Provides semi-structured representation for organizing tags that can be applied to groups of digital objects
- Development of standards for tags
  - Digital sky, Protein Data Bank, Neuroscience brain images
  - California Digital Library - Art Museum Image Consortium
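As an illustration of tags giving semantic context, with a DTD organizing those tags, here is a small sketch. The element names (`dataset`, `title`, `creator`, `subject`) and the record contents are invented for this example and do not come from any of the collections named above. Python's standard `xml.etree.ElementTree` reads the document but does not validate it against the DTD; validation would need a separate validating parser.

```python
import xml.etree.ElementTree as ET

# Made-up record: the DTD declares which tags a data set description may use.
RECORD = """<!DOCTYPE dataset [
  <!ELEMENT dataset (title, creator, subject+)>
  <!ELEMENT title (#PCDATA)>
  <!ELEMENT creator (#PCDATA)>
  <!ELEMENT subject (#PCDATA)>
]>
<dataset>
  <title>Sample brain image</title>
  <creator>Example Lab</creator>
  <subject>neuroscience</subject>
  <subject>imaging</subject>
</dataset>"""


def describe(xml_text: str) -> dict:
    """Group the text of each child element under its tag name."""
    root = ET.fromstring(xml_text)
    fields: dict = {}
    for child in root:
        fields.setdefault(child.tag, []).append((child.text or "").strip())
    return fields
```

Calling `describe(RECORD)` returns a dictionary keyed by tag name, so software can find the subjects of a data set by looking up the `subject` tag instead of guessing at the meaning of raw text.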
7. Applications
- Support for distributed data collections
- Federation of data collections to form digital library
- Integration of digital libraries with archives
- Finding aids for federation of digital libraries through mediation of information
- Data grids for data access
- Persistent archives
8. Electronic Records Archive (ERA)
[Diagram; stages: Transfer, Accession, Archives, Reference. Components: Accessioning Work Bench (snap-in), Reference Workbench (snap-in), Media Handlers, Wrapper, Unwrapper, Metadata wrapper, Arrangement, Catalog, ARC, Metadata Repository, Records Repository, Query / Reference Tools, Retrieve Records, Presentation, Order Fulfillment. Media: tape, CD, disk. Record types: text, image, photo, video, audio, Geographical Information System, compound records, Web, database. Access: Internet, intranet.]
9. More Information
http://www.npaci.edu/DICE