METS at UC Berkeley - PowerPoint PPT Presentation

About This Presentation
Title:

METS at UC Berkeley

Description:

The concepts that drove the design. Shielding user from METS complexity. Highly configurable ... Access driven by login privileges. Use of Open Source software ... – PowerPoint PPT presentation

Number of Views:28
Avg rating:3.0/5.0
Slides: 22
Provided by: rickbe9
Learn more at: https://www.loc.gov
Category:
Tags: mets | berkeley | drove

less

Transcript and Presenter's Notes

Title: METS at UC Berkeley


1
METS at UC Berkeley
  • Generating METS Objects

2
Background
  • Kinds of materials
  • primarily imaged content tei encoded content
  • archival materials manuscripts and pictorial
    collections
  • oral histories
  • Kinds of Metadata
  • Structural metadata physical structure
  • Descriptive metadata
  • BasicTechnical metadata about digital files and
    how they were produced

3
Tools For Producing METS Objects
  • GenDB
  • Gathers structural, descriptive and technical
    metadata
  • GenX
  • Generates METS objects from GenDB

4
GenDB
  • Consists of
  • Relational database (Currently SQL Server)
  • Locally developed software for gathering metadata
    and facilitating digital processing

5
GenDB Database Structure
Structural Metadata
Structural Md Table
Div 1
Object 1
(root)
Div 2
(parent div 1)
Div 3
(parent div 1)

Object 2
Div 1
(root)
Div 2
(parent div 1)
Div 3
(parent div 2)
Div 4
(parent div 2)
6
GenDB Database Structure
Descriptive Metadata
Name Table
Structural Md Table
Div 1
Object 1
Core Desc Md
Name 1
Div 2
Core Desc Md
Name 2
Div 3
Core Desc Md
Name 3
Object 2
Div 1
Core Desc Md
Note Tables
Note 1
Div 2
Core Desc Md
Div 3
Core Desc Md
Note 2
Div 4
Core Desc Md
Note 3
7
GenDB Database Structure
Content File/Technical Md
Master Image Table
Structural Md Table
Mstr 1
Technical Md
Div 1
Object 1
Mstr 2
Technical Md
Div 2
Div 3
Derivative Image Table
Drv 1
Technical Md
Drv 2
Technical Md
Drv 3
Technical Md
Drv 4
Technical Md
8
Populating the Database Tables
  • Web interface manual input of structural and
    descriptive metadata
  • Digitization Management modules
  • Generate work orders to guide digitization
    process
  • Import content file information and technical
    metadata coming out of digitization process
  • Batch loader batch input based on TEI encodings,
    legacy metadata

9
Web Interface WebGenDB
Java Server
Web Interface
SQL Server Database
jdbc
rmi
Java Servlet
XML Config Files
10
Digitization Management Modules
Vendor
Imaging/ Transcription WorkOrders
Web Interface
Technical MD Spreadsheets
Java Server
Java Servlet
SQL Server Database
11
SQL Server Database
Batch Loader
Java Server
Web Interface
TEI Docs
XSLT
Java Servlet
Java Batch Loader
XML Batch Load File
12
WebGenDB
  • The concepts that drove the design
  • Shielding user from METS complexity
  • Highly configurable
  • Unicode support
  • Access driven by login privileges
  • Use of Open Source software and components
  • Distributed approach

13
XML Configuration Files
  • Three levels
  • Common to all projects elements
  • Common to all screens in a project elements
  • Specific to a screen in a project
  • Define fields common to all projects
  • Define fields used in specific project
  • Define screens by project object type

14
Relation among XML files
ObjectType1.xml
Proj1.xml
ObjectType2.xml
AlProjects.xml
ObjectType1.xml
Proj2.xml
ObjectType2.xml
15
Project XML file example
ltObjectTypegt ltnamegtworkorderlt/namegt
ltfileLocationgt /data/_w/GenDB/WEB-INF/cla
sses/edu/berkeley/library/propertyFiles/CalCulture
WorkOrderScreensFile.xml lt/fileLocationgt
lt/ObjectTypegt ltFieldgt ltnamegtImagelt/namegtltt
ypegtcheckboxlt/typegtltlabelgtImage
lt/labelgtltsizegt1lt/sizegt lt/Fieldgt ltFieldgt
ltnamegtTextlt/namegtlttypegtcheckboxlt/typegtltlabelgtText
lt/labelgtltsizegt1lt/sizegt lt/Fieldgt ltFieldgt
ltnamegtTitlelt/namegtlttypegttextlt/typegtltlabelgtTitle
lt/labelgtltsizegt60lt/sizegt lt/Fieldgt
16
Software used
  • MSSQL running on NT
  • Tomcat 4.1.2 implementing servlets 2.3
  • Jsdk 1.4
  • Xalan 2.4
  • Xerces 1.0.3
  • FOP 0.12.1
  • JDOM beta 8
  • Opta 2000

17
Relationship of GenDB to METS
  • Metadata not directly stored in METS, MODS or MIX
    schema formats.
  • Much of the database structure was developed
    before these standards emerged
  • Database structure and content adjusted to be
    compatible with all these formats

18
GenX From GenDB to METS
  • Allows Digital Publishing Group staff to select
    the objects in the GenDB database that are ready
    for export and to export them as METS objects.

19
GenX Architecture
App Interface
Java Application
METS XML Repository
JDBC
GenDB
20
GenX Output
  • METS output corresponding to version 1.3
  • Descriptive metadata exported to METS descMD in
    MODS 2.0 format
  • Technical Metadata exported to METS techMD in MIX
    format
  • Planned
  • Text technical md to METS descMD in NYU TextMD
  • Rights to METS rightsMD in ODRL subset

21
Links
  • GenDB Web Interface Demo
  • http//sunsite2.berkeley.edu/GenD
  • login demo
  • password demo
  • Developers
  • rbeaubie_at_library.berkeley.edu
  • ghill_at_library.berkeley.edu
  • jhassan_at_library.berkeley.edu
Write a Comment
User Comments (0)
About PowerShow.com