Grappling with Metadata - PowerPoint PPT Presentation

1 / 47
About This Presentation
Title:

Grappling with Metadata

Description:

'Data about data' data categories. Data describing a discrete data object or objects ... TGM Thesaurus of Graphic Materials. AAT Art and Architecture Thesaurus ... – PowerPoint PPT presentation

Number of Views:83
Avg rating:3.0/5.0
Slides: 48
Provided by: Goog183
Category:

less

Transcript and Presenter's Notes

Title: Grappling with Metadata


1
Grappling withMetadata
November 17, 2008
Leah Prescott Digital Projects Coordinator Getty
Research Institute
Master of St. Bartholomew, The Meeting of the
Three Kings (detail), ca. 1480 J. Paul Getty
Museum
2
What is Metadata?
  • Data about datadata categories
  • Data describing a discrete data object or
    objects
  • Cataloging or indexing information created to
    arrange, describe, and otherwise enhance access
    to an information object

3
What is Metadata?
"Metadata has aptly been compared to labels on
cans in the supermarket. Without labels, we would
have to open all the cans to see what is in them,
we would have to analyse their contents to know
whether the substance is edible or not, and so
on."
Metadata for Digital Resources
4
What is Metadata?
Thanks to the labels, we know what is in the can,
what it is made of, where it comes from, how much
it costs, up to what date we can safely eat it,
possibly how we can prepare it and how to
preserve it.
5
Why is metadata important?
  • Enhanced accessibility
  • Retention of context
  • Expanding use
  • Multi-versioning
  • Legal issues
  • Preservation of data

6
Don't go in blindfolded
Emblem (Premitur, non opprimitur) from Claude
Paradin, The heroicall devises of M. Claudius
Paradin (London W. Kearney, 1591)
http//emblem.libraries.psu.edu/parad176.htm
As you start planning for digital projects, you
need to be aware of the various options that
exist for creating metadata
7
Types of Metadata
  • Administrative
  • Descriptive
  • Preservation
  • Technical
  • Structural
  • Use

8
Why do we need them?
Standards
9
Standards
  • Fundamental to the efficient exchange of
    information
  • Essential for meaningful search-and-retrieval of
    information
  • To improve the quality and consistency of
    information
  • To improve compatibility of information
    structures 
  • To protect the long-term value of data
  • To facilitate information retrieval
  • To facilitate information exchange

10
Standards
Information standards and controlled vocabularies
can help extricate us from our metadata
dilemmas...
11
A brief (but related) detour Markup Languages
Glossary to EAD application guidelines http//www
.loc.gov/ead/ag/agappf.html
12
Putting the Pieces Together
13
DATA STANDARDS
  • Data Structure Standards
  • (Metadata Element Sets) 

MARC - Machine Readable Cataloging MODS
Metadata Object Description Schema EAD - Encoded
Archival Description Dublin Core - Core elements
(Dublin, Ohio) METS - Metadata Encoding and
Transmission Standard CDWA Categories for the
Description of Works of Art VRA Core - Core
elements of the Visual Resources Association
14
DATA STANDARDS
  • Data Structure Standards
  • (Metadata Element Sets) 

MARC - Machine Readable Cataloging http//www.loc
.gov/marc/
15
Library of Congress MARC record
16
Brief display of previous LC MARC record, with
field labels instead of numbers
17
MARC records can also be expressed in XML format
http//www.loc.gov/standards/ marcxml/
18
Classic MARC record
19
MARCXML record
20
DATA STANDARDS
  • Data Structure Standards
  • (Metadata Element Sets) 

MODS Metadata Object Description
Schema http//www.loc.gov/standards/ mods/
21
MODS
  • An initiative of Network Development and MARC
    Standards Office at the Library of Congress
  • Uses XML schema
  • Originally designed for library applications, but
    may be used for others
  • A derivative of MARC

22
MODS high-level elements
  • Note
  • Subject
  • Classification
  • Related item
  • Identifier
  • Location
  • Access conditions
  • Extension
  • Record Info
  • Title Info
  • Name
  • Type of resource
  • Genre
  • Origin Info
  • Language
  • Physical description
  • Abstract
  • Table of contents
  • Target audience

23
DATA STANDARDS
  • Data Structure Standards
  • (Metadata Element Sets) 

EAD - Encoded Archival Description http//www.lo
c.gov/ead/
24
EAD
25
DATA STANDARDS
  • Data Structure Standards
  • (Metadata Element Sets) 
  • Dublin Core - Core elements
  • (Dublin, Ohio)
  • http//dublincore.org/

26
Dublin Core
27
DATA STANDARDS
  • Data Structure Standards
  • (Metadata Element Sets) 

METS - Metadata Encoding and Transmission
Standard http//www.loc.gov/standards/mets/
28
Sample METS Object
29
(No Transcript)
30
DATA STANDARDS
  • Data Structure Standards
  • (Metadata Element Sets) 

CDWA Categories for the Description of Works of
Art http//www.getty.edu/research/ conducting_res
earch/standards/ cdwa/
31
CDWA
32
DATA STANDARDS
  • Data Structure Standards
  • (Metadata Element Sets) 

VRA Core - Core elements of the Visual Resources
Association http//www.vraweb.org/projects/ vraco
re4/
33
VRA Core
Lindisfarne Gospel
34
DATA STANDARDS
Data Value Standards (Vocabularies)
  • LCSH Library of Congress Subject Headings
  • LCNAF Library of Congress Name Authority File
  •  
  • TGM Thesaurus of Graphic Materials
  •  
  • AAT Art and Architecture Thesaurus
  • TGN Thesaurus of Geographic Names
  • ULAN Union List of Artist Names
  • and there are more...

35
DATA STANDARDS
Data Content Standards  (Cataloging Rules)
  • AACR (?RDA) Anglo-American Cataloging Rules
    (Resource Description and Access)
  •  
  • DACS Describing Archives A Content Standard
  •  
  • ISBD International Standard Bibliographic
    Description
  •  
  • CCO Cataloging Cultural Objects

36
DATA STANDARDS
Data format standards (standards expressed in
machine-readable form)
  • MARC Machine Readable Cataloging
  • MARCXML MARC expressed in XML
  •  
  • EAD Encoded Archival Description
  •  
  • CDWA Lite Categories for the Description of
    Works of Art - expressed in XML
  •  
  • Dublin Core Simple XML schema
  •  
  • DC Qualified XML schema
  •  
  • VRA Core XML schema

37
Mapping and Crosswalking
Connecticut History Online Project Dublin
Core/MARC Crosswalk
38
Issues with the Web...
  • Are your collections reachable by commercial
    search engines?
  • If yes, how will you contextualize individual
    collection objects?
  • If not, what is your strategy to lead Web users
    to your search page?

39
Visible Web vs. Deep Web
  • The Visible Web is what you see in the results
    pages from general Web search engines subject
    directories (static Web pages)
  • The Invisible or Deep Web consists of data from
    dynamically searchable databases that cannot be
    indexed by search engines, because they arent
    stored anywhere.

40
The Google factor
41
What Google looks at
  • Title tag
  • Text on the Web page
  • Referring links

42
What Google doesnt look at (mostly)
  • Keywords meta tag
  • Description meta tag

43
Challenges
  • How to assist users in unmediated searching,
    browsing, etc.
  • How to present large, complicated amounts of data
    in a way that users can understand and interpret
  • How to create cataloging for the Web
    harnessing and adapting the power of metadata and
    controlled vocabularies
  • How to provide reliable, up-do-date,
    authoritative metadata

44
Facing the Challenges
  • Institutions need to carefully chose and
    consistently apply metadata schemes to their
    collections information.
  • Application of vocabulary resources (including
    local authorities and thesauri) is essential for
    enhancing end-user access.
  • Use of picklists, thematic groupings, and
    browsing categories based on institutions
    organized data improves end-user access.

45
Facing the Challenges
  • Careful and consistent implementation of title
    tags and other metadata on Web pages facilitates
    end-user searching and retrieval of Web resources
  • Use metadata and usability analysis should be a
    routine part of digital library work.
  • Provide both searching and browsing
    functionalities (and carefully consider an
    advanced fielded searching option)

46
Facing the Challenges
  • Dont necessarily show all your data, nor make
    it all available for end-user searching
  • Dont create hyperlinks simply because you can
  • Dont let software dictate decisions
  • Create thematic groupings (based on carefully
    constructed metadata!) that reflect your
    collections and help your users
  • Study end-user behavior (including your own)

47
Thank you for your attention!
lprescott_at_getty.edu
Joseph Ducreux, Yawning (Self-Portrait), before
1783 J. Paul Getty Museum
Write a Comment
User Comments (0)
About PowerShow.com