Deconstructing Metadata - PowerPoint PPT Presentation

1 / 29
About This Presentation
Title:

Deconstructing Metadata

Description:

Metadata may have no structure 'This book I gave to Mary Baxter. ... Examples: XHTML, EAD, TEI, NDNP 'self-describing' When is metadata. created? ... – PowerPoint PPT presentation

Number of Views:30
Avg rating:3.0/5.0
Slides: 30
Provided by: kjl1
Category:

less

Transcript and Presenter's Notes

Title: Deconstructing Metadata


1
Deconstructing Metadata
  • Kathryn Lybarger

2
What is metadata?
  • data about data
  • Can be used to
  • Identify data
  • Interpret or use data
  • Search data
  • Manage data
  • Communicate about your data

3
Types of Metadata
  • Descriptive Metadata
  • Structural Metadata
  • Administrative Metadata
  • Preservation Metadata
  • Rights and Access Metadata

4
Examples
This space intentionally left blank.
5
What does metadata look like?
6
May be same format as data
Header added by Project Gutenberg ?
Ebook submitted ?
7
Metadata may have different format
Audio data
Text metadata
8
Metadata may have no structure
  • "This book I gave to Mary Baxter. After her
    death, I gave it to Mrs. Spruill. After her
    death, to Kate Wilson. She never read it, so on
    a visit to her, I took back for my own reading."

9
Metadata may have some structure
  • Word processors allow document properties
  • File name
  • Indicates content
  • Indicates format

10
Metadata may have rich structure
  • Example MARC record
  • Allows very detailed searching
  • Requires more expertise to create

11
XML eXtensible Markup Language
  • Many rich metadata formats are encoded as XML
  • A schema or DTD specifies rules which a document
    must follow
  • Examples XHTML, EAD, TEI, NDNP
  • self-describing

12
When is metadatacreated?Who creates metadata?
13
(No Transcript)
14
Where is metadata?
15
Metadata may be inside the data
  • Physical
  • Title page
  • Table of contents
  • Index
  • Digital
  • Header information

16
  • Binary data ?
  • Header information
  • in an image file
  • XML metadata ?

17
Metadata can be near the data
  • Title and author on the spine of a book
  • Associated .txt file with a .wav file
  • Alternate data streams (Windows)

18
Metadata can be gathered elsewhere
  • Card catalog
  • Index
  • Search engine

19
Metadata can be multiple places
microfilm
catalog
box lid
Bee S-50 Earlington, KY
98 1892 negative
20
How is metadata different from normal data?
  • No clear distinction!
  • Metadata is also data
  • Metadata can have metadata

21
Meta-metadata?
22
How much metadata?
  • Too little metadata?
  • Different objects may have the same metadata
  • Too much metadata?
  • You may never get started
  • Collection may take too long
  • Collection may be incomplete

23
Example NDNP
  • Issue metadata
  • Reel metadata
  • OCR (ALTO)
  • Image file headers
  • Preservation metadata (PREMIS)

24
NDNP Issue Metadata
25
Framework
  • ltmetsgt
  • lt!--METS HEADER--gt
  • ltmetsHdr gt
  • lt!--DESCRIPTIVE METADATA--gt
  • ltdmdSec ID"issueModsBib"gt
  • ltdmdSec ID"pageModsBib1"gt
  • lt!--FILE SECTION--gt
  • ltfileSecgt
  • lt!--STRUCTURAL MAP--gt
  • ltstructMap gt
  • lt/metsgt

26
Descriptive Metadata (Issue)
  • ltdmdSec ID"issueModsBib"gt
  • ltmdWrap MDTYPE"MODS" LABEL"Issue
    metadata"gt
  • ltxmlDatagt
  • ltmodsmodsgt
  • ltmodsrelatedItem
    type"host"gt
  • ltmodsidentifier
    type"lccn"gtsn86069162lt/modsidentifiergt
  • ltmodspartgt
  • ltmodsdetail
    type"volume"gt

  • ltmodsnumbergt28lt/modsnumbergt
  • lt/modsdetailgt
  • ltmodsdetail
    type"issue"gt

  • ltmodsnumbergt32lt/modsnumbergt
  • lt/modsdetailgt
  • lt/modspartgt
  • lt/modsrelatedItemgt
  • ltmodsoriginInfogt
  • ltmodsdateIssued
    encoding"iso8601"gt1902-01-01lt/modsdateIssuedgt
  • lt/modsoriginInfogt
  • ltmodsnote type"noteAboutRepr
    oduction"gtPresentlt/modsnotegt

27
Descriptive Metadata (Page)
  • ltdmdSec ID"pageModsBib1"gt
  • ltmdWrap MDTYPE"MODS" LABEL"Page
    metadata"gt
  • ltxmlDatagt
  • ltmodsmodsgt
  • ltmodspartgt
  • ltmodsextent
    unit"pages"gt
  • ltmodsstartgt1lt/modsst
    artgt
  • lt/modsextentgt
  • lt/modspartgt
  • ltmodsrelatedItem
    type"original"gt
  • ltmodsphysicalDescriptiongt
  • ltmodsform
    type"microfilm" /gt
  • lt/modsphysicalDescription
    gt
  • ltmodsidentifier
    type"reel number"gt0010047928Alt/modsidentifiergt
  • ltmodsidentifier
    type"reel sequence number"gt3lt/modsidentifiergt
  • ltmodslocationgt
  • ltmodsphysicalLocation
    authority"marcorg" displayLabel"University of
    Kentucky, Lexington, KY"gtKyUlt/modsphysicalLocatio
    ngt
  • lt/modslocationgt
  • lt/modsrelatedItemgt

28
File Metadata
  • ltfileSecgt
  • ltfileGrp ID"pageFileGrp1"gt
  • ltfile ID"masterFile1" USE"master"gt
  • ltFLocat LOCTYPE"OTHER"
    OTHERLOCTYPE"file" xlinkhref"0007.tif" /gt
  • lt/filegt
  • ltfile ID"serviceFile1"
    USE"service"gt
  • ltFLocat LOCTYPE"OTHER"
    OTHERLOCTYPE"file" xlinkhref"0007.jp2" /gt
  • lt/filegt
  • ltfile ID"otherDerivativeFile1"
    USE"derivative"gt
  • ltFLocat LOCTYPE"OTHER"
    OTHERLOCTYPE"file" xlinkhref"0007.pdf" /gt
  • lt/filegt
  • ltfile ID"ocrFile1" USE"ocr"gt
  • ltFLocat LOCTYPE"OTHER"
    OTHERLOCTYPE"file" xlinkhref"0007.xml" /gt
  • lt/filegt
  • lt/fileGrpgt

29
Structural Metadata
  • ltstructMap xmlnsnp"urnlibrary-of-congressndnp
    metsnewspaper"gt
  • ltdiv TYPE"npissue" DMDID"issueModsBib"gt
  • ltdiv TYPE"nppage"
    DMDID"pageModsBib1"gt
  • ltfptr FILEID"masterFile1" /gt
  • ltfptr FILEID"serviceFile1" /gt
  • ltfptr FILEID"otherDerivativeFile1
    " /gt
  • ltfptr FILEID"ocrFile1" /gt
  • lt/divgt
  • ltdiv TYPE"nppage"
    DMDID"pageModsBib2"gt
  • lt/divgt
Write a Comment
User Comments (0)
About PowerShow.com