Title: Deconstructing Metadata
1Deconstructing Metadata
2What is metadata?
- data about data
- Can be used to
- Identify data
- Interpret or use data
- Search data
- Manage data
- Communicate about your data
3Types of Metadata
- Descriptive Metadata
- Structural Metadata
- Administrative Metadata
- Preservation Metadata
- Rights and Access Metadata
4Examples
This space intentionally left blank.
5What does metadata look like?
6May be same format as data
Header added by Project Gutenberg ?
Ebook submitted ?
7Metadata may have different format
Audio data
Text metadata
8Metadata may have no structure
- "This book I gave to Mary Baxter. After her
death, I gave it to Mrs. Spruill. After her
death, to Kate Wilson. She never read it, so on
a visit to her, I took back for my own reading."
9Metadata may have some structure
- Word processors allow document properties
- File name
- Indicates content
- Indicates format
10Metadata may have rich structure
- Example MARC record
- Allows very detailed searching
- Requires more expertise to create
11XML eXtensible Markup Language
- Many rich metadata formats are encoded as XML
- A schema or DTD specifies rules which a document
must follow - Examples XHTML, EAD, TEI, NDNP
- self-describing
12When is metadatacreated?Who creates metadata?
13(No Transcript)
14Where is metadata?
15Metadata may be inside the data
- Physical
- Title page
- Table of contents
- Index
- Digital
- Header information
16- Binary data ?
- Header information
- in an image file
- XML metadata ?
17Metadata can be near the data
- Title and author on the spine of a book
- Associated .txt file with a .wav file
- Alternate data streams (Windows)
18Metadata can be gathered elsewhere
- Card catalog
- Index
- Search engine
19Metadata can be multiple places
microfilm
catalog
box lid
Bee S-50 Earlington, KY
98 1892 negative
20How is metadata different from normal data?
- No clear distinction!
- Metadata is also data
- Metadata can have metadata
21Meta-metadata?
22How much metadata?
- Too little metadata?
- Different objects may have the same metadata
- Too much metadata?
- You may never get started
- Collection may take too long
- Collection may be incomplete
23Example NDNP
- Issue metadata
- Reel metadata
- OCR (ALTO)
- Image file headers
- Preservation metadata (PREMIS)
24NDNP Issue Metadata
25Framework
- ltmetsgt
- lt!--METS HEADER--gt
- ltmetsHdr gt
-
- lt!--DESCRIPTIVE METADATA--gt
- ltdmdSec ID"issueModsBib"gt
- ltdmdSec ID"pageModsBib1"gt
- lt!--FILE SECTION--gt
- ltfileSecgt
- lt!--STRUCTURAL MAP--gt
- ltstructMap gt
- lt/metsgt
26Descriptive Metadata (Issue)
- ltdmdSec ID"issueModsBib"gt
- ltmdWrap MDTYPE"MODS" LABEL"Issue
metadata"gt - ltxmlDatagt
- ltmodsmodsgt
- ltmodsrelatedItem
type"host"gt - ltmodsidentifier
type"lccn"gtsn86069162lt/modsidentifiergt - ltmodspartgt
- ltmodsdetail
type"volume"gt -
ltmodsnumbergt28lt/modsnumbergt - lt/modsdetailgt
- ltmodsdetail
type"issue"gt -
ltmodsnumbergt32lt/modsnumbergt - lt/modsdetailgt
- lt/modspartgt
- lt/modsrelatedItemgt
- ltmodsoriginInfogt
- ltmodsdateIssued
encoding"iso8601"gt1902-01-01lt/modsdateIssuedgt - lt/modsoriginInfogt
- ltmodsnote type"noteAboutRepr
oduction"gtPresentlt/modsnotegt
27Descriptive Metadata (Page)
- ltdmdSec ID"pageModsBib1"gt
- ltmdWrap MDTYPE"MODS" LABEL"Page
metadata"gt - ltxmlDatagt
- ltmodsmodsgt
- ltmodspartgt
- ltmodsextent
unit"pages"gt - ltmodsstartgt1lt/modsst
artgt - lt/modsextentgt
- lt/modspartgt
- ltmodsrelatedItem
type"original"gt - ltmodsphysicalDescriptiongt
- ltmodsform
type"microfilm" /gt - lt/modsphysicalDescription
gt - ltmodsidentifier
type"reel number"gt0010047928Alt/modsidentifiergt - ltmodsidentifier
type"reel sequence number"gt3lt/modsidentifiergt - ltmodslocationgt
- ltmodsphysicalLocation
authority"marcorg" displayLabel"University of
Kentucky, Lexington, KY"gtKyUlt/modsphysicalLocatio
ngt - lt/modslocationgt
- lt/modsrelatedItemgt
28File Metadata
- ltfileSecgt
- ltfileGrp ID"pageFileGrp1"gt
- ltfile ID"masterFile1" USE"master"gt
- ltFLocat LOCTYPE"OTHER"
OTHERLOCTYPE"file" xlinkhref"0007.tif" /gt - lt/filegt
- ltfile ID"serviceFile1"
USE"service"gt - ltFLocat LOCTYPE"OTHER"
OTHERLOCTYPE"file" xlinkhref"0007.jp2" /gt - lt/filegt
- ltfile ID"otherDerivativeFile1"
USE"derivative"gt - ltFLocat LOCTYPE"OTHER"
OTHERLOCTYPE"file" xlinkhref"0007.pdf" /gt - lt/filegt
- ltfile ID"ocrFile1" USE"ocr"gt
- ltFLocat LOCTYPE"OTHER"
OTHERLOCTYPE"file" xlinkhref"0007.xml" /gt - lt/filegt
- lt/fileGrpgt
29Structural Metadata
- ltstructMap xmlnsnp"urnlibrary-of-congressndnp
metsnewspaper"gt - ltdiv TYPE"npissue" DMDID"issueModsBib"gt
- ltdiv TYPE"nppage"
DMDID"pageModsBib1"gt - ltfptr FILEID"masterFile1" /gt
- ltfptr FILEID"serviceFile1" /gt
- ltfptr FILEID"otherDerivativeFile1
" /gt - ltfptr FILEID"ocrFile1" /gt
- lt/divgt
- ltdiv TYPE"nppage"
DMDID"pageModsBib2"gt - lt/divgt
-