Descriptive Metadata - PowerPoint PPT Presentation

1 / 35
About This Presentation
Title:

Descriptive Metadata

Description:

Descriptive Metadata When will mods.xml be used by METS (aip.xml) ? METS will use the mods.xml to encode descriptive metadata. Information that describes, classifies ... – PowerPoint PPT presentation

Number of Views:132
Avg rating:3.0/5.0
Slides: 36
Provided by: nek47
Category:

less

Transcript and Presenter's Notes

Title: Descriptive Metadata


1
Descriptive Metadata
  • When will mods.xml be used by METS (aip.xml) ?
  • METS will use the mods.xml to encode descriptive
    metadata.
  • Information that describes, classifies, and
    characterizes the identity of the content.
  • How will METS (aip.xml) access MODS (mods.xml)?
  • METS uses a pointer to the metadata that is
    located outside of the METS document. More
    specifically, it uses a xlinkhref to indicate
    the location of mods file.
  • For example
  • lt!-- MODS object  --gt
  • ltmetsdmdSec ID"DMD_OTHER"gt
  • ltmetsmdRef ID"D09002ee180b004da" MDTYPE"MODS"
    MIMETYPE"text/xml" LOCTYPE"URL"
    xmlnsxlink"http//www.w3.org/1999/xlink"
    xlinkhref"file/mods.xml" /gt
  • lt/metsdmdSecgt

2
MODS
  • MODS elements considered mandatory are essential
    to the ingestion of the content to a repository
  • Those elements are the following
  • originInfo
  • language
  • identifier
  • location
  • physicalDescription
  • typeOfResource
  • recordInfo

3
originInfo
  • originInfo Information that pertains to the
    origin of the resource
  • Recommended If Applicable sub-elements of
    ltoriginInfogt
  • ltpublishergt name of entity
  • ltdateIssuedgt date published
  • Optional sub-element
  • ltissuancegt informs how the resource was issued
  • For example
  • ltoriginInfogt
  • ltpublishergtU.S. Government Printing
    Officelt/publishergt
  • ltdateIssued encoding"w3cdtf"gt2010-09-29lt/dateIss
    uedgt
  • ltissuancegtmonographiclt/issuancegt
  • lt/originInfogt

4
language
  • language of the resource
  • Required sub-elements
  • ltlanguageTermgt is a repeatable sub-element that
    details whether the language of the resource is
    in textual form or coded form.
  • For example
  • ltlanguagegt
  • ltlanguageTerm type"code" authority"iso639
    2b"gtenglt/languageTermgt
  • lt/languagegt

5
Identifier
  • identifier a unique number or code that
    identifies a resource
  • lttypegt is a required attribute, that indicates
    the type of identifier. For example, the value
    local refers to a local identifier.
  • For Example
  • ltidentifier type"local"gtV0b002ee180b003e5lt/identi
    fier

6
location
  • location indicate the repository holding the
    resource or a URL where the resource is available
  • lturlgt it is a mandatory sub-element that refers
    to the Uniform Resource Location for the resource
  • ltdisplayLabelgt is an attribute the provides
    information associated with the location
  • ltaccessgt is an attribute that indicates the type
    resource that will be accessed via the URL link

7
location example
  • For example
  • ltlocation xmlnsexslt"http//exslt.org/common"gt
  • lturl displayLabel"HTML rendition" access"raw
    object"gthttp//www.gpo.gov/fdsys/pkg/BILLS-111s388
    0is/html/BILLS-111s3880is.htmlt/urlgt
  • lturl displayLabel"PDF rendition" access"raw
    object"gthttp//www.gpo.gov/fdsys/pkg/BILLS-111s388
    0is/pdf/BILLS-111s3880is.pdflt/urlgt
  • lturl displayLabel"XML rendition" access"raw
    object"gthttp//www.gpo.gov/fdsys/pkg/BILLS-111s388
    0is/xml/BILLS-111s3880is.xmllt/urlgt
  • lt/locationgt

8
physicalDescription
  • physicalDescription contains all sub-elements
    that relate to the physical description
    information of the resource
  • ltnotegt is a recommended if applicable
    sub-element, contains physical description that
    does not fall under any other sub-element.
  • ltdigitalOrigingt is a required sub-element, that
    describes the method use to achieve digital form
    of the resource
  • ltextentgt is a recommended if applicable
    sub-element, describes the number of units that
    make up the resource

9
physicalDescription Example
  • For example
  • ltphysicalDescriptiongt
  • ltnote type"source contenttype"gtdepositedlt/notegt
  • ltdigitalOrigingtborn digitallt/digitalOrigingt
  • ltextentgt7 p.lt/extentgt
  • lt/physicalDescriptiongt

10
typeOfResource
  • typeOfResource information describing the form
    of the resource
  • For example
  • lttypeOfResourcegttextlt/typeOfResourcegt

11
recordInfo
  • recordInfo contains information pertain metadata
  • Required sub-elements
  • ltlanguageOfCataloginggt refers to the language of
    the text of the MODS record
  • ltlanguageTermgt required attribute, refers to the
    language of the metadata reocrd
  • Recommended sub-elements
  • ltrecordContentSourcegt information about the
    metadata of the original record such as who
    created it or modified it
  • ltrecordOrigingt shows the origin of MODS record
  • Optional sub-elements
  • ltrecordCreationDategt the date the record was
    created
  • ltrecordChangeDategt the date the record was
    modified
  • ltrecordIdentifiergt contains the organization and
    the system control number assigned to it

12
recordInfo Example
  • For example
  • ltrecordInfogt
  • ltrecordContentSource authority"marcorg"gtDGPOlt/r
    ecordContentSourcegt
  • ltrecordCreationDate encoding"w3cdtf"gt2010-10-06lt
    /recordCreationDategt
  • ltrecordChangeDate encoding"w3cdtf"gt2010-10
    -06lt/recordChangeDate
  • ltrecordIdentifier source"DGPO"gtBILLS-111s3880is
    lt/recordIdentifiergt
  • ltrecordOrigingtmachine generatedlt/recordOrigingt
  • ltlanguageOfCataloginggt ltlanguageTerm
    type"code" authority"iso639-2b"gtenglt/languageTer
    mgt
  • lt/languageOfCataloginggt
  • lt/recordInfogt

13
Preservation Metadata
  • When will premis.xml be used by METS (aip.xml) ?
  • METS will use the premis.xml to encode
    preservation metadata.
  • information a repository uses to support the
    digital preservation process
  • Which includes information such as
  • Provenance refers to who has ownership of the
    digital object
  • Authenticity refers to the claim of the digital
    object
  • Preservation activity refers to the activities
    that have been carried out to preserve the
    digital object
  • Technical environment refers to the tasks
    required to interpret and use the digital object
  • Rights management refers the intellectual
    property rights that must be declared

14
Preservation Metadata
  • How will METS (aip.xml) access PREMIS
    (premis.xml)?
  • METS uses a pointer to the metadata that is
    located outside of the METS document. More
    specifically, it uses a xlinkhref to indicate
    the location of mods file.
  • Example code from aip.xml
  • lt!-- PREMIS OBJECT   --gt
  • ltmetsamdSec ID"AMD_OTHER"gt
  • ltmetstechMD ID"D09002ee180affcca-TEC"gt
  • ltmetsmdRef ID"M09002ee180affcca-tdiv"
    MDTYPE"PREMIS" MIMETYPE"text/xml"
    LOCTYPE"URL" xmlnsxlink"http//www.w3.org/1999
    /xlink" xlinkhref"file/premis.xml" /gt
  • lt/metstechMDgt
  • ltmetsdigiprovMD ID"D09002ee180affcca-DIG"gt
  • ltmetsmdRef ID"M09002ee180affcca-ddiv"
    MDTYPE"PREMIS" MIMETYPE"text/xml"
    LOCTYPE"URL" xmlnsxlink"http//www.w3.org/1999
    /xlink" xlinkhref"file/premis.xml" /gt
  • lt/metsdigiprovMDgt
  • lt/metsamdSecgt

15
PREMIS Data Model
16
PREMIS Intellectual Entity
  • Intellectual Entity refer to content that can
    be describe as a unit (e.g. books, maps,
    articles)

17
PREMIS Object Entity
  • Objects refer to units of information in
    digital form. PREMIS defines different kinds of
    objects it can an a file, bitstream or
    representation
  • File it is a computer file, such as a pdf, txt
    or JPEG
  • Bitstream refer to data bits within a file that
    contain common properties for preservation
    purposes

18
PREMIS Object Entity
  • Representation refer to a set of files, that
    includes structural metadata, required to be
    identified, stored and maintained in order to
    assemble a complete rendition of an Intellectual
    unit.
  • For example, text files and images files of a
    magazine are required to form a representation.

19
PREMIS Object Entity
  • PREMIS Data Dictionary defines the mandatory
    semantic units (elements) of object entity
  • Those elements are the following
  • objectIdentifier
  • objectCategory
  • objectCharacteristics
  • format
  • storage
  • Indicates repeatable semantic unit

20
objectIdentifier
  • objectIdentifier refers to the unique identifier
    of the object
  • ltobjectIdentifierTypegt refers to the
    classification of the domain that creates the
    object identifier.
  • ltobjectIdentifierValuegt value of the object
    identifier.

21
objectIdentifier Example
  • For example
  • ltobject xsitype"representation"gt
  • ltobjectIdentifiergt
  • ltobjectIdentifierTypegtFDsys ACPlt/objectIdentifie
    rTypegt ltobjectIdentifierValuegtR0b002ee180b003b0lt/o
    bjectIdentifierValuegt
  • lt/objectIdentifiergt
  • lt/objectgt

22
objectCharacteristics
  • objectCharacteristics refers to the technical
    properties of a file
  • ltcompositeLevelgt indicates if the object is
    subject to one or more processes of decoding or
    unbundling under
  • ltfixitygt used to verify if an object has been
    changed in an undocumented or unauthorized way
    under
  • ltsizegt the size of the object
  • ltformatgt the format information of the object

23
objectCharacteristics Example
  • ltobjectCharacteristicsgt
  • ltcompositionLevelgt0lt/compositionLevelgt
  • ltfixitygt
  • ltmessageDigestAlgorithmgtSHA-256lt/messageDiges
    tAlgorithmgt
  • ltmessageDigestgt4977070b92f0bb2642c6be368ad
    68a8d1d1c5dbbb3310544db781f56a860b0a1lt/messageDige
    stgt
  • ltmessageDigestOriginatorgtFDsyslt/messageDigest
    Originatorgt
  • lt/fixitygt
  • ltsizegt9326lt/sizegt
  • ltformatgt
  • ltformatDesignationgt
  • ltformatNamegttext/plainlt/formatNamegt
  • lt/formatDesignationgt
  • ltformatRegistrygt
  • ltformatRegistryNamegtPRONOMlt/formatRegistryNa
    megt
  • ltformatRegistryKeygtx-fmt/111lt/formatRegistry
    Keygt
  • lt/formatRegistrygt
  • ltformatNotegtPlain Text Filelt/formatNotegt
  • lt/formatgt
  • lt/objectCharacteristicsgt

24
storage
  • storage information about where and how a files
    are stored in the repository
  • ltcontentLocationgt information needed to retrieve
    a file from a storage system.
  • ltcontentLocationTypegt refers to the way of
    accessing the location of the content.
  • ltcontentLocationValuegt refers to the location
    of the content used by the storage system
  • ltstorageMediumgt The medium on which an object is
    stored

25
storage Example
  • ltstoragegt
  • ltcontentLocationgt
  • ltcontentLocationTypegtURIlt/contentLocationTypegt
  • ltcontentLocationValuegtfile/u02/app/emc/document
    um /data/fdsysprod1/fdsysprod1/content_storage_01/
    00002 ee1/80/55/b0/48.txtlt/contentLocationValuegt
  • lt/contentLocationgt
  • ltstorageMediumgthard disklt/storageMediumgt
  • lt/storagegt

26
PREMIS Event Entity
  • Events refers to actions that involve an object
    and an agent known to the system
  • Events are critical for maintaining the digital
    provenance of an object (helps demonstrates the
    authenticity of the object)
  • Examples of Events
  • modifying an document
  • actions that create new relationships
  • Object could be related to another object as a
    result of a particular event, for instance if a
    program takes file 1 and generates a different
    version known as file 2
  • Actions that check the validity and integrity of
    the objects (i.e. virus scan)

27
PREMIS Event Entity
  • PREMIS Data Dictionary defines the mandatory
    semantic units (elements) of event entity
  • Those elements are the following
  • eventIdentifier
  • eventType
  • eventDateTime

28
Event Entity
  • lteventIdentifiergt unique identifier for the
    event
  • lteventIdentifierTypegt refers to the
    classification of the domain that creates the
    event identifier.
  • lteventIdentifierValuegt value of the event
    identifier.
  • lteventTypegt classifies the nature of the event.
  • lteventDataTimegt date, time and type of event
  • lteventDetailgt detail description of the event
  • lteventOctomeInformationgt the outcome of the
    event
  • lteventOutcomegt indicates if the event was a
    success, partial success, or failure.

29
Event Entity
  • ltlinkingAgentIdentifiergt agents involved in the
    event and their specific roles Agents role are
    defined here because agents can perform different
    roles in different events
  • ltlinkingAgentIdentifierTypegt refers to the
    classification of the domain that creates the
    linking agent identifier.
  • ltlinkingAgentIdentifierValuegt value of the
    linking agent identifier.
  • ltlinkingAgentRolegt indicates the role of the
    agent associated to the event.
  • ltlinkingObjectIdentifiergt Objects involved in
    the event and their specific roles
  • ltlinkingObjectIdentifierTypegt refers to the
    classification of the domain that creates the
    linking object identifier.
  • ltlinkingObjectIdentifierValuegt value of the
    linking object identifier.
  • ltlinkingObjectRolegt indicates the role of the
    object associated to the event.

30
PREMIS Event Example
  • lteventgt
  • lteventIdentifiergt
  • lteventIdentifierTypegtFDsyseventlt/eventIdentif
    ierTypegt
  • lteventIdentifierValuegt1cdd2b6c-5a2d-449b-b386-
    ebb15eb4af11lt/eventIdentifierValuegt
  • lt/eventIdentifiergt
  • lteventTypegtRendition Submittedlt/eventTypegt
  • lteventDateTimegt2010-10-06T193847-0400lt/event
    DateTimegt
  • lteventDetailgtRendition R0b002ee180b003b0,
    uploaded by hotfolderadmin, was submitted in the
    Submission Information package P0b002ee180b003aflt/
    eventDetailgt
  • lteventOutcomeInformationgt
  • lteventOutcomegtSuccesslt/eventOutcomegt
  • lt/eventOutcomeInformationgt
  • ltlinkingAgentIdentifiergt
  • ltlinkingAgentIdentifierTypegtFDsysagentlt/linki
    ngAgentIdentifierTypegt
  • ltlinkingAgentIdentifierValuegthotfolderadminlt/l
    inkingAgentIdentifierValuegt
  • ltlinkingAgentRolegtimplementerlt/linkingAgentRol
    egt
  • lt/linkingAgentIdentifiergt
  • ltlinkingObjectIdentifiergt
  • ltlinkingObjectIdentifierTypegtFDsyslt/linkingObj
    ectIdentifierTypegt
  • ltlinkingObjectIdentifierValuegtR0b002ee180b003b
    0lt/linkingObjectIdentifierValuegt

31
PREMIS Agent Entity
  • Agents refer to people, organizations, or
    software associated with events, more
    specifically preservation events, of an object
  • In the data model diagram, there is no arrow from
    Agent entity to the Object entity, that is
    because Agents influence Objects indirectly
    through Events.

32
PREMIS Agent Entity
  • PREMIS Data Dictionary defines the mandatory
    semantic unit (element) of agent entity
  • That element is the following
  • ltagentIdentifiergt is a repeatable semantic unit

33
Agent Entity
  • agentIdentifier unique identifier for the agent
  • ltagentIdentifierTypegt refers to the
    classification of the domain that creates the
    agent identifier.
  • ltagentIdentifierValuegt value of the agent
    identifier
  • ltagentNamegt the agents name
  • ltagentTypegt the type of agent (people,
    organization, or software)

34
PREMIS Agent Example
  • ltagentgt
  • ltagentIdentifiergt
  • ltagentIdentifierTypegtFDsysagentlt/agentIdentif
    ierTypegt
  • ltagentIdentifierValuegthotfolderadminlt/agentIde
    ntifierValuegt
  • lt/agentIdentifiergt
  • ltagentNamegthotfolderadminlt/agentNamegt
  • ltagentTypegtPersonlt/agentTypegt
  • lt/agentgt

35
References
  • Data Dictionary for Preservation Metadata
  • http//www.oclc.org/research/activities/past/orpro
    jects/pmwg/premis-final.pdf
  • Digital Library Federation/Acquifer
    Implementation Guidelines for Shareable MODS
    Records
  • https//wiki.dlib.indiana.edu/download/attachments
    /24288/DLFMODS_ImplementationGuidelines.pdf
  • Fdsys Requirements Document
  • http//www.gpo.gov/pdfs/fdsys-info/documents/FDsys
    _RD_v2.1.pdf
  • MODS User Guidelines
  • http//www.loc.gov/standards/mods/userguide/
  • MODS Uses and Features
  • http//www.loc.gov/standards/mods/mods-overview.ht
    ml
  • Understanding PREMIS
  • http//www.loc.gov/standards/premis/understanding-
    premis.pdf
  • W3C Schools
  • http//www.w3schools.com/xml/default.asp
Write a Comment
User Comments (0)
About PowerShow.com