Title: Metadata and Discovery data portal tools
1. Metadata and Discovery (data portal tools)
Compiled by Allison Gaylord and Robert Bochenek
AOOS DMAC Data Management Workshop, January 27, 2005
2. I. Supply a Definition
Metadata is data about data. Metadata describes
the content, quality, condition, and other
characteristics of data. Metadata describes the
who, what, when, where, why, and how about a data
set. - NOAA Coastal Services Center
3. I. Supply a Definition
Two aspects of metadata:
- Syntactic: the structure of the metadata file, how it is organized. Computers use the structure to search and retrieve information. Allows for interoperability.
- Semantic: the meaning of the terms in the metadata file. Details data acquisition, methods, procedures, sensors, data processing algorithms, QA/QC methods, etc. Computers cannot use this, yet.
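To illustrate the syntactic point, here is a minimal Python sketch of a structural search. The two records and their element names (`metadata`, `title`, `keyword`) are made up for illustration and do not follow any particular standard. It only shows that a shared structure lets a program find records without understanding what the terms mean.

```python
import xml.etree.ElementTree as ET

# Two tiny, made-up records that share one structure.
# Element names are illustrative, not taken from any metadata standard.
RECORDS = [
    """<metadata><title>Sea surface temperature, mooring A</title>
       <keyword>temperature</keyword><keyword>mooring</keyword></metadata>""",
    """<metadata><title>Chlorophyll survey</title>
       <keyword>chlorophyll</keyword></metadata>""",
]


def titles_with_keyword(records, keyword):
    """Return the titles of records whose <keyword> elements include `keyword`."""
    hits = []
    for xml_text in records:
        root = ET.fromstring(xml_text)
        # Collect the record's keywords, ignoring case and empty elements.
        keywords = {k.text.strip().lower() for k in root.findall("keyword") if k.text}
        if keyword.lower() in keywords:
            hits.append(root.findtext("title"))
    return hits


print(titles_with_keyword(RECORDS, "temperature"))
```

The search works only because both records put keywords in the same place. Whether "temperature" means the same measurement in each record is the semantic question, which the code cannot answer.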
4. I. Supply a Definition
DATA DICTIONARIES
Data dictionaries provide a standard vocabulary for metadata descriptors and promote data discovery.
Example: An individual collecting sea surface temperature may describe his or her measurement as SEATEMP in the metadata; another researcher performing the exact same measurement may describe it as Temp or T. In terms of providing metadata, both researchers have met their responsibility, but neither has adhered to the restrictions dictated by a data dictionary.
Consideration: Some metadata specs have their own data dictionaries, which can be validated; some don't.
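The SEATEMP / Temp / T example can be sketched in Python as a lookup against a controlled vocabulary. This is a minimal sketch: the preferred term `sea_surface_temperature` and the layout of the dictionary are assumptions made for illustration, not part of any real data dictionary.

```python
# Hypothetical data dictionary: one preferred term and the variants it replaces.
# The preferred term is an assumption; the variants come from the slide's example.
DATA_DICTIONARY = {
    "sea_surface_temperature": {"seatemp", "temp", "t"},
}


def normalize(term, dictionary=DATA_DICTIONARY):
    """Return the preferred term for `term`, or None if the dictionary lacks it."""
    key = term.strip().lower()
    for preferred, variants in dictionary.items():
        if key == preferred or key in variants:
            return preferred
    return None


for raw in ["SEATEMP", "Temp", "T", "salinity"]:
    print(raw, "->", normalize(raw))
```

A term that comes back as `None` fails validation. At that point a metadata tool could reject the record or ask the author to pick a term from the dictionary.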
5. II. List candidates for achieving the task
Consider the types of data and potential candidates:
- Real Time (ISO, SensorML, MarineXML, NDBC moored buoy spec)
- Gridded (ISO, FGDC, DIF, HDF)
- In Situ (ISO, FGDC, EML)
6. II. List candidates for achieving the task
- There are dozens, but let's limit the discussion to the most widely known.
- ISO 19115 is a geospatial metadata standard developed by ISO/TC 211. ISO 19115 defines a comprehensive metadata model for geographic objects. ISO/TC 211 also defined a smaller set of core metadata elements (shown on the example slide). This core contains the minimum elements that satisfy the requirements of an ISO-conformant metadata record. The ISO 19115 standard does not specify a storage format, but XML schemas are under development for an XML encoding of it (in full or for specialized profiles).
- CSDGM/FGDC (Content Standard for Digital Geospatial Metadata) is a standard for metadata for geographic objects developed by the FGDC (Federal Geographic Data Committee). However, this standard is not limited to spatial data. FGDC enables development of profiles, i.e. customization of the standard to suit the needs of a particular application domain (while staying within the framework of the standard).
- SPOT imagery FGDC example
- http://gcmd.nasa.gov/servlets/md/getdif.py?entry_idGCMDCANEMRCCRSSPOTxsldif_to_fgdc-html.xslcurrentTabcurrentItemportalgcmd
- EML (Ecological Metadata Language)
- http://knb.ecoinformatics.org/data.html
- SensorML (Sensor Model Language) is an Open GIS Consortium (OGC) initiative that aims to provide an XML schema for defining the geometric, dynamic, and observational characteristics of a sensor. The goal is to make all types of Web-resident devices (e.g., flood gauges, stress gauges on bridges, mobile heart monitors, Web cams, and satellite-borne earth imaging devices) discoverable and accessible using standard services and schemas.
- NDBC spec (National Data Buoy Center) ?
7. II. List candidates for achieving the task
- Continued
- DIF (Directory Interchange Format): 6 required fields; ASCII format; compatible with FGDC
- SPOT imagery DIF example
- http://gcmd.nasa.gov/records/GCMD_CANEMRCCRSSPOT.html
- Dublin Core: established by an international, cross-disciplinary group of professionals from librarianship, computer science, text encoding, and the museum community to describe a wide range of networked resources. Widely used in Europe and Australia.
- MarineXML: under development. EU-funded project run by an international consortium to develop a framework for XML interoperability within the marine sector. It will not generate an alternative standard, but will develop mechanisms for registering, extending and cross-walking emerging standards.
- NcML (NetCDF Markup Language) defines metadata for generic NetCDF data. NetCDF is a common format for observational datasets in global oceanographic programmes such as Argo.
- HDF (Hierarchical Data Format) is a multi-object file format that facilitates the transfer of various types of data between machines and operating systems.
8. II. List candidates for achieving the task
ISO (more optional entries than FGDC)
9. II. List candidates for achieving the task
FGDC
10. III. Discuss the candidates, including Pros and Cons
- Consider only XML/GML based standards.
- Interoperability challenges: does a standard have metadata crosswalks available (to map elements between standards)?
- ISO: international standard that almost all metadata tools strive to comply with.
- FGDC: US Federal agency standard, widely accepted in the US. Typically required as a deliverable for Federal grants and contracts. This format is required by NSDI clearinghouse nodes to support data discovery. Evolved to meet ISO Metadata Standard 19115.
- DIF (Directory Interchange Format): established standard (been around for 16 years but the DIF structure was "frozen" on September 18, 1987). The DIF does not compete with other metadata standards. It is simply the "container" for the metadata elements that are maintained in the IDN database, where validation for mandatory fields, keywords, personnel, etc. takes place.
- EML (Ecological Metadata Language): more extensive than FGDC; has an FGDC translator for contributing to the NSDI clearinghouse system to promote data discovery.
- SensorML (Sensor Model Language): far more thorough than DIFs for satellite imagery; supported by the CEOS working group and OGC. Interoperability testing underway Fall 2004. Is there a crosswalk for this one? http://vast.uah.edu/SensorML/index.html
- MarineXML: international working group funded to develop a proposed standard. Examples: http://www.aodc.gov.au/products/prod/documentation/marine_xml_schema.html
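Since crosswalks come up repeatedly on this slide, here is a minimal Python sketch of the idea. It maps a few FGDC CSDGM element paths onto ISO 19115-style element names. The mapping table is a small illustration, not an official or complete crosswalk. The sample record is made up, and the output is a flat dictionary rather than a valid ISO XML record.

```python
import xml.etree.ElementTree as ET

# Illustrative mapping only: FGDC CSDGM element path -> ISO 19115-style name.
# A real crosswalk covers many more elements and handles structure and repeats.
CROSSWALK = {
    "idinfo/citation/citeinfo/title": "title",
    "idinfo/descript/abstract": "abstract",
    "idinfo/spdom/bounding/westbc": "westBoundLongitude",
    "idinfo/spdom/bounding/eastbc": "eastBoundLongitude",
    "idinfo/spdom/bounding/northbc": "northBoundLatitude",
    "idinfo/spdom/bounding/southbc": "southBoundLatitude",
}


def crosswalk(fgdc_xml, mapping=CROSSWALK):
    """Return (mapped values, source paths that were absent from the record)."""
    root = ET.fromstring(fgdc_xml)
    mapped, missing = {}, []
    for source_path, target_name in mapping.items():
        value = root.findtext(source_path)
        if value is None:
            missing.append(source_path)  # element not present in this record
        else:
            mapped[target_name] = value.strip()
    return mapped, missing


# Made-up record for illustration; it deliberately omits <southbc>.
SAMPLE = """<metadata><idinfo>
  <citation><citeinfo><title>Example mooring dataset</title></citeinfo></citation>
  <descript><abstract>Made-up record for illustration.</abstract></descript>
  <spdom><bounding><westbc>-170.0</westbc><eastbc>-140.0</eastbc>
  <northbc>72.0</northbc></bounding></spdom>
</idinfo></metadata>"""

mapped, missing = crosswalk(SAMPLE)
print(mapped)
print("not found in source record:", missing)
```

Reporting the elements that could not be mapped matters as much as the mapping itself, because gaps between two standards are where discovery information is lost.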
11. IV. Provide Demonstrations
12. IV. Provide Demonstrations
Metadata and Data Discovery Portals
- Excellent Internet Map Server portal that integrates data / metadata and makes it available for download (unfortunately, the Alaska data is sparse): http://www.ncddc.noaa.gov/COOS/GIS_Mapping/
- National Spatial Data Infrastructure Clearinghouse Search: http://clearinghouse1.fgdc.gov/servlet/FGDCServlet
- Alaska's Cooperatively Implemented Information Management System: http://info.dec.state.ak.us/ciimms/
- http://www.asgdc.state.ak.us/metadata/vector/biologic/m_mammal/sealhout.html
- Geographic Information Network of Alaska (GINA) Metadata Catalog: http://map.gina.alaska.edu/metadataexplorer/explorer.jsp
13. IV. Provide Demonstrations
Metadata Creation/Management Tools
- Metalite (USGS): FGDC Core metadata creation. http://edcnts11.cr.usgs.gov/MetaLite/
- Metadata Parser (MP): FGDC parser for clearinghouse submission. http://geology.usgs.gov/tools/metadata/tools/doc/mp.html
- Metacat and Morpho (KNB): creation of EML type metadata and management of XML based metadata files. http://knb.ecoinformatics.org/software/
- Mermaid (NCDDC): centralized metadata management tools. http://www.ncddc.noaa.gov/Metadata/tools
- Environmental Systems Research Institute (ESRI) ArcGIS ArcCatalog module: FGDC / ISO. www.esri.com
- NPS Metadata Tools Extension: www.nature.nps.gov/im/units/mwr/gis/metadata/metadata_tools.htm
14. V. Discuss Emerging Standards
- Tools for crosswalking metadata will be increasingly popular.
- International Organisation for Standardisation (ISO) Technical Committee (TC) 211's Metadata Standard 19115 was adopted June 2004 (FGDC is compatible with this standard).
- Harmonisation and adoption of OGC standards within ISO TC211
- (ISO 19000) series of standards include:
- 19135 Procedures for registration of geographic items
- 19110 Feature Type Cataloguing methodology
- 19126 UML/XML implementation of 19110, 19135
- 19136 GML
- 19139 GML/19115 metadata implementation
- Because OGC is the world's authoritative source for standards related to geoprocessing interoperability, and because OGC has strong international industry and government support in domains that depend on sensors, it's likely that SensorML will quickly become established in all areas where such a standard can be of use.
- FGDC ISO transitional
15. VI. Recommendations for AOOS
- Adopt standards that are FGDC compliant or have FGDC crosswalks, including:
- FGDC/ISO for geospatial data
- EML for geospatial ecological data
- SensorML for sensors (if there's a crosswalk for this one?)
- Data elements required for moored arrays (see MM_Report)
- Consider the development of custom FGDC profiles to meet the needs of the marine community.
- Specify that data management activities (specifically data archiving and metadata) are budgeted at a minimum percentage for all AOOS projects.
- Follow the discussion and conclusions of the Marine Metadata Working Group.
- Provide periodic training in the standard/standards for AOOS participants.