Ontologies in Data and Application Integration an Update - PowerPoint PPT Presentation

1 / 62
About This Presentation
Title:

Ontologies in Data and Application Integration an Update

Description:

Knowledge-Based Information Systems Lab. Data and Knowledge Systems (DAKS) ... Simple ASCII-based RDF and OWL entry and manipulation. Semantic Data Registration ... – PowerPoint PPT presentation

Number of Views:74
Avg rating:3.0/5.0
Slides: 63
Provided by: bertr68
Category:

less

Transcript and Presenter's Notes

Title: Ontologies in Data and Application Integration an Update


1
Ontologies in Data and Application Integration
an Update
Kai Lin Bertram Ludäscher Knowledge-Based
Information Systems Lab Data and Knowledge
Systems (DAKS) San Diego Supercomputer
Center University of California San Diego
http//www.geongrid.org
2
Outline
  • Motivation
  • Ontology Cheat Sheet
  • Ontology-enabled Prototypes and Tools
  • Data Service Registration (Structural
    Semantic)
  • Scientific Workflows

3
(No Transcript)
4
Ontology Cheat Sheet (1/2)
  • What is an ontology? An ontology usually
  • specifies a theory (a set of models) by
  • defining and relating
  • concepts representing features of a domain of
    interest
  • Also an overloaded (sometimes sloppy) term for
  • Controlled vocabularies
  • Database schema (relational, XML, )
  • Conceptual schema (ER, UML, )
  • Thesauri (synonyms, broader term/narrower term)
  • Taxonomies
  • Informal/semi-formal representations
  • Concept spaces, concept maps
  • Labeled graphs / semantic networks (RDF)
  • Formal ontologies, e.g., in Description Logic
    (OWL)
  • formalization of a specification
  • ? constrains possible interpretation of terms

5
A Multi-Hierarchical Rock Classification
Ontology (GSC)
Genesis
Fabric
Composition
Texture
6
Ontology Cheat Sheet (2/2)
  • What are ontologies used for?
  • Conceptual models of a domain or application,
    (communication means, system design, )
  • Classification of
  • concepts (taxonomy) and
  • data/object instances through classes
  • Analysis of ontologies e.g.
  • Graph queries (reachability, path queries, )
  • Reasoning (concept subsumption, consistency
    checking, )
  • Targets for semantic data registration
  • Conceptual indexes and views for
  • searching,
  • browsing,
  • querying, and
  • integration of registered data

7
Application Example Geologic Map Integration
domain knowledge
Knowledge representation Ontologies!?
Nevada
8
Geologic Map Integration in the Portal
  • After registering datasets, ontologies (here
    classes), and an application (OMI), the
    datasets can be searched and displayed in an
    integrated way.

9
Concept-Based Queries and Analysis
  • After registering a source with one or more
    ontologies, concept-based queries and analysis
    can be launched
  • Here light-weight client-side processing (SVG)

10
Ontologies and Data Management
  • Where do ontologies fit within data management
    architectures?
  • Several answers, specifically
  • An ontology is similar to a schema or conceptual
    model if one exists, but is
  • Developed independently of a particular
    application
  • Probably given in a different language
  • Inherently more general
  • Usually not a very good schema (weak structure)

11
Ontologies and Data Management(? watch out for
Semantic Data Registration later)
Ontology
use concepts from (explicitly or implicitly)
Design Artifact
Conceptual Model
Conceptual Model
Schema
Schema
Schema
Schema
? Metadata
Data
12
Creating and Sharing Concept Maps (here
Seismology concept map Cmap tool)
  • Lock up scientists for 2 days
  • Add CS/KRDB types
  • Create concept maps
  • Refine
  • Iterate
  • ? from napkin drawings, to concept maps, to
    ontologies

13
(No Transcript)
14
(No Transcript)
15
(No Transcript)
16
Graph (RDF) Queries on Ontologies
visualisation
RQL Query Show all products
Query Results
17
Community-Based Ontology Development
  • Current concept maps and
  • emerging ontologies
  • Igneous Rocks/Plutons
  • Seismology
  • Geochemistry
  • Draft of a geochemistry ontology developed by
    scientists

18
Protégé ( not so ezOWL yet)
19
Sparrow (a poor mans OWL tool )
  • Simple ASCII-based RDF and OWL entry and
    manipulation

20
Semantic Data Registration(joint work w/ Shawn
Bowers)
21
What is Data/Ontology/ Registration?
  • A mechanism by which data sources, ontologies,
    services,
  • are published in a repository/registry
  • for the purpose of smart discovery, querying,
    integration

22
Things to Register
  • Data files (individual files)
  • Shapefile as a blob ( file type)
  • Collections (of files nested eg satellite data)
  • Databases (has schema and can be queried)
  • Shapefile with schema registered
  • Ontologies
  • Services (web grid services)
  • Other/external applications

23
Connecting Datasets to Ontologies
Ontology (snippet)
How can we register the dataset to concepts in
the Ontology?
Dataset
Date Site Transect SP_Code Count
2000-09-08 CARP 1 CRGI 0 2000-09-08 CARP 4
LOCH 0 2000-09-08 CARP 7 MUCA 1
2000-09-22 NAPL 7 LOCH 1 2000-09-18 NAPL 1
PAPA 5 2000-09-28 BULL 1 CYOS 57
24
Step1 Selecting Relevant Concepts
Concepts from an Ontology
  • DataCollectionEvent
  • AbundanceCollectionEvent
  • Measurement
  • Abundance
  • SpeciesAbundance
  • MeasurementContext
  • Location
  • LTERSite
  • SBLTERSite
  • naples
  • MeasurableItem
  • SpeciesCount
  • Species

Dataset
Date Site Transect SP_Code Count
2000-09-08 CARP 1 CRGI 0 2000-09-08 CARP 4
LOCH 0 2000-09-08 CARP 7 MUCA 1
2000-09-22 NAPL 7 LOCH 1 2000-09-18 NAPL 1
PAPA 5 2000-09-28 BULL 1 CYOS 57
25
Step1 Selecting Relevant Concepts
Concepts from an Ontology
  • DataCollectionEvent
  • AbundanceCollectionEvent
  • Measurement
  • Abundance
  • SpeciesAbundance
  • MeasurementContext
  • Location
  • LTERSite
  • SBLTERSite
  • naples
  • MeasurableItem
  • SpeciesCount
  • Species

Dataset
Date Site Transect SP_Code Count
2000-09-08 CARP 1 CRGI 0 2000-09-08 CARP 4
LOCH 0 2000-09-08 CARP 7 MUCA 1
2000-09-22 NAPL 7 LOCH 1 2000-09-18 NAPL 1
PAPA 5 2000-09-28 BULL 1 CYOS 57
26
Step2 Generate Object Model
Concepts from an Ontology
  • DataCollectionEvent
  • AbundanceCollectionEvent
  • Measurement
  • Abundance
  • SpeciesAbundance
  • MeasurementContext
  • Location
  • LTERSite
  • SBLTERSite
  • naples
  • MeasurableItem
  • SpeciesCount
  • Species

Abundance Collection Event
contains
measureOf
SpeciesAbundance
SpeciesCount
hasSpecies
hasValue
hasUnit
Species
RatioUnit
RatioValue
hasTime
hasLoc
DateTime
SBLTERSite
27
(No Transcript)
28
(No Transcript)
29
Applications of Semantic Registration
  • Mentioned before
  • Smart data discovery, integration etc.
  • New application
  • Generating data transformation semi-automatically
    for chaining together computational services

30
Problem Service Reusability
  • Unless designed to fit, independent services
    are structurally incompatible
  • Generally, the source output type will not be a
    subtype of the target input type

Incompatible
StructuralType Pt
StructuralType Ps
(?)
Desired Connection
Source Service
Target Service
Pt
Ps
31
Service Reusability
  • A data transformation mapping (?) is required to
    connect the services artificially creating
    subtype compatibility
  • If such a ? exists, the services are
    structurally feasible

Incompatible
StructuralType Pt
StructuralType Ps
(?)
?
?(Ps)
Desired Connection
Source Service
Target Service
Pt
Ps
32
Service Reusability
  • Idea
  • annotate services with semantic types (concept
    expressions) primarily for discovery of services

Ontologies (OWL)
Compatible
(?)
SemanticType Ps
SemanticType Pt
Desired Connection
Source Service
Target Service
Pt
Ps
33
Service Reusability
  • Services can be semantically compatible, but
    structurally incompatible

Ontologies (OWL)
Compatible
(?)
SemanticType Ps
SemanticType Pt
Incompatible
StructuralType Pt
StructuralType Ps
(?)
?
?(Ps)
Desired Connection
Source Service
Target Service
Pt
Ps
34
The Ontology-Driven Framework (work w/ Shawn
Bowers, SEEK)
Ontologies (OWL)
Compatible
(?)
SemanticType Ps
SemanticType Pt
Registration Mapping (Input)
Registration Mapping (Output)
StructuralType Pt
StructuralType Ps
Correspondence
?(Ps)
Generate
Source Service
Target Service
Transformation
Pt
Ps
Desired Connection
35
Example Generated Data Transformation (in XQuery)
  • Based on the structural correspondences and
    certain assumptions, we derive the transformation
    query

ltcohortTablegt for s in /population/sample
return ltmeasurementgt for c in
s/meas/cnt return ltobsgtc/text()lt/obsgt
for l in s/lsp return ltphasegtl/text()lt/pha
segt lt/measurementgt lt/cohortTablegt
36
Scientific Workflows(Efrat Jaeger et al.)
37
Reverse Engineering a Scientific Workflow using
the KEPLER Tool (Efrat Jaeger)
38
A Scientific Workflow in Kepler
Extract mineral composition for row Id.
Igneous Rock Diagrams information.
Rock Name.
39
A Scientific Workflow in Kepler
40
A Scientific Workflow in Kepler
41
(No Transcript)
42
Reverse-Engineered the Geological Map Integration
in Kepler
43
DataMapper Sub-Workflow
44
Result launched via the BrowserUI actor
45
KEPLER and YOU
  • Kepler
  • is a community-based, cross-project, open source
    collaboration
  • for minute made application integration
  • using web (grid) services as basic building
    blocks
  • has a joint CVS repository, mailing lists, web
    site,
  • is gaining momentum thanks to contributors and
    contributions
  • BSD-style license allows commercial spin-offs
  • a pre-packaged, shrink-wrapped version
    (Kepler-to-GO) coming soon to a place near you

46
F I N Questions?
47
Additional Material
48
The KEPLER GUI (Vergil from Ptolemy II)
Drag and drop utilities, director and actor
libraries.
49
Running the workflow
50
Distributed Workflows in KEPLER
  • Web and Grid Service plug-ins
  • WSDL
  • ProxyInit, GlobusGridJob, GridFTP,
    DataAccessWizard
  • SRB
  • SSH, SCP
  • Web Service Harvester
  • Imports all the operations of a specific WS (or
    of all
  • the WSs in a UDDI repository) as Kepler actors
  • XSLT and XQuery transformers to link non-fitting
    services together
  • Web Service Deployment (ongoing work)

51
A Generic Web Service Actor
  • Given a WSDL and the name of an operation of a
    web service, dynamically customizes itself to
    implement and execute that method.

52
Set Parameters and Commit
Set parameters and commit
53
WS Actor after Instantiation
54
Web Service Harvester
  • Imports the web services in a repository into
    the actor library.
  • Has the capability to search for web services
    based on a keyword.

55
Composing 3rd-Party WSs
Input of next web service
User interaction Transformations
56
Providing DB Access through Kepler
  • Database connection actor
  • Opening a database connection and passing it to
    all actors accessing this database.
  • Database query actor
  • A generic actor that queries a database and
    provides its result.
  • DBConnection type and DBConnectionToken
  • A new IOPort type and a token to distinguish a
    database connection from any general type.

57
Database Connection Actor
  • OpenDBConnection actor
  • Input database connection information.
  • Output A DBConnectionToken, a reference to a
    database connection instance, through a
    DBConnection output port.

58
Database Query Actor
  • Database Query actor
  • Input A query string (SQL) and a database
    connection reference.Parameters output type
    XML, Record or String.
    output each row separately or all at once.
    Process Execute query. Produce results
    according to parameters.

59
Querying Example
60
Resource Description Framework (RDF)
  • Simple data model that consists of
  • Resources (uniquely identified via URIs)
  • Properties
  • Values (resources or character strings)
  • Data organized into triples (subject, property,
    value)

locatedIn
SonomaRegion
CaliforniaRegion
Property (Resource)
Subject (Resource)
Value (Resource)
locatedIn(SonomaRegion, California)
61
RDF Schema
  • Adds a set of pre-defined properties to define
    classes and properties
  • Allows instances to be connected to classes
  • Sub-class and sub-property (is-a) relationships

Region is a class locatedIn is a
property locatedIn connects Regions
locatedIn
Region
rdftype
rdftype
locatedIn
CaliforniaRegion
SonomaRegion
62
OWL
  • Adds additional pre-defined properties to further
    constrain an ontology
  • (See http//www.w3.org/TR/owl-guide/)
  • Note, RDF(S) and OWL use XML
  • Some graphic tools exist (e.g., Protégé)

A Vintage is a class that is a subclass of an
unnamed class whose instances always have one
hasVintageYear property.
ltowlClass rdfID"Vintage"gt ltrdfssubClassOfgt
ltowlRestrictiongt ltowlonProperty
rdfresource"hasVintageYear"/gt
ltowlcardinalitygt1lt/owlcardinalitygt
lt/owlRestrictiongt lt/rdfssubClassOfgt
lt/owlClassgt
Note the uglified XML syntax The good news
meant for parsers, not humans!
Write a Comment
User Comments (0)
About PowerShow.com