Federated Database Systems

1
Federated Database Systems
  • Archana Meka
  • Ravi Kanth Kolli

2
What is Data Integration?
  • The problem of combining data residing at
    different sources and providing the user with a
    unified view of these data.
  • Work on the problem started around the 1960s

3
(No Transcript)
4
Federated Databases
  • A federated database system transparently
    integrates multiple autonomous database systems
    into a single federation
  • Term coined by Heimbigner and McLeod (1985)
  • Pros: a component DBS can continue its local
    operations while participating in the federation
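
A minimal sketch of the idea above, assuming two hypothetical component databases (`hr` and `payroll`, with invented schemas) held in memory with Python's `sqlite3`. The function `federated_salaries` is an illustrative name, not part of any federation product; it only shows that a unified answer can be assembled while each component keeps serving its own local operations.

```python
import sqlite3

# Two autonomous component databases (hypothetical schemas, in memory).
hr = sqlite3.connect(":memory:")
hr.execute("CREATE TABLE employee (id INTEGER, name TEXT)")
hr.executemany("INSERT INTO employee VALUES (?, ?)", [(1, "Ada"), (2, "Linus")])

payroll = sqlite3.connect(":memory:")
payroll.execute("CREATE TABLE salary (emp_id INTEGER, amount REAL)")
payroll.executemany("INSERT INTO salary VALUES (?, ?)", [(1, 5000.0), (2, 4200.0)])


def federated_salaries(hr_db, payroll_db):
    """Combine rows from both components without copying either database."""
    names = dict(hr_db.execute("SELECT id, name FROM employee"))
    rows = payroll_db.execute("SELECT emp_id, amount FROM salary ORDER BY emp_id")
    return [(names[emp_id], amount) for emp_id, amount in rows if emp_id in names]


# A local operation on one component still works independently of the federation.
hr.execute("INSERT INTO employee VALUES (3, 'Grace')")

result = federated_salaries(hr, payroll)
print(result)
```

The join happens in the federation layer, so neither component needs to know about the other's schema.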

5
Approaches for Federation
  • Global View Integration
  • Multi Database Language Approach
  • Mediated Query Systems
  • Ontology Driven Approach

6
Ontology Driven Approach
  • All the databases are first converted to a common
    model for easy access
  • Uses ontologies as a common interface for all the
    databases
  • Queries

7
Process
  • Find ontologies related to databases
  • Develop ontologies
  • Extend and merge ontologies
  • Matching
  • Add mappings
  • Query Languages

8
Step 1
  • Reusing resources
  • Checking if there are ontologies in the same or
    a similar domain as the database
  • Creating new ontologies
  • New ontologies for databases

9
Step 2
  • Merging of ontologies
  • Merging similar ontologies, or several smaller
    ones, into a single larger one
  • Matching
  • Different databases may have the same concepts
  • Matching between the schemas of the ontologies
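
The matching step above can be illustrated with a deliberately simple stand-in: comparing concept names by string similarity using Python's `difflib`. This is an assumption made for illustration only; the slides do not say which matching technique is used, and real ontology matchers also consider structure and instances. The concept names and the `0.8` threshold are invented.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive string similarity between two concept names (0.0 to 1.0)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match_concepts(onto_a, onto_b, threshold=0.8):
    """Pair each concept in onto_a with its most similar concept in onto_b."""
    if not onto_b:
        return []
    matches = []
    for a in onto_a:
        best = max(onto_b, key=lambda b: similarity(a, b))
        if similarity(a, best) >= threshold:
            matches.append((a, best))
    return matches


# Hypothetical concept lists taken from two ontologies.
matches = match_concepts(
    ["Author", "Publication", "Venue"],
    ["author", "Publications", "Journal"],
)
print(matches)
```

Name similarity misses true synonyms (here "Venue" and "Journal" are left unmatched), which is one reason explicit mappings are added in the next step.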

10
Step 3
  • Add mappings
  • Mapping database schema and instances to
    ontologies
  • Use a declarative language for mapping
  • D2R mapping from RDBMS to RDF

11
D2R Mapping
  • Extract the record sets
  • Record sets are grouped
  • Class instances are created and assigned URIs
  • Instance properties are created
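
The four steps above can be sketched in plain Python. This is not the D2R mapping language itself; it only mimics the steps over a hypothetical `author_paper` table, with an invented `http://example.org/` namespace and triples represented as tuples.

```python
import sqlite3
from itertools import groupby

BASE = "http://example.org/"  # hypothetical namespace
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE author_paper (author_id INTEGER, name TEXT, paper TEXT)")
db.executemany(
    "INSERT INTO author_paper VALUES (?, ?, ?)",
    [(1, "Ada", "Paper A"), (1, "Ada", "Paper B"), (2, "Alan", "Paper C")],
)


def map_to_triples(conn):
    # Step 1: extract the record set with a SQL query.
    rows = conn.execute(
        "SELECT author_id, name, paper FROM author_paper ORDER BY author_id, paper"
    ).fetchall()
    triples = []
    # Step 2: group the records that describe the same author.
    for author_id, group in groupby(rows, key=lambda row: row[0]):
        group = list(group)
        # Step 3: create one class instance per group and assign it a URI.
        uri = f"{BASE}author/{author_id}"
        triples.append((uri, RDF_TYPE, BASE + "Author"))
        # Step 4: create the instance properties from the column values.
        triples.append((uri, BASE + "name", group[0][1]))
        for _, _, paper in group:
            triples.append((uri, BASE + "wrote", paper))
    return triples


triples = map_to_triples(db)
for triple in triples:
    print(triple)
```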

12
Contd.
13
Query
  • Various methods
  • Query the database using SQL or another database
    query language
  • Query ontologies using SPARQL, etc.
  • Provide an advanced GUI
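
To contrast the first two methods, the sketch below asks the same question twice: once in SQL through `sqlite3`, and once as a triple-pattern match over RDF-style tuples. The `match` helper is a hypothetical stand-in for a SPARQL engine (roughly the pattern `?s ex:name ?n`), not a SPARQL implementation; the table, prefix, and data are invented.

```python
import sqlite3

# Relational side: query the database directly with SQL.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE author (id INTEGER, name TEXT)")
db.executemany("INSERT INTO author VALUES (?, ?)", [(1, "Ada"), (2, "Alan")])
sql_names = [name for (name,) in db.execute("SELECT name FROM author ORDER BY id")]

# Ontology side: the same data as (subject, predicate, object) triples.
triples = [
    ("ex:author1", "ex:name", "Ada"),
    ("ex:author2", "ex:name", "Alan"),
    ("ex:author1", "ex:wrote", "ex:paper1"),
]


def match(pattern, data):
    """Return the triples that fit a pattern; None acts as a variable."""
    return [t for t in data if all(p is None or p == v for p, v in zip(pattern, t))]


rdf_names = [obj for _, _, obj in match((None, "ex:name", None), triples)]
print(sql_names, rdf_names)
```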

14
A Few Systems
  • The Garlic System
  • InfoHarness
  • InfoSleuth
  • OBSERVER

15
The Garlic System
16
Contd.
  • Data repositories
  • Include relational and non-relational database
    systems, file systems, and image managers
  • Repository Wrapper
  • Converts data from a repository's own format to
    the Garlic format
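
The wrapper idea can be sketched as one small adapter per repository, each converting its native records into a shared shape. Everything here is hypothetical: the class names, the CSV and JSON inputs, and the `{"type", "id", "title"}` dictionary that stands in for the Garlic format, which the slides do not describe.

```python
import csv
import io
import json


class CsvImageWrapper:
    """Wraps a hypothetical image catalogue stored as CSV text."""

    def __init__(self, text):
        self.text = text

    def objects(self):
        for row in csv.DictReader(io.StringIO(self.text)):
            yield {"type": "Image", "id": row["id"], "title": row["caption"]}


class JsonDocumentWrapper:
    """Wraps a hypothetical document store that returns JSON text."""

    def __init__(self, text):
        self.text = text

    def objects(self):
        for item in json.loads(self.text):
            yield {"type": "Document", "id": str(item["doc_id"]), "title": item["name"]}


def scan(wrappers):
    """Collect objects from every repository in the common format."""
    return [obj for wrapper in wrappers for obj in wrapper.objects()]


unified = scan([
    CsvImageWrapper("id,caption\n7,Sunset\n"),
    JsonDocumentWrapper('[{"doc_id": 42, "name": "Report"}]'),
])
print(unified)
```

Code above the wrappers sees only the common shape, so adding a new kind of repository means writing one more wrapper.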

17
Contd.
  • Metadata Repository
  • Holds information on the unified Garlic schema
    and translation-related information
  • Complex Object Repository
  • Holds the complex objects that Garlic applications
    need to combine the underlying data in new ways

18
Contd.
  • Query Processing and data manipulation
  • Provides a unified object-oriented view of the
    database for Garlic applications
  • Queries are expressed in an object-oriented
    extension of SQL
  • C API that provides an interface to Garlic
    Applications

19
InfoHarness
  • A metadata management system with a generic
    metadata storage system as its metabase
  • Research ran from 1993 to 1996
  • Some of the research was commercialized as the
    AdaptX Harness platform, system and related
    services from Bellcore
  • InfoHarness is a trademark of Bellcore.

20
(No Transcript)
21
InfoSleuth
22
(No Transcript)
23
OBSERVER Global Architecture
24
OBSERVER
  • Query Processor
  • Takes as input a user query expressed in
    description logic (DL) using terms from a chosen
    user ontology
  • Interontology Relationships Manager (IRM)
  • Stores interontology relationships that relate
    the terms in various ontologies
  • Solution to the vocabulary sharing problem
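
A rough sketch of how a relationships manager could rewrite query terms from one ontology into another. The table below models only synonym relationships, and the ontology names (`bib`, `library`), the terms, and the `translate` function are all invented for illustration; they are not taken from OBSERVER.

```python
# (source ontology, term) -> {target ontology: equivalent term}
SYNONYMS = {
    ("bib", "Author"): {"library": "Creator"},
    ("bib", "Paper"): {"library": "Article"},
}


def translate(terms, source, target):
    """Rewrite query terms into the target ontology's vocabulary.

    Returns the translated terms plus the terms with no known equivalent.
    """
    translated, untranslated = [], []
    for term in terms:
        equivalent = SYNONYMS.get((source, term), {}).get(target)
        if equivalent is not None:
            translated.append(equivalent)
        else:
            untranslated.append(term)
    return translated, untranslated


translated, untranslated = translate(["Author", "Paper", "Venue"], "bib", "library")
print(translated, untranslated)
```

Returning the untranslated terms separately lets the caller decide whether a partial answer from the target ontology is acceptable.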

25
  • Ontology Server
  • Provides information about the ontologies
    residing in the node
  • Ontology
  • It is a set of terms of interest in a particular
    information domain
  • Semantically rich metadata capturing information
    content of underlying data repositories

26
Problems
  • Semantic Heterogeneity
  • Query processor

27
Semantic Heterogeneity
  • Occurs when there is a disagreement about the
    meaning, interpretation or intended use of the
    same or related data
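
A small invented example of the problem: two sources both expose a field called `price`, but one records dollars and the other cents. Comparing the raw values gives the wrong answer until the units are reconciled. The sources, items, and units are assumptions made for illustration.

```python
# Source A stores prices in dollars; source B stores prices in cents.
source_a = [{"source": "A", "item": "disk", "price": 80}]
source_b = [{"source": "B", "item": "disk", "price": 7900}]


def cheapest(records):
    """Return the record with the smallest price value."""
    return min(records, key=lambda record: record["price"])


# Naive integration: the field names agree, but the meanings do not.
naive = cheapest(source_a + source_b)

# Reconciled integration: convert source B to dollars before comparing.
normalized = source_a + [{**r, "price": r["price"] / 100} for r in source_b]
correct = cheapest(normalized)

print(naive["source"], correct["source"])
```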

28
References
  • http://lsdis.cs.uga.edu/lib/download/105-InfoHarness.pdf - InfoHarness
  • http://www.almaden.ibm.com/cs/people/fagin/ride95.pdf - Garlic System