Mini-Ontology Generation from Canonicalized Tables - PowerPoint PPT Presentation

About This Presentation
Title:

Mini-Ontology Generation from Canonicalized Tables

Description:

Mini-Ontology GeneratOr (MOGO) Mini-Ontology Generation from Canonicalized Tables ... NOTE: MOGO implements a base set of algorithms for each step of the process and ... – PowerPoint PPT presentation

Number of Views:19
Avg rating:3.0/5.0
Slides: 11
Provided by: tang1
Learn more at: https://tango.byu.edu
Category:

less

Transcript and Presenter's Notes

Title: Mini-Ontology Generation from Canonicalized Tables


1
Mini-Ontology Generation from Canonicalized Tables
  • Stephen Lynn
  • Data Extraction Research Group
  • Department of Computer Science
  • Brigham Young University

Supported by the
2
TANGO Overview
TANGO Table ANalysis for Generating
Ontologies Project consists of the following
three components
  1. Transform tables into a canonicalized form
  2. Generate mini-ontologies
  3. Merge into a growing ontology

3
Thesis Statement
  • Proposed Solution
  • Develop a tool to accurately generate
    mini-ontologies from canonicalized tables of data
    automatically, semi-automatically, or manually.
  • Evaluation
  • Evaluate accuracy of tool with respect to
    concept/value recognition, relationship
    discovery, and constraint discovery.

4
Sample Input
Region and State Information Region and State Information Region and State Information Region and State Information
Location Population (2000) Latitude Longitude
Northeast 2,122,869
Delaware 817,376 45 -90
Maine 1,305,493 44 -93
Northwest 9,690,665
Oregon 3,559,547 45 -120
Washington 6,131,118 43 -120
Sample Output
5
Mini-Ontology GeneratOr (MOGO)
  • Concept/Value Recognition
  • Relationship Discovery
  • Constraint Discovery

NOTE MOGO implements a base set of algorithms
for each step of the process and allows for
runtime integration of new algorithms.
6
Concept/Value Recognition
  • Lexical Clues
  • Data value assignment
  • Labels as data values
  • Default
  • Classifies any unclassified elements according to
    simple heuristic.

7
Relationship Discovery
  • Dimension Tree Mappings
  • Lexical Clues
  • Generalization/Specialization
  • Aggregation
  • Data Frames
  • Ontology Fragment Merge

8
Constraint Discovery
  • Generalization/Specialization
  • Computed Values
  • Functional Relationships
  • Optional Participation

Region and State Information Region and State Information Region and State Information Region and State Information
Location Population (2000) Latitude Longitude
Northeast 2,122,869
Delaware 817,376 45 -90
Maine 1,305,493 44 -93
Northwest 9,690,665
Oregon 3,559,547 45 -120
Washington 6,131,118 43 -120
9
Validation
  • Concept/Value Recognition
  • Correctly identified concepts
  • Missed concepts
  • False positives
  • Data values assignment
  • Relationship Discovery
  • Valid relationship sets
  • Invalid relationship sets
  • Missed relationship sets
  • Constraint Discovery
  • Valid constraints
  • Invalid constraints
  • Missed constraints

Precision Recall
Concept Recognition
Relationship Discovery
Constraint Discovery
10
Contribution
  • Tool to generate mini-ontologies
  • Assessment of accuracy of automatic generation
Write a Comment
User Comments (0)
About PowerShow.com