Using Non-Taxonomic Knowledge to Improve Semantic Matching - PowerPoint PPT Presentation

About This Presentation

Title:

Using Non-Taxonomic Knowledge to Improve Semantic Matching

Description:

Many AI tasks require determining whether two knowledge ... Bumper. Gold. has-part. material. A: 'Acme makes a car made of Gold.' Knowledge Acquisition ... – PowerPoint PPT presentation

Number of Views:32

Avg rating:3.0/5.0

Slides: 50

Provided by: duderoc

Learn more at: https://www.cs.utexas.edu

Category:

more less

Transcript and Presenter's Notes

Title: Using Non-Taxonomic Knowledge to Improve Semantic Matching

1
Using Non-Taxonomic Knowledge to Improve Semantic
Matching

Peter Yeh
July 22, 2003

2
Talk Outline

Introduction
Analysis of Existing Techniques
Our Approach
Initial Evaluation
Proposed Work

3
Introduction

Many AI tasks require determining whether two
knowledge representations encode the same
knowledge.

4
Information Retrieval

Match queries with documents.

Q A car with a bumper made of gold.
A Acme makes a car made of Gold.
5
Knowledge Acquisition

Match new knowledge with existing knowledge.

KB
KB Are you trying to encode a conversion?
6
Rule-based Classification

Match rule antecedents with working memory. For
example, Course of Action (COA) critiquing.

Pattern
COA
This COA has a rating of good for enemy maneuver
engagement.
7
The Core Problem

Solving this matching problem is hard because
multiple encodings of the same knowledge rarely
match exactly.
Representations dont match exactly because
Expressive Ontology.
Knowledge is encoded by different sources.
Knowledge being encoded is complex.

8
Types of Mismatches

Informal examination of a knowledge-base
containing
Patterns.
COAs.
Knowledge-base was built by two Subject Matter
Experts (SMEs) participating in DARPAs RKF
project.
Looked for cases of mismatch.

9
Types of Mismatches (cont.)
an armored brigade engaging an armored
battalion.

Taxonomic Differences

10
Types of Mismatches (cont.)
One military unit attacking another unit.

Taxonomic Differences
Equivalent Alternatives

11
Types of Mismatches (cont.)
Mechanized infantry brigade engaging mechanized
infantry battalion.

Taxonomic Differences
Equivalent Alternatives
Omissions

12
Types of Mismatches (cont.)
Support attack occurs before main attack.

Taxonomic Differences
Equivalent Alternatives
Omissions
Granularity

13
Analysis of Existing Techniques

Analogy
Inexact Matching
Semantic Matching
Conceptual Indexing
Ontology Merging

14
Analogy

Analogy mapping of knowledge from a base domain
to a target domain.
Structure Mapping Engine (Forbus et. al. 89)
Maps relational knowledge (mappable systems).
Systematicity Principle used to select best
analogy.
Analogy based on common generalizations (Leishman
92)
Maps both relational knowledge and object
attributes.
Prefers minimal common generalization.

15
Analogy Structure Mapping Engine
16
Inexact Matching

Inexact Matching tries to address mismatches
between representations
Graph Editing (Tsai et. al. 83, Shapiro and
Haralick 81, Messmer et. al. 93, Wolverton et.
al. 2003)
Uses edit distance parameters.
Similarity based on shortest sequence of edits.
Partial Matching
Does not require representations to be
isomorphic.
Similarity based on amount of structural overlap.
Minimal Common Supergraph (Bunke et. al. 2000)
and Maximal Common Subgraph (Bunke and Shearer
98).

17
Inexact Matching MCS
18
Semantic Matching

Semantic Matching uses knowledge to match
representations.
Projection
Uses taxonomic knowledge.
Ontoseek (Guarino et. al. 99) and ELEN (Huibers
et. al. 96).
Projection Projection alone is too restrictive
??-projection (Genest and Chein 97).
Common generalization, graph splitting, regular
expressions (Fargues 92, Buche et. al. 2000,
Martin et. al. 2001).
Semantic Overlap
Maximal Joins and Generalizations (Myaeng 92,
Poole et. al. 95).
Shared Semantic Structures (Zhong et. al. 2002).

19
Semantic Matching Semantic Overlap
20
Conceptual Indexing

Conceptual indexing how to organize and index
knowledge.
Requires so form matching.
Generalization hierarchy (Bournard et. al. 95,
Ellis 92, Levinson 82, Woods 97).
Knowledge indexed by common generalizations.
Generalizations organized hierarchically by
subsumption relationships.
Retrieve Most Specific Subsumer (MSS) of a query.
Match procedure is similar to Projection -
suffers the same problems.

21
Ontology Merging and Translation

Ontology Merging merge multiple ontologies built
by different sources
Chimaera (McGuinness et. al. 2000)
SMART (Noy and Musen 99).
Ontology Translation translates a representation
from one language to another
Ontomorph (Chalupsky 2000).
Goals are different but share some of the same
problems.

22
Our Approach

The goal of this research is to solve the
matching problem.
We believe existing semantic approaches can be
extended with additional knowledge to
significantly improve matching.
What kinds of additional knowledge?
Transformations
Handle mismatches.
Improve matching.
Not taxonomic knowledge.

23
Our Approach (cont.)

Generality and domain-independence.
Want additional knowledge (e.g. Transformations)
to be useful across domains.
We believe domain-independence is possible given
a reusable domain-neutral upper ontology.
Contains a small set of general concepts.
SMEs use this upper ontology to build KBs on
specialized topics (e.g. chemistry, biology,
battle space planning).
No training in logic or knowledge representation.

24
Illustration of Our Framework
Ontology
Domain-independent KB for the task of matching.
KB can be viewed as a domain-specific matcher
(e.g. match symptoms to diseases).
25
Our Prototype

Extend semantic matchers with transformations.
Apply transformations in a forward-chaining
manner.
Use existing techniques for reasoning with
Conceptual Graphs (Corbett et. al. 99, Salvat et.
al. 96, Willems 95)
Projection.
Unification.
Graph rules.
Two caveats because existing techniques lead to
promiscuous matches.

26
Transformations that Retains Semantics
Projection
27
Transformations that Retains Semantics
28
Rule Applicability
29
Rule Applicability
30
Enumerating Transformations

Transformations derived from our domain-neutral
upper ontology.
Enumerated all ways that a relation can be
legally used to encode information in a
conceptual graph.
Considered whether the same information can be
expressed differently.
Enumeration was possible because
Small upper ontology.
Each concept had well-defined semantics.

31
Transformations Enumerated

We were able to enumerate about 300
transformations.
Resulting transformations fall into three general
categories
Transitivity
Part Ascension
Transfers Through

32
Transformations Enumerated (cont.)
relation Transitive Part Ascension Transfers Through
causes X - subevent, resulting-state
caused-by X subevent-of resulting-from
defeats - - -
defeated-by - subevent-of caused-by
enables X - causes, resulting-state, subevent
enabled-by X subevent-of caused-by, resulting-from
inhibits - subevent-of resulting-state
inhibited-by - subevent-of caused-by, resulting-from
by-means-of X - -
means-by-which X - -
prevents - subevent-of -
prevented-by - subevent-of caused-by, resulting-from
resulting-state - - causes
resulting-from - - -
33
Example Our Approach
l1
l1 (1,A)
M
(1,A)
34
Example Our Approach
35
Example Our Approach
36
Example Our Approach
37
Initial Evaluation

Used our matcher in an application in the domain
of battle space planning (DARPA's RKF Project).
The task is to analyze COAs.
Battle space ontology built by extending our
upper ontology.
Two military analysts used this ontology to build
KBs containing
Patterns.
COAs.
Our matcher matched the patterns to COAs.

38
Example Output
39
Experiment 1

Evaluates our first hypothesis.
How significant is the improvement?
Compared our matcher to
Maximal Common Subgraph (MCS).
Semantic Search Lite (SSL).
Methodology
300 domain-neutral transformations 80
domain-specific transformations.
Matched the patterns to the COAs.
A pattern matches a COA if the match score meets
or exceeds a pre-specified threshold.
Used metrics of precision and recall.

40
Experiment 1 Precision
41
Experiment 1 Recall
42
Experiment 2

Initial evaluation of our second hypothesis.
Assesses the domain independence of using
transformations.
Limited - conducted in only one domain, but can
still offer some insight.
Methodology
Divided transformations into 2 groups
(domain-neutral vs. domain-specific).
Used domain-neutral transformations to construct
DN
Used domain-specific transformations to construct
DS
Everything else is the same as Experiment 1.

43
Experiment 2 Precision
44
Experiment 2 Recall
45
Proposed Work

More Comprehensive Evaluation.
Use background knowledge.
Incorporate indexing to make matching more
efficient.

46
Comprehensive Evaluation

Evaluate our approach in several applications in
four domains.
Four data sets
Chemistry (Halo).
Biology (RKF).
Battle Space Planning (RKF).
Office Procedures (EPCA).
Three Applications
Elaboration Chemistry and Office Procedures.
Question Answering Biology and Battle Space.
Plan Evaluation Battle Space and Office
Procedures.

47
Background Knowledge

Background Knowledge.
Can be used to normalize new knowledge at
acquisition time via a join (Mineau et. al. 93).
Idea can be applied to matching.
Increase similarity.
Two problems
When should a join be performed?
How to better control the join?

48
Background Knowledge

Background Knowledge.
Can be used to normalize new knowledge at
acquisition time via a join (Mineau et. al. 93).
Idea can be applied to matching.
Increase similarity.
Two problems
When should a join be performed?
How to better control the join?

49
Indexing