Title: Ontology Development
 1 - Ontology Development 
- To Support 
- AQUA Question-Answering 
- Richard Fikes 
- Jessica Jenkins Bill MacCartney Rob McCool 
 Deborah McGuinness
- Knowledge Systems Laboratory 
- Stanford University 
- www.ksl.stanford.edu
2KSL and the WMD Coalition
- Tools for ontology creation, evolution, and 
 maintenance
- Coalition teams have adopted KSLs Ontolingua and 
 Chimaera as a standard for ontology development,
 maintenance, and analysis
-  (Additionally, we use other internal tools  
 JTP, DQL, IW)
- Initial evaluation and ongoing support 
- KSL evaluated initial stage of WMD ontology using 
 our Chimaera tools, reviewed findings with other
 teams, and taught others how to use the tools
 themselves
3KSL and the WMD Coalition
- Tools for ontology creation, evolution, and 
 maintenance
- Initial evaluation and ongoing support 
- Knowledge representation and consulting work 
- KSL providing some core new KB development for 
 CNS core  Russian naval facilities and Newly
 Independent States facilities  and is augmenting
 this with extraction results
- Providing consultation on sources / merging 
 opportunities -Counter-terrorism KBs built for
 DARPA (HPKB, ISX-Cyladian- HORUS, SAIC-Cycorp,
 ) semantic web for the military (SWMU ontology
 tutorial), ontologies for information fusion, etc.
4KSL and the WMD Coalition
- Tools for ontology creation, evolution, and 
 maintenance
- Initial evaluation and ongoing support 
- Knowledge representation and consulting work 
- Knowledge extraction 
- KSL knowledge extraction tools for RNF and NIS 
- New focus on utilizing other important useful KB 
 sources
- SUMO is a core ontology for ontology sharing 
 3,900 axioms relations and sets processes and
 objects temporal, spatial, and mereological
 relations agents, etc.
- Domain ontologies WMDs, terrorism, biological 
 viruses,
- Written in SUO-KIF, a proprietary dialect of KIF 
- Published by Teknowledge under GNU public license 
 as part of IEEE SUO working group
 (ontology.teknowledge.com/)
5SUMO
- SUMO requires translation to be used with any 
 reasoner
- KSL has successfully translated SUMO to 
 plain-vanilla KIF
- Full translation complete semantic content 
 retained
- Highly portable should be fully compatible with 
 most FOL reasoners
- Accurate most test queries demonstrably 
 answerable from translated axioms
- However, the result is not yet fully usable 
- Most test queries not answered from full axiom 
 set in reasonable time
- SUMO was not designed for efficient automated 
 reasoning
- Solution a smarter translator, and some SUMO 
 brain surgery
- Translation to DAML may provide another path 
- The existing translation is quite lossy, but 
 enables some query-answering
- Further work will enable a more complete and 
 accurate translation
6DAML Versions of SUMO, WMD, and Terrorism
- DAML translations of SUMO, WMD, and 5 
 terrorism-related ontologies and knowledge bases
 (KBs) provided by Teknowledge
7DAML Versions of SUMO, WMD, and Terrorism
- A few simple translations were used by 
 Teknowledge to translate original KIF content to
 DAML
- Example subrelation
 lObjectProperty
(subrelation father parent)
-  Example KIF triples to RDF triples
 rdfresourceSpain/ 
(part MadridSpain Spain) 
 8DAML Versions of SUMO, WMD, and Terrorism
- Teknowledges DAML files provide a great starting 
 point, but there are a few problems
- Syntactic errors and issues with resolving 
 references across files -- these problems are
 easy to fix.
- A large amount of the original KIF content was 
 dropped in the translation to DAML.
 Reincorporating some of this content is trivial,
 but it is generally nontrivial.
Example (trival) TransitiveRelation  
(instance part TransitiveRelation) 
 9KIF - DAML Example (nontrivial)
Original KIF
( (and (instance ?SUBSTANCE BiochemicalAgent) (
possesses ?AGENT ?SUBSTANCE)) (capability 
BiochemicalAttack agent ?AGENT))
Translation of capability ternary to binary 
relation
 rdfIDcapabilityRole rdfresourceCapability/ rdfresourceCaseRole/ operty rdfIDcapabilityProcess   
 Property 
 perty 
 10KIF - DAML Example (nontrivial) cntd.
Original KIF
( (and (instance ?SUBSTANCE BiochemicalAgent) (
possesses ?AGENT ?SUBSTANCE)) (capability 
BiochemicalAttack agent ?AGENT))
Translation of ?AGENT has BiochemicalAttack 
capability if it possesses a BiochemicalAgent
ability rdfresourcesumoagent/ 
Attack/ rdfIDAgentsWithBiochemicalAttackCapability  
 y/ micalAttackAgentCapability/   
 rdfresourcesumopossesses/    
 11Query-Answering Example 1
- What has the capability of being the agent of a 
 biochemical attack?
- Query pattern (capability ?agt 
 Biochemical-Attack-Agent-Capability)
- Knowledge in the ontology 
- A thing is an Agent-With-Biochemical-Attack-Capab
 ility if and only if it
- Has a capability Biochemical-Attack-Agent-Capabil
 ity or
- Possesses a Biochemical-Agent 
- An Agent-With-Biochemical-Attack-Capability has 
 capability Biochemical-Attack-Agent-Capability
- A Nerve-Agent is a Biochemical-Agent 
- If AGT has capability Biochemical-Attack-Agent-Ca
 pability, then AGT is capable of being an
 agent in a Biochemical-Attack
- If C is the capability of playing role R in 
 processes of type PT, and AGT is known to have
 played role R in a process of type PT, then AGT
 has capability C
- Knowledge from documents 
- Al-Qaida is a Foreign-Terrorist-Organization 
 that possesses a Nerve-Agent
- Aum-Supreme-Truth-Chemical-Attack-27-Jun-94 is a 
 Chemical-Attack whose agent is
 Aum-Supreme-Truth
12Query-Answering Example 1
- What has the capability of being the agent of a 
 biochemical attack?
- Query pattern (capability ?agt 
 Biochemical-Attack-Agent-Capability)
- Answer Al-Qaida 
- Al-Qaida is a Foreign-Terrorist-Organization 
 that possesses a Nerve-Agent from
 documents
- A thing is an Agent-With-Biochemical-Attack-Capab
 ility if and only if it
- Has a capability Biochemical-Attack-Agent-Capabil
 ity or
- Possesses a Biochemical-Agent from 
 ontology
- An Agent-With-Biochemical-Attack-Capability has 
 capability Biochemical-Attack-Agent-Capability
 from ontology
- If AGT has capability Biochemical-Attack-Agent-Ca
 pability, then AGT is capable of being an
 agent in a Biochemical-Attack from ontology
- A Nerve-Agent is a Biochemical-Agent 
 from ontology
13Query-Answering Example 1
- What has the capability of being the agent of a 
 biochemical attack?
- Query pattern (capability ?agt 
 Biochemical-Attack-Agent-Capability)
- Answer Aum-Supreme-Truth 
- Aum-Supreme-Truth-Chemical-Attack-27-Jun-94 is a 
 Chemical-Attack whose agent is
 Aum-Supreme-Truth from documents
- Playing the role agent in a Biochemical-Attack
 requires the capability Biochemical-Attack-Agent
 -Capability from ontology
- If playing role R in a process of type PT 
 requires capability C, and Agt plays role R in a
 process of type PT, then Agt has capability C
 from ontology
- Aum-Supreme-Truth has capability 
 Biochemical-Attack-Agent-Capability
14Query-Answering Example 2
- Who are the agents of attacks that used the same 
 type of weapons as Recent-Attack-001?
- Query pattern (type Recent-Attack-001 ?res) 
 (onProperty ?res instrument) (hasClass ?res
 ?inst-type) (type ?attack ?res) (agent ?attack
 ?agt)
- Must-bind variables ?agt ?attack 
- Knowledge in the ontology 
- A Mortar-Attack has an instrument of type 
 Mortar
- Knowledge from documents 
- Recent-Attack-001 is a Thing that has an 
 instrument of type Mortar
- Revolutionary-Armed-Forces-Of-Colombia-Mortar-Att
 ack-1-Jul-00 is a Mortar-Attack that has agent
 Revolutionary-Armed-Forces-Of-Colombia.
- Answer Revolutionary-Armed-Forces-Of-Colombia
15AQUA Program Plan
- Overview of the project 
- Goal is to create a system that can answer 
 complex questions
- With plus up funding, we now have an end-to-end 
 system. Makes use of KSLs Ontolingua Knowledge
 Server and Java Theorem Prover (JTP) to develop
 answers to queries
- Uses SAIC and other technology to automatically 
 populate KBs with information from new text
 sources
- Uses multiple extractors from multiple sources to 
 answer queries
- KSL extractor 
- UMBC/NMSU extractor 
- IBM extractor
16AQUA Current Plans
NL QUESTION
MOQA Text ? TMR Translator
A
SAIC TMR ? KIF Mapper/Translator
C
MOQA NL ?TMR Query Processor
B
CNS TEST DATA
IBM Text ? KIF Translator
 KIF-Formatted Question 
 Ontolingua Knowledge Server ---------------- JAVA
 Theorem Prover 
 KIF Answer/ Proof tree 
 KSL Extractor 
D
E
SAIC KIF ? TMR Mapper/Translator
 KSL generated explanation 
MOQA TMR ? NL Answer Processor
NL ANSWER 
 17AQUA Initial Concept
SAIC Interlingua? KIF Translator
NMSU Query Processor
QUESTION
Interlingua Query
NL Query
KIF Query
Interlingua Answer
KIF Answer
NL Answer
SAIC KIF ? Interlingua Translator
NMSU NL Generator
ANSWER
KSL Java Theorem Prover 
 18Key Tasks - SAIC 
- Perform translation of Onyx/UMBC extracted TMRs 
 to KIF (Item A)
- Align two disparate ontologies 
- Translate terms once aligned 
- Both formalized queries and extracted text need 
 to be translated
- Develop CNS WMD ontology 
- Co-ordinate subcontractors and develop system 
 interfaces
19Key Tasks - Onyx
- Provide formalized translation of NL queries 
 (MOCA  item B)
- Perform extraction of CNS data into text (MOCA  
 item A)
20Key Tasks - IBM
- Assist in relations extraction from text into WMB 
 ontology
21KSLs Current Activities
- JTP  Hybrid reasoning for query answering 
- Includes a temporal reasoner 
- Is a DQL (DAML Query Language) server 
- Knowledge Base Partitioning  Enabling Q-A from 
 large scale KBs using parallel heterogeneous
 reasoners
- Inference Web  Providing understandable 
 explanations for derived query answers
- Knowledge extraction from semi-structured 
 documents
- Tables, lists, outlines, property-value pairs, 
 etc.
22SAIC Current Activities 
- SAIC 
- In-house Ontolingua server with JTP now installed 
 and in use in development efforts
- Ontology is available as part of demonstration in 
 the demo rooms
- Please visit the SAIC/KSL demo stand
23SAIC Current Activities (cont.)
- SAIC spearheading a federation of a WMD ontology 
 development effort, assisted by Stanford KSL
-  Begun development of CNS ontology. Ontology is 
 currently 700 terms and viewable in our in-house
 version of Ontolingua. (Demo available)
24SAIC Current Activities (cont.)
- Discussions underway with Sergei to put Onyx 
 under subcontract to SAIC. Subcontract to go out
 as soon as possible.
- Labor division is defined and agreed to 
- Major issue  Due to subcontract issues Onyx is 
 still not under subcontract. This affects Q?A
 ayatem development rates as this task is on the
 critical path for system development.
- Distributed ontology to KSL and IBM. 
- Development of the ontology is critical in order 
 to allow the extractors to function appropriately
25WMD Ontology Creation Initial -Confederation 
Assignments
- Stanford/KSL NIS-Facilities (439 terms) and 
 Russian-Naval-Facilities (365 terms)
- IBM MPT-Topic (771 terms) 
- Xerox-Parc Missiles-Topic (765 terms) 
- Tecknowledge NIS-Nuclear-Weapons-Aggregate (219 
 terms)
- Battelle Nuclear-Safety-Assistance (36 terms)
26Year Two Project Goals
- Complete CNS ontology development 
- Participate in TREC 
- System is still immature 
- Novel appoach 
- Significant potential for further development 
- Refine interfaces and determine system metrics 
 to ensure maximum performance in future system
 iterations
27TREC participation
- SAIC is signed up for TREC participation this 
 year.
- A multi-pronged approach is possible with the 
 current architecture
- With the SAIC/Onyx route and NL interface, gives 
 the initial capability for an end-to-end system
 with restricted domain and range
- Formatted queries possible for IBM extraction 
- System will be very immature in year 1 and likely 
 achieve poor TREC scores, but will mature in
 multiple and novel directions over time
28Future Plans
- Continue multi-pronged approach (running multiple 
 extractors over a uniform KB)
- Plan further enhancements (Possibly add more 
 extractors or reasoners
- Leverage multiple KB approach to optimize 
 research in multi-partition reasoning
- Develop effective metrics to determine efficacy 
 of this approach and which pathways are optimal
29Future plans (Cont)
- Work on implementing latter Proof tree to NL 
 mocha interface in the future (Reverse TMR to
 KIF)
- Transition from KIF to DAML format where possible 
- Extend range and capabilities of question 
 answering. Initial participation will be limited
 in terms of domain and range of questions.