Title: XML Topic Maps ISO13250
1XML???---??????Topic Maps (ISO/13250)
- ?????????
- ???
- ????91?8?22?
2What is XML
- XML is a method for defining special markers or
tags that can be inserted into text to indicate
its logical structure and to make explicit the
meaning or rhetorical role of its component parts
3Why XML?
- HTML Containing information only about a pages
appearance. - ltH1gtThe future of the electronic scientific
literaturelt/H1gt - ltH3gtby John Smithlt/H3gt
- XML Document to be tagged with machine-readable
metadata - ltarticletitlegtThe future of the electronic
scientific literaturelt/articletitlegt - ltauthorgtltfirstnamegtJohnlt/firstnamegtltlastnamegtSmith
lt/lastnamegtlt/authorgt
4HTML VS. XML
- Latin phrase "Quid pro quo" in HTML
- ltigtQuid pro quolt/igt
- US Battle Ship in HTML
- ltigtUSS Constitutionlt/igt
- Latin phrase "Quid pro quo" in SGML
- ltforeign lang"latin"gtQuid pro quolt/foreigngt
- US Battle Ship Title
- ltname type"ship"gtUSS Constitutionlt/namegt
5XML Intelligence
- George Washington -- ????? ltname
typepersongtGeorge Washingtonlt/namegt - Washington, D.C.-- ??ltname typeplacegtWashington,
D.C.lt/namegt - Washington's Army -- ??????ltname
typeorggtWashington's Armylt/namegt - USS Washington -- ??????ltname typeshipgtUSS
Washingtonlt/namegt
6XML Structure for Journal Article
7XML-Structured DocumentforJournal Article
8What does that mean? (I)
- Every part of the document is not just
displayable, but also definable, including tables
and/or charts. - For example, in a scientific article, XML tags
can be used to distinguish the title of the
article from the names of its authors or the
cells in a table
9What does that mean? (II)
10XML -- Analytical
- Tag (Marker) -- standardize -- metadata
- Data Interchange -- Dublin Core
- System Communication -- OpenUrl
11XML -- Analytical -- Application
- Data Interchange
- ??????????(OAI??)
- MARC?XML?
- Machine Communication
- OpenUrl Resolver
12XML -- Structural
- Logical Structure
- Logical Relationship
13XML -- Structural -- Application
- XML in Hierarchical and Structural Context
- Metadata Within XML
- Knowledge Structure
- Knowledge Organization Tools
14Three General Categories of Knowledge
Organization
- Term List
- Emphasizing lists of terms with definitions
- Classification and Categorization
- Emphasizing the creation of subject sets
- Relationship List
- Emphasizing the connections between terms and
concepts
15Term Lists
- Authority files
- Glossaries
- Dictionaries
- Gazetteers
16Classification and Categorization
- Subject headings
- Classification schemes, taxonomies, and
categorization schemes
17Relationship Lists
- Thesauri
- Topic Maps
- Semantic network
- Ontologies
18Principles of Knowledge Organization
19Key Issues in the Principle
- Terms -- Subjects -- Concepts
- Subject Relationship -- Concept ? Concept ?????
20What is Concept(??)?
- ??
- ?????????,????????
- ??????????????, ??????????????
- ??
- ????? ????????????, ?????????????
- ????????????? ?, ??????, ???.
- ??-??????????????????
- ??????????????,?????
- ?????????????, ???.
- ???????????,?????
- ????????????????, ?, ?.
21??????????(I)
- ????
- ?????????????????????????,????????????
- ????
- ?????????????????????
- ??????????????,???????????????
- ??????????????,????????
22??????????(II)
- ????
- ????????????,???????????????????????,???????????
- ????
- ??????????????????,?????????????
- ?????????????????????????
23??????????(III)
- ????
- ????????????????????????????
24????????????
25?????-- ??-- ????????????K.O.?
26 27Topic Map ???
- Definition (T. A. O.)
- A set of Topics, Associations, Occurrence, Facet,
and Added Theme Elements that are used to Manage
a set of Terms relevant to a particular Knowledge
Domain.
28Topic Maps
- a Topic Map is a collection of topics and
(semantically meaningful) relationships between
these topics - Topic Maps link these topics with external
references, such as resources behind URLs - XTM serves as XML-based interchange format for
topic maps
29Topic Maps (contd)
- TMs are a superimposed semantic layer
- connection between topics and resources are URLs
- TMs capture real-world subjects/objects but also
concepts, like TCP or love - these are defined not absolute but relative to
each other
30Topic Maps (contd)
- can deal with incomplete knowledge
- I know that Prince Charles was married but I do
not know the name of his wife. - can be merged
- Maybe someone else knows that someone called
Dianna was married to a British Prince - merging maps by identifying common topics
31Topic Maps (contd)
- are supposed to deal with many thousand topics
- are built to denote information, not knowledge
(no semantic network) - are not built for a specific application but will
be reused in many different contexts
32(No Transcript)
33(No Transcript)
34(No Transcript)
35(No Transcript)
36(No Transcript)
37(No Transcript)
38(No Transcript)
39(No Transcript)
40What is Topic Map
- Information connection is not just web
hyperlinks, instead a structured semantic link
network over the resources -- easy and selective
navigation to the requested information.
41Elements of Topic Map
- Topics
- Association
- Occurrence and Resources
- Scope
42(No Transcript)
43(No Transcript)
44Reification (Definition)
- creation and/or identification of a subject
- this topic stands for the subject (proxy)
- in this process, a topic will be created and
- characteristics (name, ...) will be assigned
45Topics
- A topic can be any thing. Regardless whether it
exists or not, whether it is of physical nature
or just an idea or expression - Web resources (Stock Quotes, Documents ....)
- real world (someone, people, countries, ....)
- A topic can be any concept.
- Abstract idea (Happiness, Effectiveness)
- Each topic has an internal identification (id),
an external representation (baseName), can have
any number of external references (occurrence)
and any number of classification (instanceOf) - Topics are only representants they represent
(proxy) the subject the subject itself exists
outside the topic map -- This is what a subject
is reified by a topic means and why
subjectIdentity element is proposed.
46Topic (Example)
47(No Transcript)
48Topic Names
- every topic has an unique id within a map
- this id is for internal use only
- every topic can have (one or more) names
- this name is visible to end users
49Topic Name -- BaseName
- The ltbaseNamegt element specifies a topic name
- A topic name is represented by one string the
content of the ltbaseNameStringgt child of
ltbaseNamegt - The context within which the assignment of a name
to a topic is valid may be expressed using a
ltscopegt child element. - A topic may have multiple base names in the same
and/or multiple scopes.
50Topic Name -- Variant
- The ltvariantgt element is an alternate form of a
topic's base name appropriate for a processing
context specified by the variant's ltparametersgt
child element - A variant name whose parameters include the
display or sort published subjects, which is
semantically equivalent to display names and sort
names (respectively) as defined in ISO 13250.
51Variants
- variants are names for a specific purpose and/or
in a specific format - name, as it should
- appear on a mobile display
- logo on blackwhite screen
- high resolution
- low resolution
- be used for sorting
52Variants (contd)
- external representations
- organized as a tree
- parameters control which variant will be used
53Topic Types
- any topic can have any number of types
- every type is itself a topic
- either within the same map
- lttopicRef xlinkhrefuniversity/gt
- then university must be a defined topic
- or defined via some other document
- ltsubjectIndicatorRef
- xlinkhrefhttp//www....../all/about/unis.
html/gt
54Topic Types (contd)
- topic types introduce a type hierarchy
- every topic map has its own type hierarchy
- there is NO global type system (ontology)
55Topic -- instanceOf
- The ltinstanceOfgt element specifies the class to
which its parent belongs, via a lttopicRefgt or
ltsubjectIndicatorRefgt child element. - The ltinstanceOfgt element is a syntactic shortcut
for an association of a special type defined by
the class-instance published subject.
56Topic Types (Example)
- lttopic idbond-unigt
- ltinstanceOfgt
- lttopicRef xlinkhrefuniversity/gt
- lt/instanceOfgt
- ltbaseNamegt
- ltbaseNameStringgtBond Universitylt/baseNameSt
ringgt - lt/baseNamegt
- ltoccurrencegt
- ltresourceRef xlinkhrefhttp//www.bond.edu
.au//gt - lt/occurrencegt
- lt/topicgt
57Topic -- SubjectIdentity
- The ltsubjectIdentitygt element specifies the
subject that is reified by a topic, via
ltresourceRefgt, ltsubjectIndicatorRefgt, and/or
lttopicRefgt child elements. - When a topic has an addressable subject, the
subject can be addressed directly via a
ltresourceRefgt element. In that case, it is the
resource itself which is considered the subject
of the topic, not what the resource means or
indicates. There can be only one such resource
per topic. - Resources may also be subject indicators, as
opposed to subjects in and of themselves.
Resources are used to indicate subjects via
ltsubjectIndicatorRefgt elements, of which there
may be more than one per topic. - A topic may also indicate that it has the same
subject as another topic by addressing that topic
via a lttopicRefgt element.
58Associations
- topics can participate in relationships, called
association, in which topics play roles as
members - Among the associations, which relationship two or
more topics have to each other. It must be
explicitly defined. - topics play there
- Members the topics involved in the association
are called members - Role and the members play the role.
- typical associations
- is-located-in, lived-in, written-by
- is-facillity-provided-by, requires-to-have
59Association (Example)
60Associations (contd)
- all newly introduced topics has to be defined
- is-located-in, building, location
- also these topics can be linked with associations
- associations can have any number of members (1,
2, 3, ...)
61Topic Occurrences
- reference external resources
- documents via URLs
- http//www....../where/is/the/document.pdf
- defined by IANA/ICANN via URNs
- urninetbond.edu.autech_report01
- not defined, but globally unique
- ???????
- urnmy-social-security-numbers1234-5678-9
- a topic can have any number of resources
62Topic Occurrences (contd)
- lttopic idbond-unigt
- ltbaseNamegt
- ltbaseNameStringgtBond Universitylt/baseNameSt
ringgt - lt/baseNamegt
- ltoccurrencegt
- ltresourceRef xlinkhrefhttp//www.bond.edu
.au//gt - lt/occurrencegt
- lt/topicgt
63Scopes
- not all topic characteristics are valid in all
contexts - scopes limit a characteristic
- scopes are topics themselves
64Scopes (contd)
- occurrences
- a web document could be written in german
- the document is not for a beginner, but an
expert - a visa to visit a country is not relevant for
residents, only for non-residents - names
- the document writes about trees in computer
science but not about trees in agriculture
65Scopes (contd)
- associations
- Santa Clause brings the presents is good enough
for children, but not for adults
66Scopes (contd)
- if no scope was defined, then the characteristic
is valid in ALL scopes - ? unconstrained scope
67Scopes (Example)
68mergeMap
- A ltmergeMapgt element references an external
lttopicMapgt element through an xlinkhref
attribute containing a URI. - lt!ELEMENT mergeMap ( topicRef resourceRef
subjectIndicatorRef ) gt - lttopicRefgt
69(No Transcript)
70Topic Map-DTD
71Topic Map-XML
72Topic Map-XSL
73(No Transcript)
74(No Transcript)
75(No Transcript)
76(No Transcript)
77(No Transcript)
78(No Transcript)
79(No Transcript)
80(No Transcript)
81(No Transcript)
82(No Transcript)
83(No Transcript)
84(No Transcript)
85(No Transcript)
86(No Transcript)
87(No Transcript)
88(No Transcript)
89(No Transcript)
90(No Transcript)
91(No Transcript)
92(No Transcript)
93(No Transcript)
94(No Transcript)
95(No Transcript)
96(No Transcript)
97Topic Maps Limitation
- XML-enabled Database and Search Engine
- Association is only Part of Relationship
(Non-directional Relationship)
98(No Transcript)
99(No Transcript)
100Database-supported Topic Maps
101What is an ontology
- Philosophy Theory of existence
- An ontology is an explicit specification of
objects and relations in the target world
intended to share with the community and to use
for building a model of the target world - It is a taxonomy of concepts
102Ontology
- To support the sharing and reuse of formally
represented knowledge, it is useful to define the
common vocabulary in which shared knowledge is
represented. A specification of a
representational vocabulary for a shared domain
of discourse -- definitions of classes,
relations, functions, and other objects -- is
called an ontology.
103Ontology
- Ontology is a Specification of a
Conceptualization - Ontology
- a formal explicit description of concepts and
relationship in a domain of knowledge - Class -- Concepts
- Slot (roles, properties) -- Features
attributes of Concepts - Facet (role restriction)
- Subject Description Analysis
- Relationship
- Vertical Horizon
- Hierarchical Structured
- Semantic and Conceptual Relationship
104Ontology
- Classes describe concepts in the domain
- A class can have subclasses that represent
concepts that are more specific than the
superclass - An ontology together with a set of individual
instances of classes constitutes a knowledge base - Ontology ends and the knowledge base begins
105Components of an ontology
- Concepts
- Taxonomy of the concepts
- Relations among concepts
- Formal specification of the concepts and relations
106Ontology has
- A common vocabulary
- An explicit representation of thing
(conceptualization) usually left implicit behind
a system - An explicit representation of a shared
understanding of the target world
107Ontology Engineering
- Ontology Engineering Defining terms in the
domain and relations among them - Defining concepts in the domain (classes)
- Arranging the concepts in a hierarchy
(subclass-superclass hierarchy)-- (taxonomy) - Defining which attributes and properties (slots)
classes can have and constraints on their values - Defining individuals and filling in slot values
108Ontology
- What is Ontology?
- ???????????????????
- ????????????????
- ????????????????????
- ???????? ?? ?????
- ????????
- ???????????
109ontology
- Concept Relation Instances
- subConceptOf relation domain
instance - domain
relation -
domain
110concept
- Research Project
- Full Professor(AcademicStaff)
- PhDStudent
- ontology
111Relation
- worksAtProject
- Supervises
- Supervisor
- ontology
112Instances
- Rudi
- Concept FullProfessor
- Relation 1. Supervises York
- 2. Name Rudi Studer
- York
- Concept PhDStudent
- Relation 1. worksAtProject On-To-
- Knowledge
- 2. Name York Sure
- 3. Supervisor Rudi
- On-To-Knowledge
- Concept ResearchProject
- Relation 1.name On-To-Knowledge
ontology
113Rudi
supervises
worksAtProject
supervisor
York
On-To-Knowledge
worksAtProject
ontology
114(No Transcript)
115(No Transcript)
116(No Transcript)
117Gene Ontology
- Gene Ontology ???
- Life Science is Knowledge base instead of
Axiom base - Knowledge of the biological role of proteins in
one organism can often be transferred to other
organisms. - ??????????, ???? Ontology??????
- ?? ??? Protein ? DNA ?, ??????????????
118Gene Ontology
- Gene Ontology ??????
- Biological process ????
- Molecular function ????
- Cellular component ????
119Gene Ontology
Cellular Component
??????
Biological Process
Molecular Function
120(No Transcript)
121(No Transcript)
122(No Transcript)
123(No Transcript)
124(No Transcript)
125(No Transcript)
126Upgrading the Application of XML in Knowledge
Organization
- HTML Web Content Display
- XML
- Data Interchange
- Web Content Representation
- Elements for the Object Attributes
- Elements for the Object Attributes with Structure
- Elements for the Object Attributes with Structure
and Semantic meaning
127Some links
- Cycorp Cyc Knowledge Server for artificial
intelligence-based Common Sense
http//www.cyc.com/ - UMLS - Unified Medical Language System (UMLS) of
the National Library of Medicine (NLM). - What is an Ontology? http//www-ksl.stanford.edu
/kst/what-is-an-ontology.html - Ontology.org http//www.ontology.org/
- Ontology internet links (complet)
http//saussure.irmkant.rm.cnr.it/onto/link.html - GENE ONTOLOGY CONSORTIUM http//www.geneontology.
org/ - Descriptive and Formal Ontology
http//www.formalontology.it/ - Ontology projects http//www.kr.org/top/projects
.html - The InterMed Project http//camis.stanford.edu/p
rojects/intermed-web/ - The Bio-ontologies Working Group
http//smi-web.stanford.edu/projects/bio-ontology
/ - Java Ontology Browser http//igd.rz-berlin.mpg.d
e/www/oe/mbo.html - The GO Browser http//www.informatics.jax.org/go
/go_browser_help.shtml - Synchronous editing of an ontology tools
http//www.swi.psy.uva.nl/wondertools/html/wonder
tools.html - XOL - Ontology Exchange Language
http//www.ai.sri.com/pkarp/xol/ - The Ontology Inference Layer OIL
http//www.ontoknowledge.org/oil/ - Line's Ontology Resource http//www.cs.utk.edu/
pouchard/onto/ - Formal ontology and conceptual analysis
http//www.ladseb.pd.cnr.it/infor/ontology/Papers
/Ontobiblio/TOC.htmla