Title: The Semantic Web: [You are here]
1The Semantic Web You are here
- Professor James Hendlerhttp//www.cs.umd.edu/hen
dler - Director, Semantic Web and Agent Technologies
- Maryland Information and Network Dynamics
Laboratory
2MIND SWAP
- Maryland Information and Network Dynamics
Laboratory, Semantic Web and Agents Project - J. Hendler
- B. Parsia
- Jennifer Golbeck
- Aditya Kalyanpur
- Grecia Lapizco-Encinas
- Katy Newton
- Corportate Research Partners
- Fujitsu Laboratory of America, College Park
- Lockheed Martin Advanced Technology Laboratories
- NTT Corp
- SAIC Corp.
- http//owl.mindswap.org Owl-powered Semantic Web
page (under development) - http//www.mindswap.org Traditional web page
- Evren Sirin
- Ronald Alford
- Ross Baker
- Amy Alford
- Matt Westhoff
- Michael Grove
- Ronald Reck
3Building the Semantic Web
4The World Wide Web
- Works reasonably well for single document texts,
or for finding sites based on single document
text - Cannot integrate information from multiple
documents - Cannot find things in databases, programs,
devices and sensors - Cannot ever get better (qualitatively)!
- Keyword-based IR will never really do better than
it does today (in satisfying user needs)
5What about the rest of the Web?
6Getting there The Semantic Wave
(Berners-Lee, 03)
7Whats the problem
Name Rank Serial Number
8XML is NOT semantics
9XML is NOT semantics
ltphotogt ltsubjectgt http//www.w3.org/timbl
lt/subjectgt ltnamegt Tim Berners-Leelt/namegt
lt/namegt lt/photogt
10XML is NOT semantics
Xml schema is DOCUMENT checking photo has
multiple subject fields photo has one physical
location etc.
ltphotogt ltsubjectgt http//www.w3.org/timbl
lt/subjectgt ltnamegt Tim Berners-Leelt/namegt
lt/namegt lt/photogt
11XML is NOT semantics
Xml schema is DOCUMENT checking photo has
multiple subject fields photo has one physical
location etc. WHICH SAYS NOTHING ABOUT TALKS,
SUBJECTS, PEOPLE, EVENTS, etc.
ltphotogt ltsubjectgt http//www.w3.org/timbl
lt/subjectgt ltnamegt Tim Berners-Leelt/namegt
lt/namegt lt/photogt
12The SEMANTICS is inthe links (e.g. to
ontologies)!
Eventtitle
ltdamlObjectProperty rdfID"photograph"gt ltrdfsdo
main rdfresource"Picture"/gt ltrdfsrange
rdfresource person"/gt lt/damlObjectPropertygt
EventWebPage
lt gt rdftype photoPhotograph, PhotoFile
http///imagesimage1, Phototopic
event1eventspeaker.
Event1 a Eventevent date May 7-11,
speaker http//timbl.html Title WWW
2002 TimBL rdftype w3c-ontperson name
Tim Berners-Lee
ltsClass rdfabout"http//www.semanticweb.org/ont
ologies/swrc-onto-2000-09-10.damlConference"gt lts
commentgt describes a generic conceptabout events
lt/scommentgt ltssubClassOf
rdfresource"http//www.semanticweb.org/ontologie
s/swrc-onto-2000-09-10.damlEvent"/gt ltadisjointFr
om rdfresource"http//www.semanticweb.org/ontolo
gies/swrc-onto-2000-09-10.damlWorkshop"/gt ltarest
rictedBy rdfresource"http//www.semanticweb.org/
ontologies/swrc-onto-2000-09-10.damlgenid18"/gt
ltrdfDescription rdfabout"http//www.w3.org/200
1/03/earl/0.95Person"gt ltrdftype
rdfresource"http//www.w3.org/2000/01/rdf-schema
Class"/gt ltrdfssubClassOf rdfresource"http//ww
w.w3.org/2001/03/earl/0.95Assertor"/gt lt/rdfDescr
iptiongt
13From the small and structured
ltdamlClass rdfabout"Book"gt
lt/damlClassgt ltdamlClass rdfabout"Author"gt
ltrdfssubClassO rdfresource"Person"/gt
lt/damlClassgt ltdamlProperty
rdfabout"age"gt ltrdfsdomain
rdfresource"Book"/gt ltrdfsrange
rdfresource"age_value_id"/gt
lt/damlPropertygt ltrdfsProperty
rdfabout"award"gt ltrdfsdomain
rdfresource"Book"/gt ltrdfsrange
rdfresource"award_value_id"/gt
lt/rdfsPropertygt
- Database schema for International Digitial
Childrens Library Metadata repository - 40 classes
- 25 properties (corresponding to database keys)
14To the large and complex
owlClass rdfID"CD80_Antigen"gt
ltrdfssubClassOf rdfresource"Cell_Adhesion_Mole
culelt/rdfslabelgt ltrdfslabelgtCD80
Antigenlt/rdfslabelgt ltrdfssubClassOfgt ltowl
Restrictiongt ltowlonProperty
rdfresource"Protein_Expressed_In_Tissue"/gt
ltowlsomeValuesFrom rdfresource"T_Cell"/gt
lt/owlRestrictiongt lt/rdfssubClassOfgt
ltrdfssubClassOfgt ltowlRestrictiongt
ltowlonProperty rdfresource"Protein_Expressed_I
n_Tissue"/gt ltowlsomeValuesFrom
rdfresource"B_Cell"/gt lt/owlRestrictiongt lt/rd
fssubClassOfgt ltrdfssubClassOfgt ltowlRestric
tiongt ltowlonProperty rdfresource"Protein_Ex
pressed_In_Tissue"/gt ltowlsomeValuesFrom
rdfresource"Macrophage"/gt lt/owlRestrictiongt
lt/owlClassgt ltCD80_Antigen rdfID"CD80_AntigenDa
ta"gt ltSwiss_ProtgtP33681lt/Swiss_Protgt
ltDEFINITIONgt lt!CDATAltdef-sourcegtNCIlt/def-so
urcegt ltdef-definitiongtA member of the
immunoglobulin supergene family .
Co-regulator of T cell activation with CD86.
Plays a critical role in autoimmune, humoral,
and transplant responses. Expressed on
activated T and B cells and on macrophages.
(from PROW)lt/def-definitiongtgt lt/DEFINITIONgt
- US National Cancer Institute Ontology
- 17,000 classes
- gt1 million synonyms and definitions
- RDF graph gt500,000 triples
15My Semantic Web
(I have nearly 200 daml/owl/rdf(s) files on my
laptop) -- March, 2003
16Our Semantic Web
- Jan 1, 03 Crawler finds 5.8M DAML statements on
20,000 web pages - Doesnt include many instance KBs tied to
ontologies - Doesnt include many very large RDFS-based KBs
that include some OWL - Ontology library at http//www.daml.org has 209
ontologies (March 2003) - Open for anyone to create
- Open for anyone to use
- OWL is being supported by large corporation labs
- Web tool developers IBM, HP, Sun, Intel, Fujitsu
- Content providers Daimler-Chrysler, Nokia,
Motorola, EDS, Agfa - OWL is starting to be used by thesaurus
developers - C.f. National Cancer Institute metathesaurus
released in OWL Lite - CALL Thesaurus in DAML
- NASA thesaurus available in DAML
- Use of semantic markup for Web Services beginning
to move beyond basic research - DAML-S cited as required reading for Web Services
Choreography WG - DAML-S being demoed to WSDL WG next week
17The incoming tide
18What comes next
19Tools are critical
Tools for marking up images, figures and other
non-textual data must be built in a way that
they tie into the "business processes" of the
working scientist -- that is, rather than
learning a whole new set of tools, the basic web
tools of the scientist must include mechanisms
that make it EASIER for the scientist to produce
web content (appropriately markedup) while
authoring papers, performing experiments,
creating and logging data, and the other day to
day activities of the working researcher.
Science and the Semantic Web, Hendler, 03
The same is true of the military user
20Automatically-organized portals
21New Info Management Paradigms
22And other cool stuff
23Tools for visualization and use
A number of tools for querying, browsing and
visualizing SW data are being developed
24Handle larger size/complexity
Tools are now scaling as well
25PhotoSMORE Marking up (sub)images
26PhotoSMORE Marking up (sub)images
27And the info flows
28And the info flows correctly
29Surfs up!
30Putting all this together
31Wrapping/Mapping tool
XML Schema (WSDL)
Define transformation functions
DAML Ontology (DAML-S)
Manual editing if necess.
Mappings for attributes
32is starting to work
Are there any mountains in Europe bigger than Mt.
Blanc?
document
service
database
33Ad hoc Programming
Advanced information management capabilities
Discovery, Filtering, Composition
34Ad hoc programming logic
35Distributed trust
36Conclusions
- The Semantic Web is real
- allows us to create models of information and
share these on the web - In a web-like open-system way
- In a way that exposes more semantics
- Vocabularies/ontologies first step, but they need
to be easier to create and use and modify and
crawl and query - Application development has begun!
- A new generation of Semantic Web tools is
becoming available - First prototypes are becoming available for end
to end functionality - Some components, and the OWL language, are ready
for prime time - The Semantic Wave
- Is an incoming tide