Title: Exploring Digital Libraries: Integrating Browsing, Searching, and Visualization
1Exploring Digital Libraries Integrating
Browsing, Searching, and Visualization
- Paper by Rao Shen, Naga Srinivas Vemuri, Weiguo
Fan, Ricardo da S. Torres, Edward A. Fox
Slides by fox_at_vt.edu http//fox.cs.vt.edu
with some modifications by lillian.cassel_at_villanov
a.edu Original version presented at JCDL 2006
2Acknowledgements (Selected)
- Sponsors NSF grant ITR-0325579, ASOR, CWRU,
ETANA, Vanderbilt U., Virginia Tech - Faculty/Staff Lillian Cassel, Debra Dudley,
Manuel Perez, - VT (Former) Students Marcos A. Gonçalves, Doug
Gorton, Aaron Krowne, Ming Luo,
3Introduction
- Whats exploring?
- searching, browsing, investigating, studying, or
analyzing for purposes of discovery, pursuing
truth or facts about something - Are browsing and searching duals or can they be
converted to each other when certain conditions
are met? - Can we generalize these DL exploring services
within a formal DL framework? - Can the formal generalization guide development
of exploring services for domain focused DLs?
4Related Work on Integrating Services in DLs
integrating searching and browsing
found in
systems In 1980s
systems In 1990s
systems In 2000s
has an example
has an example
has an example
I3R
RABBIT
DataWeb
PESTO
SenseMaker
MIX
ScentTrails
BBQ
CODER
ODL
MARIAN
5Exploring Services Formalization
- Generalize DL exploring services such as
browsing, searching, clustering, and
visualization - Exploration Space (Espa) is a Space
- Espa(Q, Contents, OP_Set)
- Q is a set of conceptual representations for user
information needs - Contents associated with collection C
- OP_Set is a set of operations on Q and Contents
- OPviz, OPclu, OPs, OPb OP_Set
6Exploring Services Formalization (Cont.)
- Sample OP_Set OPviz , OPclu, OPs, OPb
- OPviz maps a set of digital objects to a visual
mark - OPclu gets similarity of a pair of subsets of
collection and their associated contents - OPs associates a query with a digital object
and its contents - OPb associates a traverse link with contents of
the target node (i.e., follows a hypertext link)
7Exploring Services Formalization (Cont.)
- An Exploring Service (Eser) is a set of scenarios
over an exploration space (Espa). - Eser(sc1, sc2, , sci, , scn),
- where sci is a sequence of events
- each event is associated with one or more of the
operations in Espa
8Exploring Services Formalization (Cont.)
State Diagram
OP_Set
Searching Ops Browsing Opb Clustering
Opclu Visualization Opviz
ei
9Reading the paper
10Overview first
- What is this paper about?
- What is the main point or are the main points?
- What is the structure of the paper?
- Is this what you would expect to see in any
well-organized conference paper? - Are there sections that are specific to this
project?
11Related work
- What is the oldest work cited?
- What is the most recent work cited?
- How large a body of work contributed to this
project? - How closely related are the works that this paper
cites? - Is there a good reason for each reference?
- How many of the cited works are by the same
author (or some of the same authors) or from the
same research laboratory?
12Definitions and notations
- What specific terms are defined?
- Are these general terms that have particular
meanings in this paper or are they new terms with
no known meaning? - Are there general terms defined, or notations
that provide shorthand for use in the later
discussions? - List the terms. Discuss the meaning of each with
a classmate. Is anything unclear? Try to answer
each others questions or formulate a question
for the class to address together. - Note the reference 10 is one we read earlier
in the semester.
13- Definition 2 A structure is a tuple (G,L, F),
where G(V,E) is a directed graph with vertex set
V and edge set E, L is a set of label values, and
F is a labeling function F (V ? E) ? L - Definition 16 A digital object is a tuple do
(h, SM, ST, Structured-Streams) where - h ? H, where H is a set of universally unique
handles (labels) - SM sm1, sm2, , smn is a set of streams
- ST st1, st2, , stm is a set of structural
metadata specifications - StructuredStreamsstsm1, stsm2, , stsmp is a
set of StructuredStream functions defined from
the streams in the SM set (the second component)
of the digital object and from the structures in
the ST set (the third component.)
14The operations
- Exactly what operations are defined and are of
interest in this paper? - What relationships exist between and among the
operations defined?
15New understanding
- The paper states Our theory-based approach to
describing DL exploring services allows us to
understand browsing and searching in a new way. - What are all the exploring services discussed in
the paper? - How are these explored in the context of the
ETANA-DL (which we initially looked at early in
the semester).
16The Authors presentation
- The paper as presented by the author
- Slides provided by Dr. Edward A. Fox
17Exploring Digital Libraries Integrating
Browsing, Searching, and Visualization
Excerpt from JCDL 2006, Chapel Hill, NC, June 12,
2006 Rao Shen, Naga Srinivas Vemuri, Weiguo Fan,
Ricardo da S. Torres, and Edward A.
Fox fox_at_vt.edu http//fox.cs.vt.edu
Some adaptations by lillian.cassel_at_villanova.edu F
or CSC 9010-Special Topics - Digital Libraries
and other Web-based information presentation
18Acknowledgements (Selected)
- Sponsors NSF grant ITR-0325579, ASOR, CWRU,
ETANA, Vanderbilt U., Virginia Tech - Faculty/Staff Lillian Cassel, Debra Dudley,
Manuel Perez, - VT (Former) Students Marcos A. Gonçalves, Doug
Gorton, Aaron Krowne, Ming Luo,
19Introduction
- Whats exploring?
- searching, browsing, investigating, studying, or
analyzing for purposes of discovery, pursuing
truth or facts about something - Are browsing and searching duals or can they be
converted to each other when certain conditions
are met? - Can we generalize these DL exploring services
within a formal DL framework? - Can the formal generalization guide development
of exploring services for domain focused DLs?
20Related Work on Integrating Services in DLs
integrating searching and browsing
found in
systems In 1980s
systems In 1990s
systems In 2000s
has an example
has an example
has an example
I3R
RABBIT
DataWeb
PESTO
SenseMaker
MIX
ScentTrails
BBQ
CODER
ODL
MARIAN
21Exploring Services Formalization
- Generalize DL exploring services such as
browsing, searching, clustering, and
visualization - Exploration Space (Espa) is a Space
- Espa(Q, Contents, OP_Set)
- Q is a set of conceptual representations for user
information needs - Contents associated with collection C
- OP_Set is a set of operations on Q and Contents
- OPviz, OPclu, OPs, OPb OP_Set
22Exploring Services Formalization (Cont.)
- Sample OP_Set OPviz , OPclu, OPs, OPb
- OPviz maps a set of digital objects to a visual
mark - OPclu gets similarity of a pair of subsets of
collection and their associated contents - OPs associates a query with a digital object
and its contents - OPb associates a traverse link with contents of
the target node (i.e., follows a hypertext link)
23Exploring Services Formalization (Cont.)
- An Exploring Service (Eser) is a set of scenarios
over an exploration space (Espa). - Eser(sc1, sc2, , sci, , scn),
- where sci is a sequence of events
- each event is associated with one or more of the
operations in Espa
24Exploring Services Formalization (Cont.)
State Diagram
OP_Set
Searching Ops Browsing Opb Clustering
Opclu Visualization Opviz
ei
25Theory-based approach to describing DL Exploring
Services guides us to design and implement
exploring services for ETANA-DL
- Multi-dimensional browsing
- Searching and browsing integration
- Visualization
- Usability evaluation
26An Integrated DL
- Etana brings together several separate and
different collections of materials into an
integrated DL. - Virtual Nimrin (http//www.case.edu/affil/nimrin/m
enu/nimrin.htm) - Madaba Plains (http//www.madabaplains.org/home.ht
ml) - Lahav Website (http//www.cobb.msstate.edu/dig/lah
av/) - Megiddo (http//www.tau.ac.il/humanities/archaeolo
gy/megiddo/index.html) - And others
27ETANA-DL approach
- Applying and extending Digital Library (DL)
techniques to solve key problems making primary
data available, data preservation, and
interoperability - Modeling archaeological information systems using
5S to better understand the domain and design the
system and the supporting services - Rapidly prototyping DLs that handle heterogeneous
archaeological data using componentized
frameworks - eliciting requirements
- refining metamodel and union schema
- modeling sites
- mapping
- harvesting
- providing useful services
28ETANA-DL ArchitectureDigBase and DigKit
Search
U S E R I N T E R F A C E
D A T A B A S E W R A P P E R S
Lahav
Browse
Nimrin
Recommend
Umayri
ETANA-DL UNION CATALOG
Note
Hisban
Personalize
Review
Megiddo
Visualizations
Jalul
Archaeology Specific
New Sites
Work in progress
29ETANA-DL Website
http//digbase.etana.org8080/etana/servlet/Start
30(No Transcript)
31Exploring Service in ETANA-DL Multi-dimensional
Browsing
32(No Transcript)
33(No Transcript)
34(No Transcript)
35Searching and Browsing Integration
36(No Transcript)
37(No Transcript)
38(No Transcript)
39The Important Point
- These are independent digital libraries or
databases - The idea is to give an appearance of a single,
integrated site with access to all the
information in all the sources. - Harvesting -- OAI- PMH
- Then how to make it all appear like one
collection of materials?
40DL Integration
- What is DL Integration
- Hide distribution
- Hide heterogeneity
- Enable autonomy of individual component
- Why Integration
- island-DLs
- inability to seamlessly and transparently access
knowledge across DLs
Use various autonomous DLs in concert
413 new sites
2 new types of artifacts
42EtanaViz Initial Interface
43EtanaViz Bone records from Nimrin
44EtanaViz Total Number of Animal Bones across
Nimrin Culture Phrases
45EtanaViz Percentages of Animal Bones across
Nimrin Culture Phrases
46Impression about ETANA-DL services
Browse Search EtanaViz Save navigation path (SNP) Search within browsing context (SWBC)
4.0 4.0 4.0 4.5 4.5