Title: CADAL Digital Library
1 CADAL Digital Library
The 2nd International Conferenceon Universal
Digital Library(ICUDL 2006)
- Wu Jiang-Qin,Zhuang Yue-Ting
- Pan Yun-he
- College of Computer Science, Zhejiang
University,China - November 18,2006
2Outline
Introduction
1
2
Unified Paralleling Search
Multimedia Analysis and Retrieval
3
Bilingual services
4
5
Chinese Calligraphy Character Retrieval
6
Conclusion and Future Work
3Outline
Introduction
1
2
Unified Paralleling Search
Multimedia Analysis and Retrieval
3
Bilingual services
4
5
Chinese Calligraphy Character Retrieval
6
Conclusion and Future Work
4CADAL
- The China-Us Million Book Digital Library(CADAL)
is an international cooperation program between
China and the US. - The objective of CADAL project , is to create a
free-to-read, searchable collection of one
million book, available to everyone over the
internet. - CADAL is the important part of Universal Digital
Library(UDL), universal access to human
knowledge.
5The challenges and services (1)
- the amount of the digital resources including
digital books and multimedia for research and
education can reach 100 terabyte(The number of
digital books is 1,023,425 by October of
2006,including previous Chinese ancient books,
Chinese minguo books ,Chinese Modern books,
Chinese degree dissertation,English
books,image,video etc.. - active services of unified paralleling search for
the different types of digital resources
6The challenges and services (2)
- image, video,3-D model and other types of media
resources, various types of media resources are
included in the CADAL resources. - the services of quickly retrieving and
structurally browsing of multimedia documents
including image, video
7The challenges and services (3)
- there are two kinds of language digital books.
Chinese and English, in the CADAL resources. - the services of bilingual translation
8The challenges and services (4)
- traditional Chinese culture resources are
important part of the CADAL resources. - the services related to Chinese traditional
culture resources.
9Outline
Introduction
1
2
Unified Paralleling Search
Multimedia Analysis and Retrieval
3
Bilingual services
4
5
Chinese Calligraphy Character Retrieval
6
Conclusion and Future Work
10Background
- TB volume of various types of digital resources,
such as dissertation, ancient minguo book, modern
book, minguo journal, English book, drawing,
video and illustration are available in the
CADAL, which is one of the distinct
characteristic of CADAL. So CADAL presents a
challenge for the technique of searching
resources based on metadata.
11Metadata
- Dublin core metadata is used to describe the
million digital books in the CADAL project.
Metadata corresponding to the other types of
multimedia resources are used to describe them.
Independent data map is designed for each kind of
resource metadata.
12Unified parallel searching
- In order to meet the requirements of different
users and improve the users interactive
experience, the service for the different types
of digital resources is provided for users
convenient searching.
13(No Transcript)
14(No Transcript)
15Outline
Introduction
1
2
Unified Paralleling Search
Multimedia Analysis and Retrieval
3
Bilingual services
4
5
Chinese Calligraphy Character Retrieval
6
Conclusion and Future Work
16Background
- As the digital library contains unstructured
multimedia resources such as images, videos,
audios etc besides digital books, effective and
efficient analysis and retrieval of multimedia
resources is a challenging problem in the CADAL
digital library. - Here we examine the analysis and retrieval issues
related to two primary kinds of multimedia, image
and video.
17Contents
- Content-based Image Retrieval
- Image retrieval by peer indexing
- Image annotation
- Image search engine
- Video analysis system
- Video Browser(structure and summary)
- Metadata-based Video Retrieval
18Content based image retrieval
- Extracting visual features
- color featurecolor histogram, color moment,
color coherence vector, color correlgram - textureTamura textural feature and co-occurrence
textural feature - relevance feedback
- Make image retrieval coincide with users
requirement
19Content based image retrieval
Query example
Negative example
Relevance feedback
Image searching
Positive example
20Image retrieval by peer index
- A new scheme for image indexing, Peer Index, is
the method that describe images through
semantically relevant peer images. - In particular, each image is associated with a
two-level peer index, including - global peer index describing the data
characteristics of this image - personal peer indexes describing the user
characteristics of an individual user with
respect to this specific image - Both types of peer index are learned
interactively and incrementally from user
feedback information.
21Peerindex-based image retrieval
semantic relevance feedback
Semantic query
22Image annotation
- Automatic semantic annotation for images by
machine learning and statistical modeling - Classify the training images, and create a
semantic skeleton for each class of the training
image. - Classify new image with Support Vector Machine
automatically, and describe it using the semantic
skeleton - Select the key words for the image by statistical
methods
23Image annotation
............
classify
statistical learning
Semantic skeleton
annotation
tiger
annotate
Visual similar
classify segment
24Text based image retrieval
25Image search engine
- We implemented an image search engine, Octopus,
which provides Peer Index and relevance feedback
to avoid the gap between the semantics and
low-level features, according to the intuitive
and simple idea that the semantic concept is
hidden in each image and the semantic concept
appears apparently in the relation between the
image and the other images.
26Integrating into CADAL DL
27The image retrieval interface
28Our target for video
- Analyze multimodal information, such as the
visual, the audial, motion and caption to
generate structural information and video summary -
- Support video browsing and video retrieval based
on metadata and structural information
efficiently
29Main idea
- Nonlinear browsingGenerate structural indexing
such as key frame, shot and shot group from the
original video stream - Content compressionAnalyze time sequence in
video stream, eliminate redundant data, and
generate the summary and the highlight scene for
the original video.
30System
- Video Fusion Analysis System (VideoFAS)
- VideoBrowser
31VideoFAS-system interface
Original Video
Similar Video shots are Clustered together
Video shot
32VideoFAS-system functions
- Basic operation
- Importing and Saving
- Appending
- Separating the video stream into video and audio
data - transcoding and compressing
33VideoFAS-system functions
- Feature Extraction
- Visual feature
- color color histogram, color moment, color
coherence vector, color correlgram - Texture Tamura textural feature and
co-occurrence textural feature - shapecontour feature
- Audial feature
- temporal featurezero-crossing rate
- Frequency featureMel coefficient?tone and
sub-band statistical feature - Target Feature
- Integrate OpenCV face detection module into the
system Extract the face features
34VideoFAS-system functions
- Video structuring
- shot detection
- Cut shot detection
- Transition shot detection
- key frame extraction
- Similar shot grouping
- group the shots based on Support Vector Machine
35(No Transcript)
36VideoFAS-system functions
- Video summarization
- Summarize by Mining Non-Trivial Repeating
Patterns - Extract frequent and non-trivial shot sequence to
generate video summary
37VideoFAS-system functions
- Metadata annotation
- Annotate Video clip with metadata conform to
Dublin Core Standard - Save the metadata and the video structural
information in database
38VideoBrowser-framework
39VideoBrowser-system interface
40VideoBrowser-system interface
metadata
media player
Video structural information
41System architecture
Web
Movies
Internet
Web server
Retrieval service
video data
firewall
switcher
Online storage Disk array
annotation
structuring
Archive server
summarization
taper(offline storage)
42Outline
Introduction
1
2
Unified Paralleling Search
Multimedia Analysis and Retrieval
3
Bilingual services
4
5
Chinese Calligraphy Character Retrieval
6
Conclusion and Future Work
43Background
- As there are both English and Chinese books in
CADAL, bilingual services are required for users
to access resources in any language.
44Services
- Some technologies and prototypes have been
developed by north technical center on how to
carry out the multi-layered bilingual machine
translation in English and Chinese books, such as
- the metadata translation between English and
Chinese - the accurate translation of proper nouns such as
names for unique individuals, events,or places - the selective translation in a full-text context
- the translation of Old Chinese text
- the distributed translation service technique.
45Services
- An online translation service is integrated into
the CADAL digital library. - Users can be directly conducted semantic-based
multi-linguistics retrieval of required
information in our CADAL digital library. - The translation of contents of a page on line.
- The translation of metadata of a digital book.
46Bilingual Search
47 The translation of contents of a page
48(No Transcript)
49Outline
Introduction
1
2
Unified Paralleling Search
Multimedia Analysis and Retrieval
3
Bilingual services
4
5
Chinese Calligraphy Character Retrieval
6
Conclusion and Future Work
50Background
- Since most people are interested in the art of
the beautiful styles of calligraphy character
rather than the meaning of the character, the
service of Chinese calligraphy character
retrieval is provided in the CADAL digital
library, treating them just as they are images
without recognizing them like OCR does.
51Calligraphy art still alive in
52Key issues
- Feature extraction character complexity, stroke
density and shape, the three kinds of features of
the calligraphy character are proposed - similarity matching cost retrieve relevant
images according to it.
53Contents
- Chinese Calligraphy Page Segmentation
- Features Extraction
- Character Image Retrieval
54Chinese Calligraphy Page Segmentation
- The page image are binarized with characters in
black and the background in white. - Cut the page into columns according to the
vertical projecting histogram, and columns
continued to be cut into individual characters. - All the characters are normalized in order to
keep scale invariant Contour information,
55Page segmentation
56Features Extraction
- shape
- character complexity
57shape representation
- Calligraphic characters shape is represented by
its contour points. - The polar coordinates is used to describe
directional relationship of points instead of the
Cartesian coordinates. - For direction, we use 8 bins in equal degree size
to divide the whole space into 8 directions. - For radius, we use 4 bins
- For each point of a given point set composed of
sampling points, its approximate shape context is
described by its relationship with the remaining
points in weighted bins.
58shape representation
Contour point
59Calligraphy Character Complexity
- We use Calligraphy Character Complexity as a
filter at the beginning to discard the
calligraphy character that has no possibility to
be similar to the query.
L be the number of sampled contour points from
the query and Li be the number of sampled contour
points from candidate image. ? is the threshold
obtained by experience.
60Character Image Retrieval
- Compute the values of the character complexity of
the calligraphy character. - Normalize the scale size of the query and sample
its contour points. - Filter the candidate images by character
complexity - Extract the shape feature and employ the shape
matching method introduced in 6 to compute the
matching cost for every remaining candidate image
and the query. - Rank the results according to the matching cost,
and return.
61The calligraphy character retrieval
62interface of browsing the original works
63Outline
Introduction
1
2
Unified Paralleling Search
Multimedia Analysis and Retrieval
3
Bilingual services
4
5
Chinese Calligraphy Character Retrieval
6
Conclusion and Future Work
64All the services have been accessed by the users
from over 70 countries 280.000 times per day.
65Conclusion and Future Work
- With the increase of the number of the users and
the amount of the resources. future work with
CADAL digital library will proceed in several
directions - Improving the performance of the current
services, to be more complete and be more stable - Continuing exploring the application of
multimedia in Digital Library.
66Thanks!Welcome Visiting CADAL Digital
Library(WWW.CADAL.ZJU.EDU.CN)
Email wujq_at_cs.zju.edu.cn yzhuang_at_zju.edu.cn