CONCERT 2000, Taipei - PowerPoint PPT Presentation

1 / 41
About This Presentation
Title:

CONCERT 2000, Taipei

Description:

Title: ProQuest Overview Strategic Plan Author: tfegan Last modified by: Andrew Ding Created Date: 4/6/2000 9:57:46 PM Document presentation format – PowerPoint PPT presentation

Number of Views:68
Avg rating:3.0/5.0
Slides: 42
Provided by: tfe6
Category:
Tags: concert | taipei | z3950

less

Transcript and Presenter's Notes

Title: CONCERT 2000, Taipei


1
CONCERT 2000, Taipei
  • Adding Value to Full Text Databases
  • A Look at the Digital Vault and Intelligent
    Document Linking
  • By Richard Hollingsworth
  • Bell Howell Information and Learning

2
Mission
  • To effectively search multiple knowledge sets?
  • Process needs to be intuitive
  • Organization is imperative
  • Answers need to be precise (of course!)
  • Dont want to create more confusion

3
Approaches
  • Meta Searching
  • Search everything at once
  • Controlled Searching
  • Need to begin with a core set of data.
  • Index
  • Taxonomy

4
Meta Searching
  • Meta Searching is the buzz
  • Search everything at once

5
Difficulties in Meta Searching
  • Not all databases are created equal
  • Different Thesauri
  • Different Vocabularies
  • Different Engines
  • Different Formats
  • Z39.50 Is one solution
  • Well we all know what that means!

6
Example
  • User searches on Clinton and campaign finance
    reform
  • Search Web, Biographies, Business, Health, News,
    OPAC, Reference
  • Results are either organized in categories or
    combined.

7
Results
  • Confusing at best
  • Why search bibliographies for campaign finance
    reform?
  • Doesnt highlight related or relevant topics and
    figures to the original query.
  • Doesnt add any intelligence into the process.
  • Navigating from one set to another.

8
Summary on Meta Searching

9
Controlled Searching
  • Start with a known core
  • Integrate relevant components around it

10
Example
  • User searches on Clinton and campaign finance
    reform
  • Search General Reference, Business and News
  • Link to OPAC, Biographies, and other reference
    material.

11
Results
  • Core relevant list of articles to begin the
    research process
  • Advantages
  • Known query syntax
  • Expected results output
  • Easier navigation
  • Simple method of joining relevant resources
  • Supports idea generation and thinking.

12
Controlled Searching
  • Start with a known core
  • Integrate relevant components around it


13
A solution!ProQuest IDL
  • Intelligent Document Linking
  • Three major components
  • Term recognition or markup
  • Knowledgebase(s)
  • External content sources

14
Markup
  • Term recognition
  • Sophisticated software that marks-up terms in the
    text such as people, places, and companies.
  • Extension of our auto-indexing software.
  • Software is tunable.
  • Limit to the vocabulary
  • Remove the limits

15
Knowledgebase
  • Known list of answers
  • First integrated knowledgebase is The WorldBook
    Encyclopedia
  • Easy to read
  • Conveniently available in the ProQuest vault
  • Great fit for current general reference and news
    products.

16
External Sources
  • OPAC
  • Best of Web (Index of Web Sites)
  • Other subscription and free sites.
  • Dictionary
  • Maps
  • 3rd party Database Subscriptions
  • Others.

17
(No Transcript)
18
(No Transcript)
19
(No Transcript)
20
(No Transcript)
21
(No Transcript)
22
(No Transcript)
23
(No Transcript)
24
(No Transcript)
25
(No Transcript)
26
Future
  • Additional KnowledgeBases
  • Health
  • example follows
  • Business
  • Adding premium content sources to supplement our
    high quality AI databases.

27
(No Transcript)
28
(No Transcript)
29
Future
  • Additional support for external sources
  • Help us define these!
  • Additional markup capabilities
  • Subjects/Concepts
  • Products
  • etc...

30
Digital VaultTM Opening the Vault on 500 Years
of History
31
Digitizing The Vault
  • Bell Howell Information and Learnings
    microfilm vault is the largest commercially
    available collection
  • 20,000 periodicals, 7,000 newspapers and 400
    Research Collections and 1,000,000 dissertations
  • 3 climate controlled underground vaults
  • Over 5.5 billion page images
  • Using the microfilm contained in the vaults to
    create the largest digital collection

32
Opening The Vault
  • First Digital Vault product released 1999 - Early
    English Books Online (EEBO)
  • Focus in 2000
  • Bringing a core collection of periodicals to
    libraries
  • Two New Collections
  • - Gerritsens Collection on Womens History
  • - Genealogy and Local History
  • Digital Sanborn Maps

33
Early English Books Online
  • EEBO - Digitized version of 3 Early English
    Books-related collections
  • Total surviving record of the English-speaking
    world from 1473 - 1700.

This is the first book printed in English by the
famous printer, William Claxton in 1473. While
printed in English it was actually printed in
France. (Huntington Library)
34
Early English Books Online
  • Database transforms scholarship of material
    from this era
  • Electronic works are now used in a classroom
    setting - not just for graduate study
  • Search and retrieve works instantly
  • Covers virtually every subject area
  • Science, mathematics, engineering, womens
    studies, etc.

35
Periodicals from the Digital VaultTM
Connecting tomorrows library with the past
36
Digital Vault - Periodicals
  • Definition
  • Web-access to page images from retrospective
    journals and magazines
  • Cover to cover coverage - opinion and fact remain
    intact alongside the advertisements of the day
  • Creation of dirty ACSII for keyword searching
    plus TOC level access
  • Seamless integration with current content in
    ProQuest

37
Digital Dissertations
38
Maintaining the Scholarly Record
  • Capturing dissertation research in depth
  • From 1938 to the present, Bell Howell
    Information and Learning has been the recognized
    repository for dissertations in North America.
  • World-wide access to over 1.6 million citations
  • More than 1 million titles available in full text
  • Each year over 55,000 new titles added
  • Retrospective North American coverage to 1861
  • REFERENCE COPIES ON DEMAND
  • ARCHIVE

39
Creating the Digital Library
  • Beginning with 1997 submissions, we are
    converting all incoming paper dissertations to
    Adobe PDF format.
  • Currently there are over 170,000 full text
    dissertations and Masters theses available for
    downloading in our digital archive.
  • We are now accepting dissertations in digital
    format. Institutions can submit via CD-ROM or a
    FTP server.

40
ProQuest Digital Dissertations
  • Access to the the Dissertation Abstracts
    database
  • Visitors have free access to over 100,000 titles
  • http//wwwlib.umi.com/dissertations
  • Library subscription to the entire 1.6 million
    citation database including free access to all
    PDF files from their school
  • Free twenty-four page previews for all PDF files
  • Search by both fixed-field and key word

41
Adding Value to Full Text
  • Quality of Index and Abstracts
  • SiteBuilder Technology
  • Intelligent Document Linking
  • Digital Vault Initiative and relevant
    knowledgebase(s)
Write a Comment
User Comments (0)
About PowerShow.com