Title: Collaboration on Digital Libraries NEC Dec' 27, 2000
1Collaboration on Digital LibrariesNECDec. 27,
2000
- Edward A. Fox
- fox_at_vt.edu http//fox.cs.vt.edu
- CS DLRL Internet TIC
- Virginia Tech, Blacksburg, VA, USA
2Acknowledgements (Selected)
- Mentors JCR Licklider, Michael Kessler, Gerard
Salton - Sponsors Adobe, IBM, Microsoft, NLM, NSF, OCLC,
SOLINET, SURA, UNESCO, US Dept. of Ed. (FIPSE),
- VT Faculty/Staff Tony Atkins, Debra Dudley,
John Eaton, Gwen Ewing, Peter Haggerty, JAN Lee,
Gail McMillan, Manuel Perez, Len Peters, James
Powell, - VT Students Emilio Arce, Fernando Das Neves,
Brian DeVane, Robert France, Marcos Goncalves,
Scott Guyer, Robert Hall, Brian Hobbs, Neill
Kipp, Paul Mather, Tim McGonigle, Todd Miller,
Constantinos Phanouriou, William Schweiker, Ohm
Sornil, Hussein Suleman, Patrick Van Metre, Laura
Weiss,
3URLs
- http//fox.cs.vt.edu
- http//ei.cs.vt.edu/dlib (Courseware)
- http//www.dlib.org (D-Lib Magazine)
- www.smete.org and later www.nsf.gov/nsdl
- www.ndltd.org and www.theses.org
- www.cstc.org (CSTC and JERIC)
- www.openarchives.org
- www.jcdl.org (JCDL2001 June 24-28)
4Digital Library Courseware
- http//ei.cs.vt.edu/dlib/
- WWW pages or large PDF copy files
- CourseInfo quizzes based on books by Michael Lesk
(MKP.com) and William Arms (MIT Press) - Contents based on books, with other popular
topics added (e.g., agents) - Separate pages to supplement Definitions,
Resources (People, Projects), and References
5JCDL 2001
- First Joint ACM/IEEE Conference on
Digital Libraries ( NSF DLI-2 PI mtg) - http//www.jcdl.org
- June 24-28, 2001 in Roanoke, VA
- Conference Committee
- General Chair Edward A. Fox, Virginia Tech
- Program Chair Christine Borgman, UCLA
- Treasurer Neil Rowe, Naval Postgraduate School
- Posters Chair Craig Nevill-Manning, Rutgers U.
6Locating Digital Libraries in Computing
and Communications Technology Space
Digital Libraries technology trajectory
intellectual access to globally distributed
information
Communications (bandwidth, connectivity)
Computing (flops)
Digital content
less
more
(Slide from S. Griffin, NSF)
7(No Transcript)
8PetaPlex Complex
Service Machine 1
Service Machine 3
Service Machine 2
Service Machine 4
FRONT END MACHINE RS/6000, 1G RAM, 4 Proc.
PetaPlex Core
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
Nanoserver
9PetaPlex
- Digital Library Machine (super object store)
Parallel computer / storage utility - Research inverted files, video server,
(supported by IBM, AOL, NSF, ) - Knowledge Systems Incorporated is supplying
VT-PetaPlex-1 with 2.5 terabytes through 100
nodes - Net connection 25GB disk 233 MHz Pentium
Linux
10MARIAN
- Multiple Access Retrieval of Information with
Annotations - (Marian the Librarian )
- Evolved from CODER system to a distributed Online
Public Access Catalog (OPAC), then DL backend,
now becoming a full DL system - From C/C to Java
- Future NDLTD, NUDL, PetaPlex
- Use for campus collection management
- Use for www.theses.org as centralized system with
gateway services OAI, Harvest, Z39.50,
11MARIAN Layers
User Interface Layer
User Information Layer
Search Engine Layer
Database Layer
12MARIAN Parallelism
13(No Transcript)
14(No Transcript)
15(No Transcript)
16(No Transcript)
17(No Transcript)
18ENVISION
- NSF A User-Centered Database from the Computer
Science Literature (1991-93) - Collected bib/typesetter data, converted to SGML
- Scanned thousands of page images
- MARIAN search engine - can be made available
(also applied to the Virginia Tech library
catalog) used as part of a prototype object-based
DL, with tailored visualization interface (L.
Nowell dissertation)
19(No Transcript)
20(No Transcript)
21(No Transcript)
22(No Transcript)
23DL-Related Timeline
WWW
1985
1990
1995
2000
xxx
OAI
Scholarly EPub in Us
CoRR
NCSTRL
CSTR
XML
PDF
SGML
MPEG-7
JPEG, MPEG
DLI
Proposed Ugrad DL (Envision, EI)
DLI2
PCs
NSDL (CSTC, iLumina,)
TEI
HyperCard
Java
DC
RDF
Hypertext Conf.
ETDs
NDLTD
24Information Life Cycle
Borgman et al. Workshop Report on Social Aspects
of Digital Libraries http//www-lis.gseis. ucla.
edu/DL/
25Core of DL
- Collecting
- Authoring, Repositories, Archives, Museums,
- Organizing
- Packaging of Data and Metadata, Storing
- Naming/Identifying and Cataloging
- Classification, Clustering,
- Serving
- Indexing, Linking, Summarizing, Visualizing
- Browsing, Accessing, Searching, Filtering,
Retrieving, Distributing, Using,
26Digital LibrariesShorten the Chain from
Author
Editor
Reviewer
Publisher
AI
Consolidator
Library
Reader
27DL Users Direct(Organized Artifact Mediated
Communication)
Roles
Digital Library
Author
Teacher
User
Reader
Editor
Learner
Reviewer
Librarian
Dr.
Patient
28Author tools
www.physik.uni-oldenburg.de/EPS/mmm
29(No Transcript)
30(No Transcript)
31A Digital Library Case Study
- Project
- Networked Digital Library of Theses
Dissertations - http//www.ndltd.org (NDLTD remember
- ND LTD / NDL TD)
- (also, newer NUDL Networked University Digital
Library, with e-courseware, etc.)
- Domain graduate education, research
- Genre ETDs electronic theses dissertations
- Submission http//etd.vt.edu
- Collection http//www.theses.org
32Status of the Local Project
- Approved by university governance Spring 1996
required starting 1/1/97 - Submission access software in place
- Submission workshops for students (and faculty)
occur often beginner/adv. - Faculty training as part of Faculty Development
Initiative - Over 3000 ETDs in collection
- Some have audio, video, large images, software,
- Millions of accesses/yr 100s to 1000s per work
33What are the long term goals?
- Attract all TDs/yr 50K D-US, 25K D-Germany, 10K
TD-Canada, - gt200K/yr rich hypermedia ETDs that may turn into
electronic portfolios (images, video, audio, ) - Dramatic increase in knowledge sharing
literature reviews, bibliographies, - Services providing lifelong access for students
browse, search, prior searches, citation links - Hundreds/thousands of downloads / year / work
34Student Gets Committee Signatures and Submits ETD
Approval form
35Library Catalogs ETD, Access is Opened to the New
Research
WWW
NDLTD
Digital library access control
36US University Members (44)
- Air University (Alabama)
- Baylor University
- Brigham Young University (part, whole)
- Caltech
- Clemson University
- College of William Mary
- Concordia University (Illinois)
- East Carolina University
- East Tenn. State U. require fall 2000
- Florida Institute of Technology
- Florida International University
- George Washington University
- Louisiana State University
- Marshall University (W. Va.)
- Miami University of Ohio
- Michigan Tech
- Mississippi State University
- MIT
- Naval Postgraduate School (CA)
- Penn. State University
- Rochester Institute of Tech.
- U. of Colorado Health Science Center
- U. of Florida
- U. of Georgia
- University of Hawaii, Manoa
- U. of Iowa
- U. of Kentucky
- U. of Maine
- U. of North Texas required since 8/99
- U. of Oklahoma
- U. of South Florida
- U. of Tennessee, Knoxville
- U. of Tennessee, Memphis
- U. of Texas at Austin required in 2001
- U. of Virginia
- U. Wisconsin - Madison
- Vanderbilt U.
- Virginia Commonwealth U.
37National / Regional Projects
- Australia
- U. New South Wales (lead)
- U. of Melbourne
- U. of Queensland
- U. of Sydney
- Australian National U.
- Curtin U. of Technology
- Griffith U.
- Germany
- Humboldt University (lead)
- 3 other universities
- 5 learned societies Math, Physics, Chemistry,
Sociology, Education - 1 computing center
- 2 major libraries
- Consorci de Biblioteques Universitàries de
Catalunya, as group, www.cbuc.es - Universitat de Barcelona
- Universitat Autonòma de Barcelona
- Universitat Politècnica de Catalunya
- Universitat Pompeu Fabra
- Universitat de Girona
- Universitat de Lleida
- Universitat Rovira i Virgili
- Universitat Oberta de Catalunya
- Biblioteca de Catalunya
- OhioLink
- South Africa ECHEA/SEALS
- India, Portugal,
38Other Countries with Members
- Netherland
- Norway
- Russia
- Singapore
- S. Africa
- S. Korea
- Spain
- Taiwan
- UK
- Belgium
- Brazil
- Canada
- Germany
- Hong Kong
- India
- Italy
- Korea
- Mexico
39Build Local ETD Site
Digital Library
40CS Teaching Center (CSTC)
- Collection of reviewed online resources used to
aid in teaching of Computer Science - Supports author submission and peer-review
process for new ACM Journal of Educational
Resources In Computing (JERIC) - Connected with NSDL (NSF 00-44)
- http//www.cstc.org
41CS Teaching Center (CSTC)
- Instead of building large, expensive multimedia
packages, that become obsolete and are difficult
to re-use, concentrate on small knowledge units. - Learners benefit from having well-crafted modules
that have been reviewed and tested. - Use digital libraries as a powerful base of
support for learners, upon which a variety of
courses, self-study tutorials reference
resources can be built. See NSF NSDL - National
Science (math, engineering, technology education)
Digital Library (formerly SMETE-lib) at
www.dlib.org/smete/public/smete-public.html
www.smete.org - iLumina NSF NSDL grant with COLLEGIS Research
Institute/Eduprise, UNCW, TCNJ,
42(No Transcript)
43Browsing (1)
44Browsing (2)
45(No Transcript)
46(No Transcript)
47(No Transcript)
48(From Lee Zia, NSF) Programmatic History
49Expectations of Tracks
- Core Integration to coordinate a distributed
alliance of resource collection and service
providers, and to ensure reliable and extensible
access to and usability of the resulting network
of learning environments and resources - Collections to aggregate and actively manage a
subset of the digital librarys content within a
coherent theme or specialty - Services to increase the impact, reach,
efficiency, and value of the digital library in
its fully operational form - Targeted Research to have immediate impact on
one or more of the other three tracks
50Tracks 29 Projects
- 6 Core Integration Columbia, Cornell,
E.Michigan/MERIT, UCAR, UCB, U-Missouri/NCSA
(Biology, Eng., Teacher Ed.) - 13 Collections Atmosphere, Biology, Biosciences,
Earth Systems, Engineering, Health Sciences, Math - 9 Services Competitive Intelligence, Component
Environment, Earth Systems J., Metadata NLP,
Managing LOs, Peer Review, Video - 1 Targeted Research Paths
51NSDL Spine
(Slide from Dave Fulker, Bill Arms 11/2/2000)
52Our Collaboration for NSDLPARTNERS
- Hofstra
- Villanova
- Penn State (with NEC)
- Virginia Tech
- ACM, IEEE-CS, Morgan Kaufmann,
53Our Collaboration for NSDLFUNDING
- 1M for 2 years, starting 9/1/2001 - NSF
- 225K Hofstra (1 GRA, 1 PI)
- 175K Villanova(1 GRA, 1 PI)
- 175K Penn State(1 GRA, 1 PI)
- 425K VT (4 GRAs, 3 PIs Fox, Lee, Perez)
- ACM, IEEE-CS, Morgan Kaufmann,
54Our Collaboration for NSDLSTRENGTHS
- PetaPlex, MARIAN, NDLTD, CSTC, JERIC
- SIGCSE SIG, Conference, Bulletin
- History as integrating theme adding demos
- Special support for Hispanic community
- Niche portals, search engines, links across
collections, citation data --- for levels
undergrad, high school, middle school, etc.
55Our Collaboration for NSDLPROPOSAL PLAN
- Student project completed Fall 2000
- Kate will continue in Spring 2001 through an
independent study - Meetings John visited VT, Boots and I visit
Hofstra today, I visit NEC on 12/27, John visits
VT again, - Get support letters, refine proposal,
56Our Collaboration for NSDLDOCUMENTS
- Computing Digital Library (CoDL)
- Packet prepared by student group
- See their slides next
- Contents
- Project report
- CoDL proposal outline
- Proposals from some successful NSDL groups