Fourth Annual Midwest CONTENTdm Users Group Meeting - PowerPoint PPT Presentation

1 / 56
About This Presentation
Title:

Fourth Annual Midwest CONTENTdm Users Group Meeting

Description:

CONTENTdm 5 Top Ten. Find Search Engine. Find search engine integrated into CONTENTdm software ... CONTENTdm 5 Top Ten. Find Search Engine ... – PowerPoint PPT presentation

Number of Views:89
Avg rating:3.0/5.0
Slides: 57
Provided by: geribunk
Category:

less

Transcript and Presenter's Notes

Title: Fourth Annual Midwest CONTENTdm Users Group Meeting


1
Fourth Annual Midwest CONTENTdm Users Group
Meeting
2
CONTENTdm Directions
Purdue West Lafayette, Indiana March 19, 2009
  • Geri Ingram
  • OCLC Digital Collection Services
  • User Services Manager

3
Information--looking for grounding in Web 2.0
  • No wonder we talk about information space
  • Porous boundaries
  • no strictly personally interesting info
  • no completely professional info
  • If I cant find it, I didnt park it right. (tag
    it, catalog it, file it)?
  • If I cant find it, I cant get it, or use it
    (share it).

4
Striking our roots
  • Ranganathan 1892-1972, Bangalore, Indiaknown as
    the father of library science in India. A
    mathematician, epistemologist and librarian
  • Five laws of library science
  • Books are for use.
  • Every reader his or her book.
  • Every book its reader.
  • Save the time of the User.
  • The library is a growing organism.

5
Saving the time of the user, at 21st century speed
  • Most of our information transfer is
    digitalflying at the speed of light
  • Where did I see (hear) that?
  • What was the context?
  • How does it relatebecause it ALL relates!
  • Who needs to know this?
  • How can I share it?

6
Ranganathans book
  • Media, education, public service
  • Were all in the same business, helping people
    find, get, and use informationTHEIR information
  • WHEN they need it, regardless of where its
    found, or how its requested.
  • Ranganathans book might be an image

7
Every reader his or her book.
  • So, too, every digitized item is special to
    someone
  • The aggregate is special to most
  • And its integration with other complementary
    content is valuable to all

8
Whats important to you today?
  • The state of the software were here to discuss
  • Have we been listening?
  • What is the significance of Release 5?
  • Are we still listening?
  • The future of the software
  • Will we keep listening?

9
From its inception CONTENT(dm) has been a
response to searchers needs
  • From its roots in bio-medical engineering
  • To Libraries special collections
  • To integrated digital collections for research
    and teaching
  • To a globally created and accessed multi-media
    repository
  • CONTENTdm developers have listened to users
  • Were still listening
  • Were putting in processes to insure that we
    listen
  • Well, and for the long haul

10
CONTENTdm upgrades for users
3.3Jul 2002
3.1May 2001
4.0Jun 2005
3.5Jul 2003
3.7 3.8 Jul 2004
4.1Mar 2006
3.4Jan 2003
3.6Feb 2004
3.2Feb 2002
4.3Oct 2007
4.2Dec 2006
3.5.1Sept 2003
4.0.1Oct 2005
  • Version 3.4 Jan 2003
  • OAI support for harvesting of metadata
  • Multi-Site Server
  • Version 3.5 Jul / Sep 2003
  • WorldCat link for metadata harvesting
  • Customizable Web Templates
  • Version 3.6 Feb 2004
  • Batch Add wizard
  • Advanced Search has a new interface and added
    functionality
  • User Support Center introduced
  • Version 3.7 3.8 Jul 2004
  • Zoom and pan toolbar for viewing images Compare
    button
  • Tab-delimited Text Import
  • Collection and item-level security
  • Web browser-based editor
  • Version 3.8
  • JPEG2000
  • Version 4.0 Jun / Oct 2005
  • EAD support
  • PHP-based API which supports broader
    customizations and interoperability
  • User Interface based on the new PHP API
  • OCR Extension with ABBYY FineReader
  • Version 4.1 Mar 2006
  • Redirects for obsolete URLS
  • PHP 5.0 supported
  • Version 4.2 Dec 2006
  • Multiple Compound Object Wizard
  • OCR Extension upgraded to ABBYY FineReader
    Version 8
  • Ability to highlight, view and clip individual
    newspaper articles
  • Search and browse by date range
  • Version 4.3 Oct 2007
  • Easier to manage and access PDF documents,
    especially multiple-page PDFs
  • Improved controlled vocabulary
  • OCLC Connexion digital import

11
We have a long history of enhancing the software
in response to our users requirements
  • 2003 Librarians said, What about
    interoperability? What about providing federated
    searching?
  • And I need to brand my collections for disparate
    user groups!
  • 2004 We need to move more data, faster I need
    to build big text collections, fast!.
  • and My users want more search functionality, and
    the ability to manipulate the display images
  • And, My users want to see high-resolution images
    but they dont want to install plug-ins! I dont
    want to get tied to proprietary formats, but I
    dont want to expose my valuable source images
    either.

12
And in 2005, you added
  • I need to protect some of my data from access
    by some of my users
  • I need to be able to maintain my metadata
    through a web browser.
  • We need to provide our finding aids onlinewhat
    do you have?
  • And, We want more control over our interface!

13
In 2006, librarians said
  • Loading a book is an improvement, but I need to
    load whole libraries!
  • And My users are interested in newspapersgive
    me some desktop OCR and a good searching/highlight
    mechanism
  • And, When are you going to fix those darn DATES?

14
By 2007 we knew we had to find a way to search
the PDF faster, and completely
  • You said, My researchers need to find every
    instance of a phrase across thousands of
    documents, and they dont want to wait!
  • And, help my catalogers mainstream the workwe
    want to leverage our existing MARC records
  • By 2008, we finally got back to those EAD finding
    aids
  • And to Unicode
  • And the need for faceted searching, with
    relevancy ranking.

15
We are listening
  • As your digital library programs have grown, so
    have your needs for tools
  • Stewarding the materials through the entire
    life-cycle
  • Helping every book (to find) its reader

16
OCLC Digital Collection Services Trends in
library digital collections
  • Aggregation and integration of digital content is
    important
  • Surface/expose collections in common and familiar
    discovery tools - search engines and aggregators
    - make collections radically accessible 
  • Universal search - the ability to search digital
    collections and other electronic resources
    through the same user interface rather than
    specialized sites

17
We are listeninghow CONTENTdm fits into the
bigger picture
  • Build a digital repository within the OCLC
    cooperative
  • A rich set of digital collections created by
    libraries, museums, and other cultural heritage
    organizations
  • Linked through the global discovery of WorldCat
  • As of March 3rd, 1.4 million records (titles)
  • 1.4 billion holdings

18
OCLC Digital Collection Services
  • Solutions to help you to create, manage,
    share and preserve your digital collections
  • Digitization
  • CONTENTdm
  • Hosting Services
  • Web Harvesting
  • WorldCat Harvesting
  • Digital Archive

19
CONTENTdm Version 5 Released December 2008
For more information about Version 5
www.oclc.org/news/releases/20093.htm
20
CONTENTdm 5
  • Milestone release
  • Released December 17th, 2008
  • Significant changes throughout the software
  • Sets foundation for future enhancements that will
    continue to further CONTENTdms use as the
    leading digital repository platform

21
State of the releasewe are listening!
  • HUGE diversity of environments and uses
  • First service pack released February (5.0.1)
  • Second service pack to be released by tomorrow
    (5.0.2)
  • fixes some serious and many just irritating
    problems!
  • There may be a third service pack in April.
  • There will be a version release 5.1 in May.
  • Scheduledual systemsmigration
  • Waiver of EULA restriction

22
CONTENTdm 5 Top Ten
  • Unicode Support
  • Full support of Unicode for importing, storing,
    displaying and searching Unicode languages
  • OCR language support expanded 184 languages
  • Supports the creation and exposure of digital
    collections in any language

23
CONTENTdm 5 Top Ten
  • Find Search Engine
  • Find search engine integrated into CONTENTdm
    software
  • More robust capacity and the ability to offer
    additional search features
  • Relevancy sorting
  • Faceted searching
  • Spelling suggestion
  • Unicode searching

24
CONTENTdm 5 Top Ten
  • Find Search Engine
  • Leveraging existing OCLC technology by
    integrating the Find search engine
  • Search in any language
  • More tools to help end-users find what they are
    looking for, faster
  • Better end-user experience

25
CONTENTdm 5 Top Ten
  • Controlled Vocabularies
  • Integration with OCLC Terminologies Service
  • Providing nine new thesauri for CONTENTdm users
  • Adds efficiency to collection building by
    providing pre-loaded thesauri for cataloging

26
CONTENTdm 5 Top Ten
  • Reports
  • More robust, scalable reporting module integrated
    into software
  • Provides expanded reports
  • Views by collection and item
  • Top searches within CONTENTdm
  • Web statistics by month, day, hour
  • Top URLs, errors, referring sites, IP addresses,
    authenticated users, browsers, and countries
  • Access to log files
  • Export CONTENTdm reports

27
CONTENTdm 5 Top Ten
  • Flexible Workflows
  • Added more options for approving and indexing
    items
  • New batch and subset handling of pending items
  • One-click approve index on demand
  • Scheduling options for approve and index
  • Background processing

28
CONTENTdm 5 Top Ten
  • Registration
  • New registration process added during
    installation
  • One-click sends server information to OCLC
  • Registered servers called once a month to gather
    data on usage
  • FEEDBACK!
  • User Support Center is being completely
    overhauled now!

29
CONTENTdm 5 Top Ten
  • Project Client
  • New client application replaces old version
  • New programming language
  • New, more intuitive interface
  • Unicode support
  • More robust
  • And many other enhancements

30
CONTENTdm 5 Project Client
31
Some notes from the developer--Project Client
Goals
  • Update Technology
  • Increase throughput
  • Improve editing capability
  • Expand and improve support for different data
    types

32
Update Technology
  • .NET 3.0
  • Build on top of newest Microsoft technology to
    build modern application
  • HTTP file transfer
  • Do away with old technologies FTP and SFTP
  • Transfer small packets
  • Easy to configure
  • Robust

33
Why? Increase Throughputover the Acquisition
Station
  • Acquisition Station
  • Not optimized for multi processors
  • Editing one at a time
  • Long wait time for imports and uploads
  • Maximum of 5000 items
  • Pull down 100 items at a time for editing

34
Parallel Processing
  • Decrease time waiting for operations to complete
  • Maximize use of multi processor machines
  • Network transfers asynchronously
  • Upload asynchronously

35
Parallel Processing Multi Processor Systems
  • Project Client is optimized to support this
    architecture
  • Utilizes .NET threading technologies to scale
    well between 1 and n CPUs
  • Uses thread to move tasks to background and allow
    parallel work
  • Upload Manager
  • Background data checking

36
Template Creator
  • We added a hierarchal structure with the ability
    to turn on and off templates based on data types
  • General
  • Images
  • TIF
  • JPEG
  • JPEG 2000
  • PDF
  • Compound Object
  • Video
  • Audio
  • URL

37
CONTENTdm 5 Top Ten
  • File Transfer
  • Replaced FTP with custom HTTP transfer protocol
  • Uploading items occurs in the background
  • Continue working while items are uploaded
  • Pause process and resume later

38
CONTENTdm 5 File Transfer
39
CONTENTdm 5 Top Ten
  • EAD
  • New import process and display options
  • Custom metadata mapping
  • Full text searching
  • Search term highlighting within the EAD
  • Multiple display views
  • XML Web service

40
CONTENTdm 5 EAD
41
CONTENTdm 5 EAD
42
CONTENTdm 5 Top Ten
  • Capacity
  • Increased capacity throughout application
  • Supports more collections, items for batch
    processing, and metadata fields
  • Expand metadata schemas to incorporate
    preservation metadata or more custom fields
  • Faster batch processing and conversion from
    existing databases

43
Whats up next?
  • The CONTENTdm enhancements most requested now
  • Modern, easy-to-customize viewers for all media
  • Integration of Web 2.0 tools like tagging
  • A modern, interactive User Support Center for the
    community
  • A better metadata harvest to WorldCat

44
Web customization and viewers
  • Goals
  • Lessen the labor required to upgrade
  • Provide a more coherent viewer experience
  • Provide smooth integration with players and
    viewersleverage and improve upon the Web 2.0
    features of worldcat.org (listmaking, tagging,
    etc.)
  • Approach
  • Completely overhaul the web interfaces

45
User Support Center
  • New expert staff
  • Re-design for an interactive community experience
  • Surveyed stakeholders
  • Researching 3rd party platforms
  • for a Fall launch

46
USC re-design goals
  • Make the USC a compelling user community space
  • Create a one-stop-shop for all CONTENTdm
    known-issues, documentation, extensions, etc.
  • Increase user satisfaction
  • Make the USC extensible to support info on all
    the Digital Collection Services products and
    services
  • Provide an interactive space for User Groups to
    meet, plan, record, and share!

47
Improve exposure of digital items on the
WebIntroducing the Digital Collection Gateway
  • Improve access presence for digital collections
  • Synchronize non-MARC metadata with WorldCat
  • Provide self-service tools to drive
    synchronization
  • Available for CONTENTdm collections with
    CONTENTdm 5.1--May 2009

48
(No Transcript)
49
You design the WorldCat.org display of your
metadata
50
You can adjust where source metadata fields
appear in the WorldCat.org display
51
The map from source metadata to WorldCat.org
display for this collection is now changed for
all records being synchronized with WorldCat
52
Digital Collection Gateway
53
Digital Collection Gateway
  • Builds upon fundamental work with XML Web
    Services
  • Available with CONTENTdm 5.1--May 2009 for
    CONTENTdm collections
  • Will extend to support other digital repositories
    in second phase

54
The bigger picture your users want digital
content from many sources (wherever!)
The Web End users retrieve the information they
need
Metadata
Content management server
55
Next step for DCG is to enable other metadata to
WorldCat
  • We know you need many interoperable tools to
    build and manage your organizations repositories
  • CONTENTdm is the foundation for over 1,000
    digital libraries worldwideover 500 licenses
    shared collaboratively
  • CONTENTdm is also fundamental for the OCLC
    Digital Repository
  • Every item is special to someone, the aggregate
    is special to most and the integration with other
    content is valuable to all

56
Thank you!
Write a Comment
User Comments (0)
About PowerShow.com