Title: Fourth Annual Midwest CONTENTdm Users Group Meeting
1Fourth Annual Midwest CONTENTdm Users Group
Meeting
2CONTENTdm Directions
Purdue West Lafayette, Indiana March 19, 2009
- Geri Ingram
- OCLC Digital Collection Services
- User Services Manager
3Information--looking for grounding in Web 2.0
- No wonder we talk about information space
- Porous boundaries
- no strictly personally interesting info
- no completely professional info
- If I cant find it, I didnt park it right. (tag
it, catalog it, file it)? - If I cant find it, I cant get it, or use it
(share it).
4Striking our roots
- Ranganathan 1892-1972, Bangalore, Indiaknown as
the father of library science in India. A
mathematician, epistemologist and librarian - Five laws of library science
- Books are for use.
- Every reader his or her book.
- Every book its reader.
- Save the time of the User.
- The library is a growing organism.
5Saving the time of the user, at 21st century speed
- Most of our information transfer is
digitalflying at the speed of light - Where did I see (hear) that?
- What was the context?
- How does it relatebecause it ALL relates!
- Who needs to know this?
- How can I share it?
6Ranganathans book
- Media, education, public service
- Were all in the same business, helping people
find, get, and use informationTHEIR information - WHEN they need it, regardless of where its
found, or how its requested. - Ranganathans book might be an image
7Every reader his or her book.
- So, too, every digitized item is special to
someone - The aggregate is special to most
- And its integration with other complementary
content is valuable to all
8Whats important to you today?
- The state of the software were here to discuss
- Have we been listening?
- What is the significance of Release 5?
- Are we still listening?
- The future of the software
- Will we keep listening?
9From its inception CONTENT(dm) has been a
response to searchers needs
- From its roots in bio-medical engineering
- To Libraries special collections
- To integrated digital collections for research
and teaching - To a globally created and accessed multi-media
repository - CONTENTdm developers have listened to users
- Were still listening
- Were putting in processes to insure that we
listen - Well, and for the long haul
10CONTENTdm upgrades for users
3.3Jul 2002
3.1May 2001
4.0Jun 2005
3.5Jul 2003
3.7 3.8 Jul 2004
4.1Mar 2006
3.4Jan 2003
3.6Feb 2004
3.2Feb 2002
4.3Oct 2007
4.2Dec 2006
3.5.1Sept 2003
4.0.1Oct 2005
- Version 3.4 Jan 2003
- OAI support for harvesting of metadata
- Multi-Site Server
- Version 3.5 Jul / Sep 2003
- WorldCat link for metadata harvesting
- Customizable Web Templates
- Version 3.6 Feb 2004
- Batch Add wizard
- Advanced Search has a new interface and added
functionality - User Support Center introduced
- Version 3.7 3.8 Jul 2004
- Zoom and pan toolbar for viewing images Compare
button - Tab-delimited Text Import
- Collection and item-level security
- Web browser-based editor
- Version 3.8
- JPEG2000
- Version 4.0 Jun / Oct 2005
- EAD support
- PHP-based API which supports broader
customizations and interoperability - User Interface based on the new PHP API
- OCR Extension with ABBYY FineReader
- Version 4.1 Mar 2006
- Redirects for obsolete URLS
- PHP 5.0 supported
- Version 4.2 Dec 2006
- Multiple Compound Object Wizard
- OCR Extension upgraded to ABBYY FineReader
Version 8 - Ability to highlight, view and clip individual
newspaper articles - Search and browse by date range
- Version 4.3 Oct 2007
- Easier to manage and access PDF documents,
especially multiple-page PDFs - Improved controlled vocabulary
- OCLC Connexion digital import
11We have a long history of enhancing the software
in response to our users requirements
- 2003 Librarians said, What about
interoperability? What about providing federated
searching? - And I need to brand my collections for disparate
user groups! - 2004 We need to move more data, faster I need
to build big text collections, fast!. - and My users want more search functionality, and
the ability to manipulate the display images - And, My users want to see high-resolution images
but they dont want to install plug-ins! I dont
want to get tied to proprietary formats, but I
dont want to expose my valuable source images
either.
12And in 2005, you added
- I need to protect some of my data from access
by some of my users - I need to be able to maintain my metadata
through a web browser. - We need to provide our finding aids onlinewhat
do you have? - And, We want more control over our interface!
13In 2006, librarians said
- Loading a book is an improvement, but I need to
load whole libraries! - And My users are interested in newspapersgive
me some desktop OCR and a good searching/highlight
mechanism - And, When are you going to fix those darn DATES?
14By 2007 we knew we had to find a way to search
the PDF faster, and completely
- You said, My researchers need to find every
instance of a phrase across thousands of
documents, and they dont want to wait! - And, help my catalogers mainstream the workwe
want to leverage our existing MARC records - By 2008, we finally got back to those EAD finding
aids - And to Unicode
- And the need for faceted searching, with
relevancy ranking.
15We are listening
- As your digital library programs have grown, so
have your needs for tools - Stewarding the materials through the entire
life-cycle - Helping every book (to find) its reader
16 OCLC Digital Collection Services Trends in
library digital collections
- Aggregation and integration of digital content is
important - Surface/expose collections in common and familiar
discovery tools - search engines and aggregators
- make collections radically accessible - Universal search - the ability to search digital
collections and other electronic resources
through the same user interface rather than
specialized sites -
17We are listeninghow CONTENTdm fits into the
bigger picture
- Build a digital repository within the OCLC
cooperative - A rich set of digital collections created by
libraries, museums, and other cultural heritage
organizations - Linked through the global discovery of WorldCat
- As of March 3rd, 1.4 million records (titles)
- 1.4 billion holdings
18OCLC Digital Collection Services
- Solutions to help you to create, manage,
share and preserve your digital collections - Digitization
- CONTENTdm
- Hosting Services
- Web Harvesting
- WorldCat Harvesting
- Digital Archive
19CONTENTdm Version 5 Released December 2008
For more information about Version 5
www.oclc.org/news/releases/20093.htm
20CONTENTdm 5
- Milestone release
- Released December 17th, 2008
- Significant changes throughout the software
- Sets foundation for future enhancements that will
continue to further CONTENTdms use as the
leading digital repository platform
21State of the releasewe are listening!
- HUGE diversity of environments and uses
- First service pack released February (5.0.1)
- Second service pack to be released by tomorrow
(5.0.2) - fixes some serious and many just irritating
problems! - There may be a third service pack in April.
- There will be a version release 5.1 in May.
- Scheduledual systemsmigration
- Waiver of EULA restriction
22CONTENTdm 5 Top Ten
- Unicode Support
- Full support of Unicode for importing, storing,
displaying and searching Unicode languages - OCR language support expanded 184 languages
- Supports the creation and exposure of digital
collections in any language
23CONTENTdm 5 Top Ten
- Find Search Engine
- Find search engine integrated into CONTENTdm
software - More robust capacity and the ability to offer
additional search features - Relevancy sorting
- Faceted searching
- Spelling suggestion
- Unicode searching
24CONTENTdm 5 Top Ten
- Find Search Engine
- Leveraging existing OCLC technology by
integrating the Find search engine - Search in any language
- More tools to help end-users find what they are
looking for, faster - Better end-user experience
25CONTENTdm 5 Top Ten
- Controlled Vocabularies
- Integration with OCLC Terminologies Service
- Providing nine new thesauri for CONTENTdm users
- Adds efficiency to collection building by
providing pre-loaded thesauri for cataloging
26CONTENTdm 5 Top Ten
- Reports
- More robust, scalable reporting module integrated
into software - Provides expanded reports
- Views by collection and item
- Top searches within CONTENTdm
- Web statistics by month, day, hour
- Top URLs, errors, referring sites, IP addresses,
authenticated users, browsers, and countries - Access to log files
- Export CONTENTdm reports
27CONTENTdm 5 Top Ten
- Flexible Workflows
- Added more options for approving and indexing
items - New batch and subset handling of pending items
- One-click approve index on demand
- Scheduling options for approve and index
- Background processing
28CONTENTdm 5 Top Ten
- Registration
- New registration process added during
installation - One-click sends server information to OCLC
- Registered servers called once a month to gather
data on usage - FEEDBACK!
- User Support Center is being completely
overhauled now!
29CONTENTdm 5 Top Ten
- Project Client
- New client application replaces old version
- New programming language
- New, more intuitive interface
- Unicode support
- More robust
- And many other enhancements
30CONTENTdm 5 Project Client
31Some notes from the developer--Project Client
Goals
- Update Technology
- Increase throughput
- Improve editing capability
- Expand and improve support for different data
types
32Update Technology
- .NET 3.0
- Build on top of newest Microsoft technology to
build modern application - HTTP file transfer
- Do away with old technologies FTP and SFTP
- Transfer small packets
- Easy to configure
- Robust
33Why? Increase Throughputover the Acquisition
Station
- Acquisition Station
- Not optimized for multi processors
- Editing one at a time
- Long wait time for imports and uploads
- Maximum of 5000 items
- Pull down 100 items at a time for editing
34Parallel Processing
- Decrease time waiting for operations to complete
- Maximize use of multi processor machines
- Network transfers asynchronously
- Upload asynchronously
35Parallel Processing Multi Processor Systems
- Project Client is optimized to support this
architecture - Utilizes .NET threading technologies to scale
well between 1 and n CPUs - Uses thread to move tasks to background and allow
parallel work - Upload Manager
- Background data checking
36Template Creator
- We added a hierarchal structure with the ability
to turn on and off templates based on data types - General
- Images
- TIF
- JPEG
- JPEG 2000
- PDF
- Compound Object
- Video
- Audio
- URL
37CONTENTdm 5 Top Ten
- File Transfer
- Replaced FTP with custom HTTP transfer protocol
- Uploading items occurs in the background
- Continue working while items are uploaded
- Pause process and resume later
38CONTENTdm 5 File Transfer
39CONTENTdm 5 Top Ten
- EAD
- New import process and display options
- Custom metadata mapping
- Full text searching
- Search term highlighting within the EAD
- Multiple display views
- XML Web service
40CONTENTdm 5 EAD
41CONTENTdm 5 EAD
42CONTENTdm 5 Top Ten
- Capacity
- Increased capacity throughout application
- Supports more collections, items for batch
processing, and metadata fields - Expand metadata schemas to incorporate
preservation metadata or more custom fields - Faster batch processing and conversion from
existing databases
43Whats up next?
- The CONTENTdm enhancements most requested now
- Modern, easy-to-customize viewers for all media
- Integration of Web 2.0 tools like tagging
- A modern, interactive User Support Center for the
community - A better metadata harvest to WorldCat
44Web customization and viewers
- Goals
- Lessen the labor required to upgrade
- Provide a more coherent viewer experience
- Provide smooth integration with players and
viewersleverage and improve upon the Web 2.0
features of worldcat.org (listmaking, tagging,
etc.) - Approach
- Completely overhaul the web interfaces
45User Support Center
- New expert staff
- Re-design for an interactive community experience
- Surveyed stakeholders
- Researching 3rd party platforms
- for a Fall launch
46USC re-design goals
- Make the USC a compelling user community space
- Create a one-stop-shop for all CONTENTdm
known-issues, documentation, extensions, etc. - Increase user satisfaction
- Make the USC extensible to support info on all
the Digital Collection Services products and
services - Provide an interactive space for User Groups to
meet, plan, record, and share!
47Improve exposure of digital items on the
WebIntroducing the Digital Collection Gateway
- Improve access presence for digital collections
- Synchronize non-MARC metadata with WorldCat
- Provide self-service tools to drive
synchronization - Available for CONTENTdm collections with
CONTENTdm 5.1--May 2009
48(No Transcript)
49You design the WorldCat.org display of your
metadata
50You can adjust where source metadata fields
appear in the WorldCat.org display
51 The map from source metadata to WorldCat.org
display for this collection is now changed for
all records being synchronized with WorldCat
52Digital Collection Gateway
53Digital Collection Gateway
- Builds upon fundamental work with XML Web
Services - Available with CONTENTdm 5.1--May 2009 for
CONTENTdm collections - Will extend to support other digital repositories
in second phase -
54The bigger picture your users want digital
content from many sources (wherever!)
The Web End users retrieve the information they
need
Metadata
Content management server
55Next step for DCG is to enable other metadata to
WorldCat
- We know you need many interoperable tools to
build and manage your organizations repositories - CONTENTdm is the foundation for over 1,000
digital libraries worldwideover 500 licenses
shared collaboratively - CONTENTdm is also fundamental for the OCLC
Digital Repository - Every item is special to someone, the aggregate
is special to most and the integration with other
content is valuable to all
56Thank you!