Title: Moving to PANDAS 3
1- Moving to PANDAS 3
-
- Future-proofing Institutional Websites
- 19-20 January 2006
- London
- Matthew Walker
- Deputy Director, Collection Infrastructure
- IT Division
- National Library of Australia
2Background
- NLA archiving web sites since October 1996.
- Processes based on a toolkit of public domain
software. - Need for management tool.
- requirements specification began in 1998.
- No suitable off-the-shelf products available at
the time. - Development work commenced in 1999.
- System named PANDAS.
3Definitions
- PANDORA the NLAs archive of website
resources. - PANDAS the NLAs software for managing the
process of gathering, archiving and publishing
website resources. - PANDORA website the delivery mechanism for the
PANDORA archive.
4About PANDAS
- Pandas 1 released in June 2001.
- Pandas 2 released in August 2002.
- Pandas 3 redevelopment began September 2004,
after limitations/issues with Pandas 2 were
identified and examined. - 1st release scheduled March 2006.
5PANDAS 2
- Title/resource management
- Nominating/Identifying
- Selecting
- Gathering
- Processing
- Archiving
- Publishing
- Cataloguing
- Permissions/Restrictions
6PANDAS 2
- Add/edit
- users.
- agencies/institutions.
- collections.
- subject categories.
- Manage display of information on PANDORA delivery
website.
7PANDAS 2 Broad Software Architecture
8PANDAS 2 System Architecture
Web Server
WebDAV
WebObjects Application Server
Resolver Service
Lucene Full-text Search Service
Gatherer
Scripter
PANDAS
Notifier
Restricter
Access Restricter
PANDORA
Reporter
HTTrack
Digital Object Storage System (DOSS)
9PANDAS 3 System Architecture
Web Server
WebDAV
Java Application Server (Tomcat)
Resolver Service
Lucene Full-text Search Service
Gathering Web Service
Pandas Management
Pandas Delivery
HTTrack
Digital Object Storage System (DOSS)
10PANDAS 3 Broad Software Architecture
11PANDAS 3 Broad Software Architecture
- PandasManagement functionality to support
identification, selection, archiving, cataloguing
and publication of titles/resources. Reporting is
also associated with this system. - Gathering Web Service gathering/crawling and
file system manipulation. - PandasDelivery display/delivery of published
archived resources.
12Pandas 3 Whats the difference?
- More robust.
- Implementation consistent with Apple WebObjects
standards. - Improved data model.
- Faster operation.
- More efficient implementation of business logic.
- New user interface focused on core workflows.
- New worktray concept.
13Pandas 3 Whats the difference?
- All gather processing related functionality
incorporated into Gathering Web Service. - Web Services implementation instead of RMI.
- Database knowledge removed from processing
scripts. - Restrictions notifications integrated into
PandasManagement application. - Restrictions notifications are implemented as
modules.
14Pandas 3 Whats the difference?
- Can run in standard servlet container/Java
application server (e.g. Tomcat). - Previously required Apple WebObjects application
server. - Improved error handling.
- Easier to determine source of problems.
-
-
-
15PANDAS 2 Technologies
- Application Suite
- Java
- Apple WebObjects API
- RMI
- Perl
- Lucene
- System Environment
- Apache web server
- Apple WebObjects application server
- WebDAV
- Oracle
16PANDAS 3 Technologies
- Application Suite
- Java
- Apple WebObjects API
- SOAP XML web services
- Perl
- Lucene
- System Environment
- Apache web server
- Tomcat Java application server
- WebDAV
- Oracle
17PANDAS 2 Login Page
18PANDAS 3 Login Page
19PANDAS 2 Homepage
20PANDAS 3 Homepage
21PANDAS 3 Homepage (a closer look)
Side Menu Bar
Status Bar
Search Bar
22PANDAS 3 Homepage (a closer look)
23PANDAS 2 Create Title
24PANDAS 3 Add Title
25PANDAS 2 - Search
26PANDAS 3 Search
27PANDAS 2 View Title
28PANDAS 3 View Title
29PANDAS 2 Edit Title
30PANDAS 2 Edit Title
31PANDAS 2 Edit Title
32PANDAS 2 Edit Title
33PANDAS 3 Edit Title
34PANDAS 2 Gather Queue
35PANDAS 3 Gather Worktray
36Future functionality
- Title hierarchies
- Instance-level restrictions
- Instances as subset of whole domain snapshot
- More automation
- Permission request processes
- Bulk processing and review
- Gather schedule management with priorities
- Integration of Heritrix as gatherer/crawler
- Abstraction of user/institution info and
authentication
37How do I use it?
- Run locally.
- Apache
- Tomcat
- Java 1.4.2
- WebObjects 5.2 Java libraries and deployment
licence - Free with Mac OS X server.
- Purchase from Apple Store (492.77).
- Perl
- Lucene
- Ask UKWAC to archive your website!