Title: Evolving Technology for Electronic Printing and Publishing
1 Evolving Technology for Electronic Printing and Publishing
- Richard Marisa
- Cornell University
- rjm2_at_cornell.edu
2 Outline
- EPP Challenges in a period of rapid innovation
- Solutions in place
- Emerging Technology
3 Challenge: Student Printing in Public Labs
- Authentication
- Authorization
- Page Description Languages
- Transport
- Billing
4 Challenge: Printing Finished Documents
- character is often different from casual document
- on demand vs. in bulk
- multiple printshops: local, special, remote
- multiple page description languages
- push vs. pull printing
5 Challenge: Digital Archives
- pull printing
- implementing terms/conditions
- naming document components
- combining multiple document sources
6 Student Printing / Public Labs: Cornell's Net-Print
- replacement for VendaCards
- now ca. 7,000 users
- ca. 1,400,000 paid pages per year
- peaking over 13,000 pages/day
- average document length < 3 pp
- expected to double volume in 1998-99
7 Net-Print Technology
- workstations: native print mechanisms
- authentication: Kerberos
- authorization: SQL database
- page description language: PostScript
- transport: LPD
- job ticket: LPD
8 Page description language (PDL): PostScript
- now common technology
- legacy problems: old printers
- new features cause incompatibilities: color spaces
- hard to do easy things: page counting
- PostScript version 3: PDF, prepress, pull printing, web page for drivers, management
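Page counting is hard because a PostScript file is a program: in general the only exact count comes from interpreting it. A common shortcut, sketched below, assumes the file carries Adobe Document Structuring Conventions (DSC) comments and simply reads them. The function name and the sample document are illustrative, and the result is only as trustworthy as the comments the driver wrote.

```python
import re

def count_pages_dsc(ps_text):
    """Estimate the page count from DSC comments; return None if there are none."""
    # Prefer an explicit "%%Pages: N". A header value of "(atend)" does not
    # match the digit pattern, so the trailer value is the one picked up.
    declared = re.findall(r"^%%Pages:\s*(\d+)", ps_text, flags=re.MULTILINE)
    if declared:
        return int(declared[-1])
    # Otherwise fall back to counting the per-page "%%Page:" markers.
    markers = re.findall(r"^%%Page:", ps_text, flags=re.MULTILINE)
    return len(markers) if markers else None

sample = (
    "%!PS-Adobe-3.0\n"
    "%%Pages: (atend)\n"
    "%%Page: 1 1\nshowpage\n"
    "%%Page: 2 2\nshowpage\n"
    "%%Trailer\n"
    "%%Pages: 2\n"
    "%%EOF\n"
)
page_count = count_pages_dsc(sample)
```

A file with no DSC comments yields None, which is the case where a billing system would have to fall back on interpreting the job or asking the printer.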
9 PDL issue: repurposing
- Simple problem: duplexing
- create a document
- put << /Duplex true >> setpagedevice in the PostScript
- publish the PostScript
- print again simplex (how?)
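One possible answer to the "how?" is to rewrite the device request in the published file before printing. The sketch below is a text-level patch, not a general solution: it assumes the duplex request appears literally in the file as a single << ... >> setpagedevice dictionary, and it will miss requests that are computed, compressed, or wrapped in procedures. The function name is hypothetical.

```python
import re

# Matches "<< ... /Duplex true ... >> setpagedevice" within one dictionary.
DUPLEX_REQUEST = re.compile(
    r"(<<[^<>]*?/Duplex\s+)true(\b[^<>]*?>>\s*setpagedevice)"
)

def force_simplex(ps_text):
    """Return the PostScript text with literal duplex requests switched off."""
    return DUPLEX_REQUEST.sub(r"\1false\2", ps_text)

published = "%!PS\n<< /Duplex true >> setpagedevice\nshowpage\n"
simplex = force_simplex(published)
```

The fragility of this kind of patching is the repurposing problem in miniature: device decisions baked into the page description have to be undone by editing the document itself.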
10 PDL: PDF
- open standard published by Adobe
- integrates with web browser via free reader plug-in
- complicated (or expensive) methods to use advanced features
- similar repurposing problems: PDFmarks, fonts
- forms, byteserving, spot rendering
11 Net-Print Transport: LPD
- out of Berkeley Unix
- long lived, widely used
- TCP based, described in RFC 1179 (1990)
- one file per job, simple job ticket
- originally line printer model, plus other formats (PS, CIF, nroff)
- customized/updated by vendors and users
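As a rough illustration of how simple the RFC 1179 wire format is, the sketch below builds the byte strings a client sends for one job: the "receive a printer job" command, then a control file and a data file, each announced with its length and name and terminated by a zero octet. It does no networking and omits the acknowledgement the server returns after each step. The function name, the queue name "lp", and the job number are placeholders.

```python
def lpd_job_frames(queue, host, job_number, control, data):
    """Return, in order, the byte strings a client would send for one LPD job."""
    cf_name = ("cfA%03d%s" % (job_number, host)).encode("ascii")
    df_name = ("dfA%03d%s" % (job_number, host)).encode("ascii")
    return [
        b"\x02" + queue.encode("ascii") + b"\n",      # receive a printer job
        b"\x02%d %s\n" % (len(control), cf_name),    # receive control file
        control + b"\x00",                            # control file body
        b"\x03%d %s\n" % (len(data), df_name),       # receive data file
        data + b"\x00",                               # data file body
    ]

control_file = b"Hcupid.cit.cornell.edu\nProot\n"
frames = lpd_job_frames("lp", "cupid.cit.cornell.edu", 59, control_file, b"hello\n")
```

The control file is the entire job ticket, which is why anything beyond a banner and a file name ends up in vendor extensions.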
12 Net-Print Challenges
- Large documents on the web in many formats (PDLs): HTML, PDF, Word, ...
- Increasing use of electronic reserves
- Faculty placing course material on the web in lieu of creating course-paks
- Workstations need utilities for pull/push printing
- Large jobs need to be routed to appropriate printers
13 Transport: CUPID
- Consortium for University Printing and Information Distribution
- Delivery of high quality printing services to remote users
- Architecture proposal (1992)
- Successive prototype implementations
14 CUPID: Features and benefits
- a core suite of protocols and network services
- enables a broad range of application specific services
- supports a range of document sources
- supports a range of printers: desktop to advanced function networked printers
15 CUPID: Applications
- print on demand from a digital library
- MOA, J-STOR
- local printing of custom textbooks
- local editions of news reports, business briefings, product reviews
16 CUPID: Capabilities
- manage complexity
- ability to query capabilities, price
- mitigates software distribution
- process multiple PDLs
- finishing options
- print preview (print to archive)
- security, transactions, etc.
17 CUPID: A 3-layer architecture
- publisher's client (user / gui / application)
- originating server (middleware) mediates end-to-end transactions
- printshop client
18 EZ-Publish
- pre-web printing to Docutech
- interface, documents and devices all broke
- re-implementation in CUPID: EZ-Publish II
- Printing to local printers, on-demand printers, digital printshop, off-campus
19 Internet Printing Protocol
- The Printer Working Group (or PWG) is an unincorporated alliance among printer manufacturers, print server developers, operating system providers and print management application developers chartered to make printers and the applications and operating systems supporting them work together better.
20 Internet Printing Protocol
- Focus is to develop requirements and describe a model and semantics for Internet Printing, and ultimately define a new application level Internet Printing Protocol.
- IPP 1.0 protocol submitted as Internet Draft to IETF in January, 1998
21 IPP
- Client / Server protocol
- Transport: HTTP
- PDL: several
- Ticket: binary format, content specification
- Struggling with notification, local printer protocol, universal print drivers
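To make "binary format" concrete, here is a minimal sketch of an IPP request body as it would be carried in an HTTP POST. The field layout and tag values follow the IPP/1.0 encoding as it was later published (version, operation id, request id, tagged attributes, end-of-attributes tag); the January 1998 draft may have differed in detail. The function names and the printer URI are hypothetical.

```python
import struct

def ipp_attribute(value_tag, name, value):
    """Encode one attribute: value tag, name length, name, value length, value."""
    name_b, value_b = name.encode("utf-8"), value.encode("utf-8")
    return (struct.pack(">BH", value_tag, len(name_b)) + name_b
            + struct.pack(">H", len(value_b)) + value_b)

def get_printer_attributes_request(printer_uri, request_id=1):
    """Build a Get-Printer-Attributes request (operation id 0x000B)."""
    header = struct.pack(">BBHI", 1, 0, 0x000B, request_id)  # version 1.0
    body = (
        b"\x01"                                               # operation attributes group
        + ipp_attribute(0x47, "attributes-charset", "utf-8")  # charset
        + ipp_attribute(0x48, "attributes-natural-language", "en")
        + ipp_attribute(0x45, "printer-uri", printer_uri)     # uri
    )
    return header + body + b"\x03"                            # end of attributes

request = get_printer_attributes_request("http://printer.example.edu/ipp")
```

The same attribute encoding carries the job ticket in a Print-Job request, with the document data following the end-of-attributes tag.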
22 Job Ticket: LPD
- Hcupid.cit.cornell.edu
- Proot
- Jjobname
- Ccupid.cit.cornell.edu
- Lroot
- fdfA059cupid.cit.cornell.edu
- UdfA059cupid.cit.cornell.edu
- Nfilename.txt
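Each line of an LPD control file is a one-letter command followed by its operand, so the whole ticket can be decoded with a lookup table. The sketch below covers only the letters that appear on this slide, with meanings taken from RFC 1179; the function name is illustrative.

```python
# Meanings of the control-file letters used on the slide (RFC 1179).
LPD_COMMANDS = {
    "H": "host name",
    "P": "user identification",
    "J": "job name for banner page",
    "C": "class for banner page",
    "L": "print banner page",
    "f": "print formatted file",
    "U": "unlink data file",
    "N": "name of source file",
}

def parse_control_file(text):
    """Split each line into (meaning, operand); unknown letters are kept as-is."""
    return [(LPD_COMMANDS.get(line[0], line[0]), line[1:])
            for line in text.splitlines() if line]

ticket = parse_control_file(
    "Hcupid.cit.cornell.edu\nProot\nJjobname\nCcupid.cit.cornell.edu\n"
    "Lroot\nfdfA059cupid.cit.cornell.edu\nUdfA059cupid.cit.cornell.edu\n"
    "Nfilename.txt\n"
)
```

Nothing in this vocabulary expresses duplexing, paper stock, or accounting, which is the gap the richer tickets on the following slides fill.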
23 Job Ticket: Soleil
- XRXbegin 003.0132
- XRXdisposition PRINT
- XRXprinterName classq
- XRXdeleteSource True
- XRXPDLformat TIFF
- XRXrequirements duplex
- XRXdocumentPaperColors dt8
- XRXpaperType-size 216 279
- XRXcopyCount 1
- XRXtitle Painleve - Math Library
- XRXaccount L70-3700
- XRXjobStartMessage No message supplied
- XRXmessage Raj - 115 White Hall
- XRXsenderName jps21
- XRXxImageShift -5
- XRXsourceFile UNIX/var/spool/xerox/netqreq/CUPID895684211/cu1.0001
24 Job Ticket: CUPID
- opspecs =>
  Creator => 'jps21',
  'Number of Copies' => 1,
  Service => 'Docutech',
  NetID => 'jps21',
  'Account Number' => 'xxxxxxxx',
  'Paper Color' => 'dt8',
  'Document Sequence' => 1,
  Plex => 'Duplex',
  'X Image Shift' => -5,
  'Banner Message' => 'Raj - 115 White Hall',
  'Job Name' => 'Painleve - Math Library',
  'Cover Color' => '',
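Comparing this ticket with the Soleil ticket on the previous slide suggests how a device-independent ticket might be translated for one printshop device. The sketch below is an illustration only: the key mapping is inferred by matching values across the two slides, it is not taken from the CUPID implementation, and the output mirrors the "key value" form shown on the Soleil slide rather than a verified Xerox syntax.

```python
# Mapping inferred from matching values on the two slides (an assumption).
CUPID_TO_XRX = [
    ("Job Name", "XRXtitle"),
    ("Number of Copies", "XRXcopyCount"),
    ("Account Number", "XRXaccount"),
    ("Paper Color", "XRXdocumentPaperColors"),
    ("Plex", "XRXrequirements"),
    ("X Image Shift", "XRXxImageShift"),
    ("Banner Message", "XRXmessage"),
    ("NetID", "XRXsenderName"),
]

def to_xrx_lines(opspecs):
    """Translate a CUPID-style option dictionary into Soleil-style ticket lines."""
    lines = []
    for cupid_key, xrx_key in CUPID_TO_XRX:
        value = opspecs.get(cupid_key, "")
        if value == "":
            continue  # unset options are simply omitted
        if cupid_key == "Plex":
            value = str(value).lower()  # 'Duplex' appears as 'duplex' on the Soleil slide
        lines.append("%s %s" % (xrx_key, value))
    return lines

opspecs = {
    "Creator": "jps21", "Number of Copies": 1, "Service": "Docutech",
    "NetID": "jps21", "Account Number": "xxxxxxxx", "Paper Color": "dt8",
    "Document Sequence": 1, "Plex": "Duplex", "X Image Shift": -5,
    "Banner Message": "Raj - 115 White Hall",
    "Job Name": "Painleve - Math Library", "Cover Color": "",
}
ticket_lines = to_xrx_lines(opspecs)
```

Keeping the translation table in the middleware is one way the originating server can shield the publisher's client from device-specific ticket formats.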
25 Job Ticket: Adobe PJTF
- PDF format
- Documents
- Page Layout
- signature of sheets to image upon
- Stand alone or embedded in PDF files
26 Job Ticket: Net-Paper
27 XML
- eXtensible Markup Language
- HTML done right, SGML lite
- W3C recommendation (ca. Jan 98)
- <presenter>Rich Marisa</presenter>
- <presentation audio="no">
- Electronic Printing and Publishing
- </presentation>
- <authsys program="MSPowerPoint" />
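Markup like the sample on this slide can be read with any XML parser. The sketch below uses Python's standard library; the enclosing slide element is an assumption added so that the three fragments form a single well-formed document.

```python
import xml.etree.ElementTree as ET

# The <slide> wrapper is hypothetical; XML requires exactly one root element.
doc = """<slide>
  <presenter>Rich Marisa</presenter>
  <presentation audio="no">
    Electronic Printing and Publishing
  </presentation>
  <authsys program="MSPowerPoint" />
</slide>"""

root = ET.fromstring(doc)
presenter = root.findtext("presenter")            # element content
presentation = root.find("presentation")
audio = presentation.get("audio")                 # attribute value
title = presentation.text.strip()
program = root.find("authsys").get("program")     # attribute on an empty element
```

Element names here carry the meaning of the data, which is the main difference from HTML's fixed, presentation-oriented tag set.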
28 XML is for?
- creating extensions to HTML
- object data marshalling
- representing hierarchic (non-flat) data records
- e-commerce transactions
- protocol implementation, content markup, metadata, structure, extended link information
29 Documents
- content
- metadata
- structure
- ancillary data: reviews, commentary, citations, hyperlinks
30 XML Document Type Definitions
- what elements and/or attributes are valid?
- Validating parsers
- DTDs provide the rules
- (TITLE, AUTHOR, SUMMARY, NOTE?)
- valid values
- boilerplate content
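A content model such as (TITLE, AUTHOR, SUMMARY, NOTE?) says that the children must appear in that order and that NOTE is optional. Python's standard library parser does not validate against DTDs, so the sketch below imitates just this one rule by turning the sequence of child element names into a string and matching it against a regular expression. The DOC root element and the function name are assumptions for the example.

```python
import re
import xml.etree.ElementTree as ET

# (TITLE, AUTHOR, SUMMARY, NOTE?) expressed over a comma-terminated name list.
CONTENT_MODEL = re.compile(r"TITLE,AUTHOR,SUMMARY,(NOTE,)?")

def children_match_model(element):
    """True if the element's children satisfy the content model, in order."""
    sequence = "".join(child.tag + "," for child in element)
    return CONTENT_MODEL.fullmatch(sequence) is not None

ok = children_match_model(
    ET.fromstring("<DOC><TITLE/><AUTHOR/><SUMMARY/><NOTE/></DOC>"))
missing_author = children_match_model(
    ET.fromstring("<DOC><TITLE/><SUMMARY/></DOC>"))
```

A validating parser applies rules of this kind for every element in the DTD, along with attribute value checks and default (boilerplate) content.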
31 XML
- currently supported in IE 4.0
- to be in Netscape 5.0
- several Java, Python and C-based parsers and utilities are available
- full support soon in Perl 5
- Unicode
- assists automated search tools
- needed: well-known DTDs
32 MathML Content Markup: H multiplied with e
- H e
- H e
- Element
- 1644 notation
- sharps / flats for maxima/minima
- ax + b
- <apply><plus/>
- <apply><times/>
- <ci> a </ci>
- <ci> x </ci>
- </apply>
33 MathML: (a + b)^2, Content and Presentation
- <apply> <power/> <apply> <plus/> <ci>a</ci> <ci>b</ci> </apply> <cn>2</cn> </apply>
- <msup> <mfenced> <mrow> <mi>a</mi> <mo>+</mo> <mi>b</mi> </mrow> </mfenced> <mn>2</mn> </msup>
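Content markup encodes what the expression means, so a program can compute with it, whereas presentation markup only says how to draw it. As a small sketch of that difference, the evaluator below walks the content form of (a + b)^2. It handles only the plus, times, and power operators used on these slides, and the function name and variable bindings are illustrative.

```python
import xml.etree.ElementTree as ET

def evaluate(node, variables):
    """Evaluate a tiny subset of MathML content markup."""
    if node.tag == "cn":                      # numeric constant
        return float(node.text)
    if node.tag == "ci":                      # identifier, looked up by name
        return variables[node.text.strip()]
    if node.tag == "apply":                   # first child is the operator
        operator, *operands = list(node)
        values = [evaluate(child, variables) for child in operands]
        if operator.tag == "plus":
            return sum(values)
        if operator.tag == "times":
            product = 1
            for value in values:
                product *= value
            return product
        if operator.tag == "power":
            return values[0] ** values[1]
    raise ValueError("unsupported element: " + node.tag)

content = ("<apply><power/><apply><plus/><ci>a</ci><ci>b</ci></apply>"
           "<cn>2</cn></apply>")
result = evaluate(ET.fromstring(content), {"a": 1, "b": 2})
```

The presentation form of the same expression carries no operator semantics, so it could be typeset or spoken but not evaluated this way.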
34 PGML: Precision Graphics Markup Language
- 2D graphics language that provides precise control of layout, fonts, color and printing
- specification that enables Web users to readily and reliably post, control and interact with graphics on the Web
- PDF in XML: scalable, smaller, faster than raster images
35 PGML
- submitted to the W3C by Adobe, IBM, Netscape and Sun
- reference code implemented in Java
- imaging model used by PostScript, PDF, and the Java 2D API
- Java may quickly go from a poor printing platform to one of the best
36 PGML
- <?XML version="1.0"?>
- <!DOCTYPE PGML SYSTEM "pgml1.0.dtd">
- <pgml boundingbox="0 0 300 300">
- <path fill="1" fillcolor="100 0 0">
- <moveto x="100" y="100"/>
- <lineto x="200"/>
- <lineto y="200"/>
- <lineto x="100"/>
- <lineto y="100"/>
- </path>
- </pgml>
37 PGML
- If the workstation is doing the ripping, what will the printer look like?
- If the printer is pulling documents, what will document archives look like?
- If job tickets are in the documents
- If...
38 Dienst 5
- open, distributed digital library
39 Dienst structure metadata
- <?xml version="1.0" ?>
- <structure>
- <view type="volume">
- <title>Georgetown Law Journal</title>
- <date>1912-1913</date>
- <div type="titlepage">
- <display>Title Page</display>
- <pageimage id="1" />
- <pageimage id="2" />
- </div>
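Structure metadata of this kind lets a print service map a logical part of a volume to the page images it needs to pull. The sketch below reads the excerpt with Python's standard library. The slide shows only the start of the document, so the closing view and structure tags are added here as an assumption to make the fragment parse.

```python
import xml.etree.ElementTree as ET

# Closing </view> and </structure> are assumed; the slide excerpt stops at </div>.
fragment = """<?xml version="1.0" ?>
<structure>
  <view type="volume">
    <title>Georgetown Law Journal</title>
    <date>1912-1913</date>
    <div type="titlepage">
      <display>Title Page</display>
      <pageimage id="1" />
      <pageimage id="2" />
    </div>
  </view>
</structure>"""

root = ET.fromstring(fragment)
volume_title = root.findtext("view/title")
# Map each division's display name to the ids of its page images.
pages_by_div = {
    div.findtext("display"): [page.get("id") for page in div.findall("pageimage")]
    for div in root.iter("div")
}
```

With a table like pages_by_div, a request to print one division becomes a list of page images to fetch and assemble into a job.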
40 Example: Hunter
- Digital library project
- Collaborative with
- William S. Hein, Inc.
- Cornell Law Library
- Technology
- Dienst
- XML (based on e-bind dtd)
- CUPID
41 Document Object Model
- evolving technology used largely in client side scripting
- makes every paragraph, image, style, even every character a distinct, individual entity
- address, manipulate, change, respond to event handlers
42 MS Style Sheet Printing Extension
- coming soon to a browser near you
- style applied to printing only
- <STYLE MEDIA="PRINT">
- properties
- page-break-after
- page-break-before
- values: auto, always, left, right
43 Summary
- Covered several EPP challenges
- Looked at technologies in place
- Looked at a few emerging technologies
- Wow, there are a lot of them!
- Look at / for deconstructions
44 Some Projects at Cornell
- Net-Print (student printing)
- Hunter (Dienst)
- EZ-Publish II (CUPID)
- Net-Paper Collaboration (XML)