Title: Designing Intranet Information Services IIS 2
 1Designing Intranet Information Services (IIS) - 2
T.B. RajashekarNational Centre for Science 
InformationIndian Institute of ScienceBangalore 
 560 012 (EMail raja_at_ncsi.iisc.ernet.in) 
 2We covered in Part 1
- Goals of IIS 
 - Design elements - 1 
 - Information auditing 
 - User community and their preferences 
 - Infrastructure and resources 
 - IIS architecture 
 - Content storage and processing
 
  3We will now discuss
- Design elements  2 
 - Web style guide 
 - Metadata 
 - Subject organization 
 - Resource type access 
 - Database driven approach for content management 
 - Customisation/ personalization 
 - Relevance of XML 
 - Resources
 
  4Goals of IIS
- Managed publishing and hosting of content 
 - What, who, where, why, how 
 - Integration of internal/ external, fee/ free 
content  - Integration of legacy content 
 - Structured and controlled access to content 
 - Usage monitoring 
 - Training and maintenance
 
  5User Community
Information Assets
Infrastructure  Resources
IISDesign Components
Content Storage  Processing
Subject Organization
IIS Architecture
Database Approach
Web Style Guide
Metadata
Personalization 
 6Design Elements Web Style Guide
- Essential for all large sites  best practice for 
content creation, publishing and maintenance  - Gives consistent and uniform look to all the 
sites  - To be followed by authors and editors 
 - To be used by programmers for automatic 
generation of pages 
  7Design Elements Web Style Guide  Typical 
Contents
- Page design  HTML coding 
 - Language usage 
 - Navigational elements 
 - Graphics 
 - Fonts, size, colors
 
- Graphics 
 - SSI 
 - Templates 
 - CSS 
 
  8Design Elements Web Style Guide
- Examples 
 - ATT.COM 2001 Design System Guide 
(http//www.att.com/style/)  - BCR Web Site Style Manual (Bibliographical Center 
for Research, USA) http//www.bcr.org/stylemanual/
)  - NYC.GOV Style Guide http//www.ci.nyc.ny.us/html/s
tyle/ 
  9Design Elements Metadata
- Data about other data (e.g. library catalog 
record, bibliographic record)  - Provides context to content 
 - A metadata record consists of a set of 
attributes, or elements, necessary to describe an 
information source  - Basis for resource discovery
 
  10Author Title .Subject Source type 
Format .
META DATA
Documents
Audio
Author Title .Subject Source type 
Format .
Video
Database
Author Title .Subject Source type 
Format .
Digital Content
Print sources 
 11Design Elements Metadata
- Metadata may carry different information about 
the resource  - Descriptive, administrative, rights management, 
security and authentication, etc.  - Applicable for all types of digital content 
 - An international standard Dublin Core 
 - Widely adopted on Internet/ intranets
 
  12Design Elements Metadata
- Dublin Core 
 - International standard for describing electronic 
information resources  - Consists of 15 elements, each repeatable, none 
mandatory  - Conceived in 1994 
 - Has reached standard status  W3C, NISO, ISO 
 - Widely used in several projects around the world 
 - Being refined further
 
  13Design Elements Metadata
DC Metadata element set
- Title 
 - Author/Creator 
 - Subject /Keywords 
 - Description 
 - Publisher 
 - Other Contributor 
 - Date 
 
- Resource Type 
 - Format 
 - Resource Identifier 
 - Source 
 - Language 
 - Relation 
 - Coverage 
 - Rights Management 
 
  14Design Elements Metadata
- Case study Weyerhaeuser (A forest products 
company) intranet  10,000 employees  25,000 
remote locations  - Adopted DC for corporate intranet in 1998 
 - Goals 
 - To provide in-depth, consistent access to 
information on Weyerhaeuser intranet 
  15Design Elements Metadata
- Developed a 19 field metadata scheme 
 - Bibliographic, management and descriptive 
information  - Scope Use in HTML pages (via META tag)ltMETA 
NAMEWY.fieldname CONTENTfieldcontentgt  - Implementation 
 - Metadata generator  HTML form  CGI script 
 - Mechanism for incorporating the generated 
metadata into HTML pages 
Study the extracts given in your course material 
 16DC in HTML
- lthtmlgtltheadgt 
 - lttitlegtUKOLN Home Pagelt/titlegt 
 - ltmeta name"DC.Title content"UKOLN UK Office 
for Library and Information Networking"gt  - ltmeta name"DC.Subject" content"national centre, 
network information support, library community, 
awareness, research, information services, public 
library networking, bibliographic management, 
distributed library systems, metadata, resource 
discovery, conferences, lectures, workshops"gt  - ltmeta name"DC.Description" content"UKOLN is a 
national centre for support in network 
information management in the library and 
information communities. It provides awareness, 
research and information services"gt  - ltmeta name"DC.Creator" contentUKOLN 
Information Services Group"gt  - lt/headgt 
 - ...
 
  17Design Elements Metadata
- How do we use the metadata? 
 - Embed with the content 
 - Possible only with HTML content 
 - Store as records in a database separate from the 
sources, each record describing a source  - Ideal will support all content, facilitate 
search, better content management and 
personalization  
  18Design Elements Subject Organization
- Essential for improving quality of search 
 - List of controlled subject terms/ categories  
used in subject/ keyword fields  - Simple list 
 - Hierarchical  useful for browsing (as in Yahoo!) 
 - Notation-based (e.g. ACM) 
 - Thesaurus-based
 
  19Design Elements Resource Type
- Apart from subject, users often access resources 
by resource type  - Databases, directories, e-journals, patents, 
courseware, etc.  - Useful to support this  via metadata 
 - Can be combined with subject, to support 
cross-listing of resources 
  20Design Elements Database Approach
- Information about each individual document, 
database (multi-record resources) is kept as a 
record in the meta database  - Advantages 
 - Web-based content management 
 - Improved searching and resource discovery 
 - Dynamic and consistent output presentation 
 - Easy maintenance  redesign becomes easy 
 - Usage patterns can be captured easily 
 - Customisation is easier
 
  21Design Elements Database Approach
- This is not possible with static HTML pages 
 - How does it work? 
 - Middleware applications for content management, 
search  retrieval, and output generation  - Downside 
 - Development and maintenance of the database and 
the applications  - Integration into the workflow 
 - User training
 
  22Design Elements Database Approach 
 23Design Elements Database Approach 
 24Database driven content management tool - Example 
 25Database driven content management tool - Example 
 26Database driven content management tool - Example 
 27Design Elements Customisation  Personalization
- Most users use only a small number of resources 
 - Allow them to set up a profile of preferred 
sources  - These become the default menu picks when they 
visit next time  - Easy to handle in a meta database-driven approach 
 - Support for customizing look-and-feel of the site
 
  28Design Elements Customisation  Personalization
- How is this achieved? 
 - User id/ login 
 - Cookies 
 - Client IP 
 - User profiles can also be used for push based 
delivery of content  - Example E-TOC and HPIS services of NCSI 
 - Users create/edit the profiles using web-based 
forms 
  29An example Push service 
 30An example Push service (profile set up) 
 31An example Push service (modification) 
 32Relevance of XML
- XML  Extensible Markup Language 
 - Designed for the web  for data exchange and as a 
neutral data format  - Simplified subset of Standard Generalized Markup 
Language (SGML)  - Formally ratified as a W3C standard in February 
1998  - Specification provides a set of grammar and 
syntax rules for describing the structure of data 
  33Relevance of XML
- Separates content from presentation 
 - Supports marking up of structural components of a 
document  - We can define our own tags 
 - ASCII of the Web  OS independent  create once 
and use in different ways  - Quality searching 
 - Can search for field-based content, not just any 
content  - computer (name/model) price lt 700 
 - Enables data interchange and sharing between 
applications  
  34Relevance of XML
- What does it mean to use XML for intranet content 
management?  - Use XML as the native format for content 
 - XML content creation/ publishing tools 
 - Use HTML as the rendering language 
 - Browsers understand HTML 
 - Use style sheets for rendering XML in HTML 
 - Use XML for data exchange
 
  35XML Document Structure
- All XML documents start with the XML declaration 
lt?xml version"1.0"?gt  - DTD (Document Type Definition) (part of document 
type declaration in prolog), provides the 
definition for the XML documents, and defines 
the document hierarchy, elements, tags and 
syntactic rules for the document structure  - DTD can accompany XML documents or reside outside 
the documents  - Lets take an example and see the two 
possibilities, using IE browser (version 5.0 or 
above) (adbook.xml)  
  36XML Document With External DTD
XML Declaration
 lt?xml version"1.0"?gtlt!DOCTYPE addressbook 
SYSTEM "adbook.dtd"gtltaddressbookgt ltperson 
id"B.WALLACE" gender"male"gt ltnamegt 
 ltfamilygtWallacelt/familygt 
ltgivengtBoblt/givengt lt/namegt 
ltemailgtbwallace_at_megacorp.comlt/emailgt 
ltlink manager"C.TUTTLE"/gt lt/persongt 
ltperson id"C.TUTTLE" gender"female"gt 
ltnamegtltfamilygtTuttlelt/familygtltgivengtClairelt/givengt
lt/namegt ltemailgtctuttle_at_megacorp.comlt/email
gt ltlink subordinates"B.WALLACE"/gt 
lt/persongtlt/addressbookgt
Body 
 37DTD Stored in adbook.dtd File
lt!-- DTD for a simple address book --gtlt!ELEMENT 
addressbook (person)gtlt!ELEMENT person 
(name,email,link?)gtlt!ATTLIST person id ID 
REQUIREDgtlt!ATTLIST person gender (malefemale) 
IMPLIEDgtlt!ELEMENT name (family,given)gtlt!ELEMENT
 family (PCDATA)gtlt!ELEMENT given 
(PCDATA)gtlt!ELEMENT email (PCDATA)gtlt!ELEMENT 
link EMPTYgtlt!ATTLIST link manager IDREF IMPLIED 
subordinates IDREFS IMPLIEDgt 
 38XML Document With Accompanying DTD
lt?xml version"1.0"?gtlt!DOCTYPE addressbook 
lt!ELEMENT addressbook (person)gtlt!ELEMENT 
person (name,email,link?)gtlt!ATTLIST person id 
ID REQUIREDgtlt!ATTLIST person gender 
(malefemale) IMPLIEDgtlt!ELEMENT name 
(family,given)gtlt!ELEMENT family 
(PCDATA)gtlt!ELEMENT given (PCDATA)gtlt!ELEMENT 
email (PCDATA)gtlt!ELEMENT link EMPTYgtlt!ATTLIST 
link manager IDREF IMPLIED subordinates IDREFS 
IMPLIEDgtgt
XML Declaration
DTD 
 39XML Document With Accompanying DTD
ltaddressbookgt ltperson id"B.WALLACE" 
gender"male"gt ltnamegt 
ltfamilygtWallacelt/familygt 
ltgivengtBoblt/givengt lt/namegt 
ltemailgtbwallace_at_megacorp.comlt/emailgt 
ltlink manager"C.TUTTLE"/gt lt/persongt 
ltperson id"C.TUTTLE" gender"female"gt 
ltnamegtltfamilygtTuttlelt/familygtltgivengtClairelt/givengt
lt/namegt ltemailgtctuttle_at_megacorp.comlt/email
gt ltlink subordinates"B.WALLACE"/gt 
lt/persongtlt/addressbookgt
Body 
 40Do We Need a DTD?
- DTD is not mandatory for XML documents 
 - Lets try this out (use adbook.xml) 
 - Why do we need DTD then? 
 - DTD provides the Definition for a specific 
class of documents (e.g. reports, theses) - 
document hierarchy, elements, tags and syntactic 
rules for the document structure  - Essential for information exchange between 
applications and services  - Forms the basis for validating the correctness of 
XML documents  - Lets see another example  IOP DTD and XML files.
 
  41Do We Need a DTD? 
 42Inside an XML Document (File)
- An XML document file contains one and only one 
document root element  - Root element contains one or more documents 
 - Document content is marked up using user defined 
elements (tags) and element attributes  - Document content may also contain entity 
references (replacement text, external 
references, etc.).  - Document content may also contain comments and 
processing instructions 
  43Inside XML DTD
- DTD defines the document hierarchy in terms of 
the elements, elements themselves and their 
syntactic structure, entities and rules for their 
usage  - This is used for validating XML documents  
Examplelt!--DTD for Books--gtlt!ENTITY cright 
"169"gtlt!ELEMENT books (book)gtlt!ELEMENT book 
(title, isbn, authors, description?,price)gtlt!ELEM
ENT title (PCDATA)gtlt!ATTLIST title lang 
(englishfrench) REQUIREDgtlt!ELEMENT isbn 
(PCDATA)gtlt!ELEMENT authors (PCDATA)gtlt!ELEMENT 
description (PCDATA)gtlt!ELEMENT price 
(PCDATA)gtlt!ATTLIST price curr (RsDollar) 
IMPLIEDgt  
  44Well Formed and Valid XML Document
- Well formed XML documents must be syntactically 
correct. What does this mean?  - There is only one Root element and it contains 
the documents content  - Match start-tags with end-tags (except for empty 
element tag)  - Nested elements never overlap 
 - Attributes are unique and values are in quotes 
 - Only entity references permitted are amp for 
, lt for lt, gt for gt, apos for , and 
quot for . 
  45Well Formed and Valid XML Document
- Valid XML documents 
 - Valid XML documents are well-formed XML documents 
which include an XML declaration and document 
type declaration  - Valid documents must also adhere to the DTD 
 
Lets look at an example using sample Medline 
XML document 
 46Rendering of XML
- How can we view XML documents on the Web? 
 - XML is about data, not presentation  
presentation has to be handled separately  - This can be handled using CSS (Cascading Style 
Sheets) and XSLT (Extensible Stylesheet Language 
Transformations)  - CSS and XSLT can be used for rendering XML 
documents on the Web  - IE 5.0 supports viewing XML document trees and 
HTML rendering using CSS  - Lets see a CSS example (syllabus2)
 
  47Resources
- Darlene Fichter. Intranet databases for small and 
large databases. Online, May/June 2000, 88-90.  - Online magazine. Jan/Feb 1999 and May/June 
2001. Focus issues on intranets. 
www.onlineinc.com/onlinemag/  - Dublin Core 
 - dublincore.org 
 - purl.oclc.org/metadata/ 
 - Digital library toolkit. Dr. Peter Noerr. Sun 
Micro Systems. 2000. www.sun.com/edu  - XML 
 - www.oasis-open.org/cover/ (Robin Covers SGML/XML 
page)  - XML4Lib A discussion forum related to XML use on 
libraries (sunsite.berkeley.edu/XML4Lib/)