CALISHKCAN beyond Chinese cataloging - PowerPoint PPT Presentation

1 / 26
About This Presentation
Title:

CALISHKCAN beyond Chinese cataloging

Description:

Babel Fish, etc. Translate entire or partial website by user's choice ... multi-scripts and fonts; Babel Fish for translating website; Website has ... – PowerPoint PPT presentation

Number of Views:84
Avg rating:3.0/5.0
Slides: 27
Provided by: w2kM6
Category:

less

Transcript and Presenter's Notes

Title: CALISHKCAN beyond Chinese cataloging


1
???????
  • CALIS/HKCAN beyond Chinese cataloging
  • by Charlene Chou
  • May 23, 2005 at Peking University

2
???????
  • ?Internet????????????????,?????????????????????
  • Unicode????????????????(OPAC)??????????
  • ????????????(crosswalk)????
  • CALIS??????????????

3
???????
  • ?????????????(??HKCAN?CALIS)
  • ????????????????????
  • ????????(OPAC)???????????????????
  • ????????????(crosswalk)
  • ??????????????????????

4
????????????? HKCANXML Version
  • ?????????????????(Cross Searching)
  • ???????,NLC,CNAD,LC/NAF?HKCAN?????????????????
  • VIAF/LEAF?????
  • ???????????LC/NAF?NLC????????
  • ??OCLC,HKCAN?????????
  • CALIS will be the next very soon, right?

5
HKCAN?LC/NAF???
  • ??non-unique names???????
  • Verifying Chinese names in vernacular scripts at
    LC and Columbia University
  • LC breaking undifferentiated record based on
    vernacular script in HKCAN
  • Romanized/English form in HKCAN with the
    qualifier of birth date
  • CUL adding 4XX field for variant form
  • Is CALIS next, esp. with OCLC access?

6
???????????????(??)
  • OCLC/Connexion and RLIN
  • Parallel fields are available in bibliographic
    records for years
  • The key is if ILS can display vernacular scripts
  • LC/NAF
  • To include vernacular scripts in authority
    records in the near future?
  • ILS problem for implementing Unicode and others
  • Cataloging policies and economic perceptions to
    be dealt with

7
???????????????(??)
  • MARBI
  • MARC21 has the standards for bibliographic and
    authority records
  • Currently has no further plans after 2001-DP05
  • Can be communicated through the full Unicode
    UTF-8 character sets or limited to MARC-8
    characters?

8
????????????
  • Economic concerns in USA
  • LC/PCC access records for Web resources
  • Subject vs. descriptive access
  • Costly for authority control
  • Multi-level policy controlled vs. uncontrolled
    headings
  • FRANARinternational efforts standards
  • UBCIMMinimal Level
  • Shift from ISADN to VIAF INTERPARTY
  • VIAF linking is key to retain flexibility and
    local autonomy

9
?????????????
  • ISSN international harmonization for continuing
    resources
  • FRBR and AACR3 (now called RDA after JSC meeting
    in May)
  • FRBR for serials MFHD/ISSN/FRBR-super record or
    authority record for serials
  • FRBRized display in OPAC collocation
  • Contents vs. carrier
  • Grouping multi-manifestation records (only 20
    records in OCLC WorldCat)

10
???????????(??)
  • Digital Library Federation
  • Clearinghouse for all DL projects in US
  • Sharing and communicating among projects
  • Digitization Registry of Digital Master
  • CONSER will seek communication for cataloging
    policy

11
Unicode?????????(OPAC)????
  • ILS vendors active interest in recent years
  • MARC21 spec now support Universal Coded Character
    Set
  • Mapping to Unicode from MARC character sets
    completed
  • Unicode fonts more characters available now
  • Newer operating system support Unicode now

12
Unicode ?ILS????????
  • Which scripts have been implemented
  • ILS implement language by language/Which scripts
    are supported, e.g. Arabic, but may not support
    the additional extended Arabic characters in such
    languages as Persian
  • How sorting of particular language is handled
  • Depending on the language
  • Which encoding formats are supported to
    import/export records in Unicode

13
Unicode
  • Not complete solution for language support in
    library system
  • A foundation to build and will make development
    for multiple language much easier
  • A solution for now, maybe new technology will
    have a better solution in future?

14
?????OPAC???????
  • CSULA transaction logs analysis
  • Next generation of the OPAC interface
  • Concept-based search allow natural-language
    searching with keyword search first, and focus on
    quick-search need
  • Relevance feedback bring back related pages
  • Spelling correction
  • Relevance-ranked output

15
???????????
  • Multilingual dictionary
  • Google definitions are multilingual, e.g.
    definitions of voip on the Web in all
    languages--Chinese (Simplified), English, French,
    German, Italian, Russian, and Spanish
  • Translation tool for website
  • Babel Fish, etc.
  • Translate entire or partial website by users
    choice
  • Users choice of language on website

16
???????????
  • Google--Multilingualwebmaster.com
  • How to implement the translation process of the
    search engine?
  • From Translation vendors in multiple languages to
    direct communication with the translators--Individ
    ual translators specialized in a given language
  • Self-service translation application
  • Localize the content of its multilingual site
  • Currently support native interface in14
    languages (English) search in 25 languages (
    English)

17
???????????
  • Open Language Archives Community
    (OLAC)--Specialized OAI sub-domain
  • Features of the search engine include
  • A thesaurus of alternative language names
  • Language code searching
  • Keyword-in-context display in search results
  • Search for similarly spelled words
  • Search for similar items
  • Support for standard string search operators and
    domain-specific inline syntax
  • Automatically derived search links for other web
    search engines
  • Contribution the inclusion in the search engine
    results of a metadata quality-centric sorting
    algorithm

18
???????????
  • Currently Web browser displays multi-scripts and
    fonts Babel Fish for translating website
    Website has language choice Speech recognition
    software available
  • Translation machine is just one component of
    complex multilingual system (John White)
  • Multilingual metadata is a cross-cultural
    retrieval--far more than mere cross-language
    searching (Cliff Lynch)

19
?????(library portals)?????????(Crosswalk)
  • Why Google is courting libraries for scanning the
    text of collections?
  • Web OPAC, Hot-link to resources, OpenURL, etc.
    linked to Google, and Amazon, etc?
  • Searching precision vs. recall
  • Library and Google Scholar
  • Future OPAC Topic maps, controlled vocabulary,
    preferred languages, available translation and
    edition (RLG Google), and online seller

20
??FRBR????????( Semantic Web )
  • IFLAs FRBR a semantic expression of the
    relationships between items in the library
    catalog
  • Berners-Lees ultimate vision for XML the
    Semantic Weblayering a logic layer over the
    topinfo on WWW to have logical or semantic
    relationships
  • RDFtriplestwo nodes (one subject one object),
    connected with a predicate relationship-DAMLOIL
    Topic Maps triplestopics, associations and
    scopes
  • MARC XML open to web harvesters, such as OAI
  • FRBR, RDF, DAMLOIL, or Topic Maps all used to
    set up a FRBR ???(ontology)
  • RDF can serve as a semantic layer to make FRBR
    expressive and be layered over MARC XML

21
?????can be ???(semantic map) ?building block
and beyond
  • Tilletts perspectives on the future
  • VIAF an integral part of a future Semantic Web
  • Ontology infrastructure and controlled
    vocabularies can be used to enable displays in
    the users own language and script
  • Help users of the Web for collocation and search
    precision
  • Tools to connect biographical dictionaries,
    telephone directories, AI, etc.

22
??????
  • Language is an important perspective or a major
    barrier, but there are much more
  • Multilingual searching across diverse collections
  • Five layers of technology (Noriko Kando)
  • The pragmatic layer cultural and social aspects,
    convention the semantic layer concept mapping
    the lexical layer language identification,
    indexing the symbolic layer encoding issues
    the physical layer the network

23
??????
  • Multilingual, multicultural and multimedia
    digital libraries
  • HKCAN is bilingual but may be multicultural in
    certain contexts CALIS can be more diverse
  • MARBI 2001-DP05 rename as context-sensitive
    authority record
  • From multilingual to multi-context?
  • Global vs. localized library
  • VIAF/LEAFglocatization movement?
  • Google multilingual vs. localized site

24
(No Transcript)
25
CALIS??????
  • OCLC in China or much more
  • Sharing Chinese bibliographic and authority
    records from Asia to other continents
  • Wider range of libraries beyond academic
    libraries
  • Multilingual choice for international users
  • Wider range of users from librarians to students
  • Hotlinks to extended resources

26
CALIS??????
  • Leading research in library and information
    science, such as FRBR projects in OCLC
  • Digitizing more historical materials
  • Supporting use of rare book collections
  • Expanding from subject-oriented collections to
    Semantic Maps for Chinese collections and
    information
  • What is the vision of CALIS to participate in the
    global library?
Write a Comment
User Comments (0)
About PowerShow.com