Unikodo standartas - PowerPoint PPT Presentation

1 / 30
About This Presentation
Title:

Unikodo standartas

Description:

... 112 code points available for assigning the repertoire of abstract characters. ... The Basic Multilingual Plane (BMP, or Plane 0) (consists of the range 000016. ... – PowerPoint PPT presentation

Number of Views:60
Avg rating:3.0/5.0
Slides: 31
Provided by: Rimga
Category:

less

Transcript and Presenter's Notes

Title: Unikodo standartas


1
Unikodo standartas
  • Rimgaudas Laucius
  • 2006

2
Keletas faktu
  • Pradetas kurti 1988 m. Iejo kelios versijos.
    Paskutine dar oficialiai nepaskelbta, taciau
    baigiama rengti versija 5.0.
  • Standartas atviras.
  • Tinklalapis www.unicode.org

3
Santykis su ISO 10646
  • enklu lenteles sutampa
  • Unikodas apibreia ne tik enklu lentele, taciau
    ir taisykles susijusias su ju realizacija
    kompiuteryje.

4
Pagrindiniai Unikodo principai
5
Universalumas, efektyvumas,vienareikmikumas
6
Apimama enklu aibe
  • The Unicode Standard is a superset of all
    characters in widespread use today. It contains
    the characters from major international and
    national standards as well as prominent industry
    character sets.

7
4 versijos enklu aibe
  • The Unicode Standard, Version 4.0, contains
    96,382 characters from the worlds scripts. These
    characters are more than sufficient not only for
    modern communication in most languages, but also
    for the classical forms of many languages. The
    standard includes the European alphabetic
    scripts, Middle Eastern right-to-left scripts,
    and scripts of Asia, as well as many others. The
    unified Han subset contains 70,207 ideographic
    characters defined by national and industry
    standards of China, Japan, Korea, Taiwan,
    Vietnam, and Singapore. In addition, the Unicode
    Standard includes punctuation marks, mathematical
    symbols, technical symbols, geometric shapes, and
    dingbats.

8
Kodo vienetu kiekis
  • In the Unicode Standard, the codespace consists
    of the integers from 0 to 10FFFF16, comprising
    1,114,112 code points available for assigning the
    repertoire of abstract characters.
  • Klaidinga manyti, kad Unikodas tai tik 16 bitu
    enklu kodavimo budas (taip galima ukoduoti 64K
    enklu)

9
Sritys
  • The Unicode codespace consists of the numeric
    values from 0 to 10FFFF16, but in practice it has
    proven convenient to think of the codespace as
    divided up into planes of characterseach plane
    consisting of 64K code points.
  • The Basic Multilingual Plane (BMP, or Plane 0)
    (consists of the range 000016..FFFF16) contains
    all the common-use characters for all the modern
    scripts of the world, as well as many historical
    and rare characters.
  • Sekancios dvi sritys turi nedaug labai retai
    naudojamu enklu. Kitos apskritai dar yra tucios.

10
Lietuvikos abecelesraidiukodai Unikode
11
enklu kodavimo budai
  • UTF-8, UTF-16, UTF-32 (visi budai ekvivalentus ir
    lygiaverciai)

12
Baitu idestymo kryptis
  • BOM
  • FEFF
  • FFFE
  • EF BB BF

13
Unikodo enklu ymejimas
  • Unikodo kodus priimta ymeti Uxxxx. Cia xxxx yra
    keturenklis eioliktainis skaicius kodo eiles
    numeris.
  • Unikodo enklai taip pat turi vardus. Pvz.
    (parayti didiosiomis raidemis)

14
Unikodas apireia enklus, o ne paveikslelius
15
Pagrindines Unikodo sritiessandara
16
Kombinaciniai enklai
17
Kombinacines sekos
18
enklu ekvivalentumas
19
Privati sritis
  • 6400 enklu pradedant nuo UE000
  • Tucia sritis, kuri gali buti naudojama saviems
    tikslams

20
Surogatu sritis
  • UD800 UDFFF
  • Naudojama taikant UTF-16 metoda, likusiu sriciu
    (be pagrindines) kodavimui kodo vienetu poromis

21
Kaip atpainti kodavimo metodus
22
WindowsDOS
23
Rusikas tekstas
  • Jei matosi vien neaikus enklai, tuomet verta
    pabandyti rusikas Windows ir DOS koduotes (1251,
    866

24
(No Transcript)
25
(No Transcript)
26
UTF-8
  • Matosi tik nesamones, taciau labai danai ir ypac
    del to kad odiu pradioje kartojasi tas pats
    enklas (iuo atveju a)

27
Unikodas
  • Kas antra raide kartojasi tas pats enklas. Prie
    anglikas raides tarpai. (failo pradioje raide)

28
(No Transcript)
29
Nemokamas, geras teksto doroklis mokantis daug
koduociu - UniRed
30
Kodav
Write a Comment
User Comments (0)
About PowerShow.com