Title: Technology
1Technology for Hangul Viewing / Writing
2 Courses
1. Unicode
2. Hangul Code
3. Using Hangul in English OS
4. Web Browsing
5. Web Authoring
6. E-mailing
3- Korean Character Sets and Encoding Forms
- http//www.geocities.com/fontboard/cjk/csmore
.html - http//www.geocities.com/fontboard/cjk/cslang
.html -
41. Unicode http//www.unicode.org
1.1 What is Unicode?
- Unicode provides a unique number for every
character, - no matter what the platform,
- no matter what the program,
- no matter what the language.
Unique Number
A or ?
5 1.2 Why Unicode?
People in different countries use different
characters to represent the words of their
native languages.
1)ISO represents 8-bit character set. This
isUnicode .
- It is impossible to store text with characters
from different character sets in the same
document.
- As long as every document has its own
characters, and recognition of character set is
not automatic, manual user intervention is
inevitable.
1) ISO International Standard Organization
6 1.3 Unicode is changing all that!
- The Unicode Standard has been adopted by such
industry leaders as - Apple, HP, IBM, JustSystem, Microsoft,
Oracle, SAP, Sun, Sybase, - Unisys and many others.
- Unicode is required by modern standards such as
XML, Java, LDAP, CORBA, WML, etc.
- Incorporating Unicode into client-server or
multi-tiered applications and websites offers
significant cost savings over the use of legacy
character sets.
72. Hangul code http//pantheon.yale.edu/js
hin/faq/qa8.html
2.1 History of Hangul Code
In 1997, Korean standard body made a rather
drastic change in the naming scheme of standards
for information exchange and processing. What
used to be referred to as KS C 56xx - KS C 59xx
were renamed as KS X xxxx. The following
summarizes the change.
- KS C 5601 -gt KS X 1001 The most widely used
coded character set - KS C 5657 -gt KS X 1002 Additional
characters for information -
exchange - KS C 5636 -gt KS X 1003 Korean version of
ISO 646/US-ASCII - KS C 5620 -gt KS X 1004 ISO/IEC 2022
- KS C 5700 -gt KS X 1005-1 Unicode 2.0/ISO-10646
- KS C 5697 -gt KS X 1023 ISO 2375
- KS C 5861 -gt KS X 2901 Korean Unix
environment
82.2. What is Hangul Code?
- Korean Alphabetic
- ??????????????
- ??????????
- ?????
- 2 Bytes per character
- Classified into 3 types
- WanSungHyung(built-in)
- assigned code per character
- ChoHabHyung (combined)
- assigned code as a Initial, medium, and final
- sound separately
- TongHabHyung (Unified)
- As a Microsoft own Hangul encoding,
- integrated
Wansunghyung with Chohabhyung
92.3 Microsoft UHC (Unified Hangul Code)
KSC-5601 is a basic Wansunghyung(built-in) code
thats comprised of 2350 characters. Due to
limitation to numbers of characters, Microsoft
introduced Unified Hangul Code. Microsoft Windows
95/NT use UHC ( in case of NT 4.0, all internal
processings are done in Unicode, but on the
surface, it used UHC).
UHC KSC-5601(limited 2350 characters)
Additional codes
102.4 What is Hangul O/S ?
Hangul O/S Unified Hangul Code
Windows95/98/NT
1)UHC
Windows95/NT
1) UHC Unified Hangul Code
11- 3. Using Hangul in English OS
- http//www.geocities.com/fontboard/cjk/apps
- Korean Input Methods
- http//www.geocities.com/fontboard/cjk/input.
htmlkorean - Unicode-Compliant Applications
- http//www.geocities.com/fontboard/cjk/uniapps
.html - Windows Fonts that Support Unicode
- http//www.hclrss.demon.co.uk/unicode/fonts.h
tml - Luc Devroyes Page on Korean Fonts
- http//cgm.cs.mcgill.ca/luc/korea.html
-
12- Global IME
- - Supports MS IE, MS Outlook Express, Word
2000, NSC 4.72 etc - Windows 2000 support built-in Korean
Language - - Product Info
- http//www.microsoft.com/Windows/ie/Features/
ime.asp - - Download
- http//www.microsoft.com/msdownload/iebuild/i
me5_win32/en/ime5_win32.htm - - Global IME support in Communicator 4.72
- http//home.netscape.com/eng/intl/gimesuppor
t.html
13(No Transcript)
14- UnionWay AsianSuite 2000 (CJK Unionway)
-
- - Supports Windows 2000, NT, 95/98 and lot of
applications - such as Office 2000, 97
- - Product Info and Download
- http//www.unionway.com/servlet/Folder?Node
111
15(No Transcript)
16- Korean Partner 2000 by Twinbridge
- - Product Info
- http//www.twinbridge.com/products/kp/kp2000.
html - - Download
- http//www.twinbridge.com/free_download/downl
oad_korean.htm - - Download Unicode Font
- http//www.twinbridge.com/products/AsianTTF/
kttfont.html
17(No Transcript)
18- NJStar Communicator 2.23
- Product Info
- http//www.njstar.com/communicator/
- Download
- http//www.njstar.com/communicator/download.htm
19- Download Arial Unicode MS
- - http//officeupdate.microsoft.com/2000/downloadD
etails/aruniupd.htm - A full Unicode font containing all of the
approximately 40,000 alphabetical characters,
ideographic characters, and symbols defined in
the Unicode 2.1 standard. - Note This font is included with all versions of
Microsoft Office 2000 Service Release 1 (SR-1).
20- Download Visual Keyboard
- http//officeupdate.microsoft.com/2000/downloadde
tails/viskeyboard.htm - Supports typing in more than one language on the
same computer by showing - you a keyboard for another language on your
screen.
214. Web browsing
4.1 Viewing Hangul WWW pages http//www.geocities
.com/fontboard/cjk/browsers.html http//www.hclrss
.demon.co.uk/unicode/browsers.html
- Set Hangul Programs (MS Global IME, Unionway,
etc) - or Get Korean Fonts
- Use IE 3.0 or later or NSC 4.0 or later
225. Web Authoring
5.1 Namo WebEditor 4
- Multilingual HTML editor
- Supports Unicode
- Supports Microsofts Global IME and Visual
Keyboards - Allows files to be saved in UTF-8 encoding, in
addition to some CJK and other encoding - A 45-day trial version
- http//www.namo.com/products/we4/index.html
235. 2 Netscape Composer
- Multilingual HTML editor
- Supports Unicode
- Supports Microsofts Global IME
- Allows files to be saved in UTF-8 encoding
- Available only as part of Netscape 6, which
includes Netscape Navigator
245. 3 Word 2000
- Can save files as HTML
- HTML Filter 2.0 to remove the Office-specific
markup tags - http//officeupdate.microsoft.com/2000/downloadDe
tails/Msohtmf2.htm
255. 4 Metatag
- Browsers that understand the META tag will
automatically display the page in the right
language. - Multilingual
- ltmeta http-equivcontent-type
contenttext/htmlcharsetutf-8gt - ltmeta http-equivcontent-type
contenttext/htmlcharsetutf-7gt - Korean
- ltmeta http-equivcontent-type
contenttext/htmlcharsetks_c_5601gt -
26 6. E-mailing
http//pantheon.cis.yale.edu/jshin/faq/qa9.html
6.1 E-mail setup
- Setup Hangul program ( MS Global IME, Unionway,
etc) - Use E-mail client program ( Nescape mail, MS
outlook Express, etc) - Selecting MIME encoding (base 64, or quoted
pritable)
276.2 Considerations in E-mailing
- Dont use Korean in the subject
- If you cant input Korean in E-mailing, use
Hangul editor and send a file attached. - Use cvt8.exe to converse codes if you cant see
mail in Korean - http//ain.icube.co.kr/jhlee/cvt8.html
286.3 Reason for Hangul Mail Problem
- It may not be broken data but be a CODE problem.
- E-mail does not guarantee 8 bit data
transfer. But Hangul sent in 8 bit code, such as
KS-WanSungHyung can be broken. - So the way of send as conversing Hangul 8
bit to 7 bit is used. - (code conversion in Mail server, code
conversion in Mail User Agent)
29- IE 3.0 and 4.0
- - View / Options / Font / Character Sets / Korean
/ - Proportional Fonts and Fixed Width Font
- - View / Fonts / Korean
- IE 5.0
- - Tools / Internet Options / Font / Language
Script Korean - Web page font and Plain text font
- - View / Encoding / Korean
30- Netscape Communicator 4.0, 4.5 and 4.6
- - Edit / Preferences / Appearance / Fonts /
Encoding Korean - Variable Width Font and Fixed Width Font
- - View / Character Set ( or Encoding) Korean
(Auto-Detect) - Netscape Communicator 4.5
- - Edit / Preferences / Mail Newsgroups /
Formatting - Check Use the HTML editor to compose messages
- When sending HTML mail messages to recipients
- Concert the message into Plain text
- - Edit / Preferences / Mail Newsgroups /
Messages - Send messages that use 8-bit characters As
is - Message Composition Window / View / Character set
(or Encoding) / - Korean (Auto-Detect)
31- MS Outlook Express
- - Tools / Options / Send TAB
- Mail Sending Format / Plain Text Settings
TAB - Message format MIME
- Encode text using None
- - News Sending Format / Plain Text Settings
TAB - Message format MIME
- Encode text using None
- Check Allow 8-bit characters in headers
- - Turn off the check box Reply to messages
using the format in which they are sent - - Message Composition Widow / Format / Encoding
(or Language ) / Korean (EUC) (or Korean)
32Useful Sites
- Hangul and Internet in Korea FAQ
- http//pantheon.cis.yale.edu/jshin/faq/
- Sources and Information for Multi-lingual CJK
Fonts - http//staff.washington.edu/rrbritt/ceal/clt/
multilingual.htm - Unicode and Multilingual Support in Web Browsers
and HTML - http//www.hclrss.demon.co.uk/unicode/
- Chinese, Japanese and Korean characters in
English Windows - http//www.geocities.com/fontboard/cjk/