1 .\" Hey Emacs! This file is -*- nroff -*- source.
3 .\" Copyright (C) Markus Kuhn, 1995, 2001
5 .\" This is free documentation; you can redistribute it and/or
6 .\" modify it under the terms of the GNU General Public License as
7 .\" published by the Free Software Foundation; either version 2 of
8 .\" the License, or (at your option) any later version.
10 .\" The GNU General Public License's references to "object code"
11 .\" and "executables" are to be interpreted as the output of any
12 .\" document formatting or typesetting system, including
13 .\" intermediate and printed output.
15 .\" This manual is distributed in the hope that it will be useful,
16 .\" but WITHOUT ANY WARRANTY; without even the implied warranty of
17 .\" MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
18 .\" GNU General Public License for more details.
20 .\" You should have received a copy of the GNU General Public
21 .\" License along with this manual; if not, write to the Free
22 .\" Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111,
25 .\" 1995-11-26 Markus Kuhn <mskuhn@cip.informatik.uni-erlangen.de>
26 .\" First version written
27 .\" 2001-05-11 Markus Kuhn <mgk25@cl.cam.ac.uk>
30 .\" Japanese Version Copyright (c) 1997 HANATAKA Shinya
31 .\" all rights reserved.
32 .\" Translated Thu Jun 3 20:36:31 JST 1997
33 .\" by HANATAKA Shinya <hanataka@abyss.rim.or.jp>
34 .\" Updated & Modified Sat Jun 23 07:30:09 JST 2001
35 .\" by Yuichi SATO <ysato@h4.dion.ne.jp>
38 .\"WORD: diacritical mark ȯ²»¶èÊÌÉä¹æ
39 .\"WORD: International Phonetic Alphabet ¹ñºÝ²»À¼»úÊì
42 .TH UNICODE 7 2001-05-11 "GNU" "Linux Programmer's Manual"
44 Unicode \- ÈÆÍÑʸ»ú½¸¹ç
49 .B "ÈÆÍÑʸ»ú½¸¹ç (Universal Character Set (UCS))"
51 UCS ¤Ï¾µ¬³Ê¤Îʸ»ú½¸¹ç¤Îʸ»ú¤òÁ´¤Æ´Þ¤ó¤Ç¤¤¤ë¡£
53 .B "ÁÐÊý¸þ¤Î¸ß´¹À (round-trip compatibility)"
55 Î㤨¤Ð¾¤ÎÉä¹æ¤«¤é UCS ¤ËÊÑ´¹¤·¤µ¤é¤Ë¸µ¤ÎÉä¹æ¤ËÊÑ´¹¤·¤¿¤È¤·¤Æ¤â¡¢
56 ²¿¤Î¾ðÊó¤â¼º¤Ê¤ï¤ì¤Ê¤¤¤è¤¦¤ËÊÑ´¹¥Æ¡¼¥Ö¥ë¤òºîÀ®¤¹¤ë¤³¤È¤¬¤Ç¤¤ë¡£
58 UCS ¤Ï¸½¼ÂŪ¤ËÃΤé¤ì¤Æ¤¤¤ëÁ´¤Æ¤Î¸À¸ì¤òɽ¸½¤¹¤ë¤Î¤ËɬÍפÊʸ»ú¤ò´Þ¤ó¤Ç¤¤¤ë¡£
59 ¤³¤ì¤Ë¤Ï¥é¥Æ¥óʸ»ú¡¢¥®¥ê¥·¥ãʸ»ú¡¢¥¥ê¥ëʸ»ú¡¢¥Ø¥Ö¥é¥¤Ê¸»ú¡¢¥¢¥é¥Ó¥¢Ê¸»ú¡¢
60 ¥¢¥ë¥á¥Ë¥¢Ê¸»ú¡¢¥°¥ë¥¸¥¢Ê¸»ú¤À¤±¤Ç¤Ê¤¯¡¢Ãæ¹ñ¡¦ÆüËÜ¡¦´Ú¹ñ¤Ç»È¤ï¤ì¤Æ¤¤¤ë´Á»ú¡¢
61 ¤µ¤é¤Ë¤Ï¡¢Ê¿²¾Ì¾¡¢ÊÒ²¾Ì¾¡¢¥Ï¥ó¥°¥ëʸ»ú¡¢
62 ¥Ç¡¼¥ô¥¡¥Ê¡¼¥¬¥ê¡¼Ê¸»ú¡¢¥Ù¥ó¥¬¥ëʸ»ú¡¢¥°¥ë¥à¥¡¼Ê¸»ú¡¢¥°¥¸¥ã¥é¡¼¥Èʸ»ú¡¢
63 ¥ª¥ê¥ä¡¼Ê¸»ú¡¢¥¿¥ß¡¼¥ëʸ»ú¡¢¥Æ¥ë¥°Ê¸»ú¡¢¥«¥Ê¥éʸ»ú¡¢¥Þ¥é¥ä¡¼¥é¥àʸ»ú¡¢
64 ¥¿¥¤Ê¸»ú¡¢¥é¥ª¥¹Ê¸»ú¡¢¥¯¥á¡¼¥ëʸ»ú¡¢¥Ü¥Ý¥â¥Õ¥©Ê¸»ú (Ãí²»»úÊì)¡¢
65 ¥Á¥Ù¥Ã¥Èʸ»ú¡¢¥ë¡¼¥óʸ»ú¡¢¥¨¥Á¥ª¥Ô¥¢Ê¸»ú¡¢¥«¥Ê¥À²»Àáʸ»ú¡¢
66 ¥Á¥§¥í¥¡¼Ê¸»ú¡¢¥â¥ó¥´¥ëʸ»ú¡¢
67 ¥ª¥¬¥àʸ»ú¡¢¥ß¥ã¥ó¥Þ¡¼Ê¸»ú¡¢¥·¥ó¥Ï¥éʸ»ú¡¢
68 ¥¿¡¼¥Êʸ»ú¡¢¥¤ (׳) ʸ»ú¤Ê¤É¤¬´Þ¤Þ¤ì¤ë¡£
69 ¤Þ¤À¥«¥Ð¡¼¤µ¤ì¤Æ¤¤¤Ê¤¤Ê¸»ú¤ËÉÕ¤¤¤Æ¤â¡¢
70 ¥³¥ó¥Ô¥å¡¼¥¿¤Ç»ÈÍѤ¹¤ë¤¿¤á¤Ë
71 ¤É¤Î¤è¤¦¤Ê¥¨¥ó¥³¡¼¥É¤¬¤â¤Ã¤È¤âÎɤ¤¤«¤È¤¤¤¦¸¦µæ¤¬¿Ê¤á¤é¤ì¤Æ¤ª¤ê¡¢
72 ºÇ½ªÅª¤Ë¤ÏÄɲ䵤ì¤ë¤À¤í¤¦¡£
73 ¥Ò¥¨¥í¥°¥ê¥Õ¤äÎò»ËŪ¤Ê¤¤¤í¤¤¤í¤Ê¥¤¥ó¥É¡á¥è¡¼¥í¥Ã¥Ñ¸À¸ì¤À¤±¤Ç¤Ê¤¯¡¢
74 ¥Æ¥ó¥°¥ï¡¼¥ëʸ»ú¡¢¥¥¢¥¹Ê¸»ú¡¢¥¯¥ê¥ó¥´¥óʸ»ú¤Ê¤É¤Î¿Í¹©Åª¤Ê¸À¸ì¤âÁª¤Ð¤ì¤Æ¤¤¤ë¡£
75 UCS ¤Ï¡¢¤³¤ì¤é¤Îʸ»ú¤Ë²Ã¤¨¤Æ¡¢TeX, PostScript, APL, MS-DOS, MS-Windows,
76 Macintosh, OCR ¥Õ¥©¥ó¥È¡¢¿ô¿¤¯¤Î¥ï¡¼¥É¥×¥í¥»¥Ã¥µ¡¼¤ä
77 ½ÐÈÇ¥·¥¹¥Æ¥à¡¢¤Ê¤É¤¬Ä󶡤¹¤ë
78 ¿Þ·Áµ¹æ¡¦°õ»úµ¹æ¡¦¿ô³Øµ¹æ¡¦²Ê³Øµ¹æ¤Ê¤É¤Î¿¤¯¤ò´Þ¤à¤è¤¦¤Ë¤Ê¤Ã¤¿¡£
80 UCS µ¬³Ê (ISO 10646) ¤Ï
81 .I "31¥Ó¥Ã¥È¤Îʸ»ú½¸¹ç¥¢¡¼¥¥Æ¥¯¥Á¥ã¡¼"
82 ¤òµ½Ò¤·¤Æ¤ª¤ê¡¢128 ¸Ä¤Î 24 ¥Ó¥Ã¥È
85 ³Æ·²¤Ï 256 ¸Ä¤Î 16 ¥Ó¥Ã¥È
87 ¤Ëʬ³ä¤µ¤ì¤Æ¤ª¤ê¡¢³Æʸ»ú¤Ï 256 ¸Ä¤Î 8 ¥Ó¥Ã¥È
94 ¤Ç¤Ï¡¢ºÇ½é¤Î 65534 ¸Ä¤Î¥³¡¼¥É°ÌÃÖ (0x0000 ¡Á 0xfffd) ¤òÄêµÁ¤·¤Æ¤¤¤ë¡£
95 ¤³¤ì¤ÏÂè 0 ·²¤ÎÂè 0 Ì̤Ǥ¢¤ë
96 .IR "´ðËÜ¿¸À¸ìÌÌ (Basic Multilingual Plane (BMP))"
100 ¤Ç¤Ï¡¢Âè 0 ·²¤Î BMP ¤Î³°Éô¤Ç¤¢¤ë
101 0x10000 ¡Á 0x10ffff ¤ÎÈϰϤˤ¢¤ë
104 ¤³¤Îµ¬³Ê¤Ç¤Ï 0x10ffff ¤ò±Û¤¨¤¿°ÌÃÖ¤Ëʸ»ú¤òÄɲ乤ëͽÄê¤Ï¤Ê¤¤¤Î¤Ç¡¢
105 ͽÁۤǤ¤ë¾Íè¤Ë¤ª¤¤¤Æ¤Ï¡¢
106 Á´¥³¡¼¥É¶õ´Ö¤Î¤¦¤Á¥°¥ë¡¼¥× 0 ¤Î°ìÉôʬ¤Ï¼ÂºÝ¤Ë¤Ï»È¤ï¤ì¤ë¤³¤È¤Ï¤Ê¤¤¡£
107 BMP ¤Ë¤Ï¾¤Îʸ»ú½¸¹ç¤Ç°ìÈ̤˻Ȥï¤ì¤ëÁ´¤Æ¤Îʸ»ú¤¬´Þ¤Þ¤ì¤Æ¤¤¤ë¡£
108 ISO 10646-2 ¤ÇÄɲ䵤줿Êä½õÌ̤ϡ¢
109 ÆÃÄê¤Î²Ê³ØʬÌ¼½ñ½ÐÈÇ¡¦°õºþ»º¶È¡¦¹â¼¡¥×¥í¥È¥³¥ë¡¦
110 ²¿¤«¤Î¥Õ¥¡¥ó¤Î´Ö¤Ê¤É¤Ç»È¤ï¤ì¤ëÆüì¤Êʸ»ú¤À¤±¤ò¥«¥Ð¡¼¤¹¤ë¡£
112 UCS ʸ»ú¤ò 2 ¥Ð¥¤¥È¤Î¥ï¡¼¥É¤Çɽ¸½¤¹¤ë¤Î¤¬
114 ·Á¼°¤Ç¤¢¤ë (BMP ʸ»ú¤Î¤ß)¡£
117 ¤Ç¤Ïʸ»ú¤ò 4 ¥Ð¥¤¥È¤Î¥ï¡¼¥É¤Çɽ¸½¤¹¤ë¡£
118 ¤µ¤é¤Ë¡¢ASCII ¤ò½èÍý¤¹¤ë¥½¥Õ¥È¥¦¥§¥¢¤Ø¤Î²¼°Ì¸ß´¹¤Î¤¿¤á¤Ë
120 ¥¨¥ó¥³¡¼¥É·Á¼°¤¬¤¢¤ë¡£
121 ¤Þ¤¿¡¢0x10ffff ¤Þ¤Ç¤ÎÈó BMP ʸ»ú¤ò°·¤¦
122 UCS-2 Âбþ¥½¥Õ¥È¥¦¥§¥¢¤È¤Î¸ß´¹¤Î¤¿¤á¤Ë
124 ¥¨¥ó¥³¡¼¥É·Á¼°¤¬¤¢¤ë¡£
126 UCS ʸ»ú½¸¹ç¤Î 0x0000 ¤«¤é 0x007f ¤Ï¡¢¸ÅŵŪ¤Ê
128 ʸ»ú½¸¹ç¤Îʸ»ú¤ÈƱ¤¸¤Ç¤¢¤ë¡£
129 ¤Þ¤¿ 0x0000 ¤«¤é 0x00ff ¤ÎÈϰϤǤϡ¢
130 .B ISO 8859-1 Latin-1
131 ʸ»ú½¸¹ç¤Îʸ»ú¤ÈƱ¤¸¤Ç¤¢¤ë¡£
132 .SS "¹çÀ®Ê¸»ú (Combining Characters)"
134 ¤Î¤¤¤¯¤Ä¤«¤Î¥³¡¼¥É¡¦¥Ý¥¤¥ó¥È¤Ï
135 .I "¹çÀ®Ê¸»ú (combining characters)"
136 ¤Ë³ä¤êÅö¤Æ¤é¤ì¤Æ¤¤¤ë¡£
137 ¤³¤ì¤é¤Ï¥¿¥¤¥×¥é¥¤¥¿¡¼¤Î°ÜÆ°¤·¤Ê¤¤¥¢¥¯¥»¥ó¥È¡¦¥¡¼¤Ë»÷¤Æ¤¤¤ë¡£
138 ¹çÀ®Ê¸»ú¤ÏľÁ°¤Îʸ»ú¤Ë¥¢¥¯¥»¥ó¥È¤Î¤ß¤ò²Ã¤¨¤ë¡£
139 ºÇ¤â½ÅÍפʥ¢¥¯¥»¥ó¥ÈÉÕ¤¤Îʸ»ú¤Ï¤½¤ì¼«¿È¤Î¥³¡¼¥É¤ò UCS ¤Ë»ý¤Ã¤Æ¤¤¤ë¡£
140 °ìÊý¤Ç¹çÀ®Ê¸»úµ¡¹½¤ÏÁ´¤Æ¤Îʸ»ú¤Ë¥¢¥¯¥»¥ó¥È¤äȯ²»¶èÊÌÉä¹æ¤ò²Ã¤¨¤ë¤³¤È¤¬¤Ç¤¤ë¡£
141 ¹çÀ®Ê¸»ú¤Ï¾ï¤Ë¤½¤ì¤¬½¤Àµ¤¹¤ëʸ»ú¤Ë³¤¯¡£
142 Î㤨¤Ð¥É¥¤¥Ä¸ì¤Îʸ»ú A ¥¦¥à¥é¥¦¥È ("Latin capital letter A with diaeresis") ¤Ï
143 UCS ¤ËÁ°¤â¤Ã¤Æ½àÈ÷¤µ¤ì¤¿¥³¡¼¥É 0x00c4 ¤Ç¤â¡¢
144 Ä̾ï¤Î A "Latin capital letter A" ¤Ë
145 "combining diaeresis (¹çÀ®Ê¬²»µ¹æ)" ¤ò³¤±¤¿Áȹ礻
146 (0x0041 0x0308) ¤Î¤É¤Á¤é¤Ç¤âɽ¸½¤¹¤ë¤³¤È¤¬¤Ç¤¤ë¡£
148 ¹çÀ®Ê¸»ú¤Ï¡¢¥¿¥¤Ê¸»ú¤ä¿ô³Ø¿¢»ú¤Î¥¨¥ó¥³¡¼¥É¡¦
149 ¹ñºÝ²»À¼»úÊì¤ò»È¤¦¥æ¡¼¥¶¡¼¤Ê¤É¤Ë¤Ïɬ¿Ü¤Ç¤¢¤ë¡£
151 Á´¤Æ¤Î¥·¥¹¥Æ¥à¤Ë¹çÀ®Ê¸»ú¤Î¤è¤¦¤Ê¿Ê¤ó¤À¥µ¥Ý¡¼¥È¤ò´üÂÔ¤·¤Æ¤¤¤ë¤ï¤±¤Ç¤Ï¤Ê¤¤¡£
152 ISO 10646-1 ¤Ï°Ê²¼¤Î»°Ãʳ¬¤Î UCS ¤Î¼ÂÁõ¥ì¥Ù¥ë¤ò»ØÄꤷ¤Æ¤¤¤ë¡£
156 .B ¥Ï¥ó¥°¥ë¡¦¥¸¥ã¥âʸ»ú
157 (¤¤¤í¤¤¤í¤Ê´Ú¹ñ¡¦Ä«Á¯Ê¸»ú¤ÎÉä¹æ²½¡£
158 ¤³¤ÎÉä¹æ²½¤Ç¤Ï¡¢¥Ï¥ó¥°¥ë²»Àá¤Î¥°¥ê¥Õ¤¬
159 3 ¤Ä¤Þ¤¿¤Ï 2 ¤Ä¤ÎÊì²»¡¦»Ò²»¥³¡¼¥É¤ÎÁȤ߹ç¤ï¤»¤ÇÉä¹æ²½¤µ¤ì¤ë) ¤Ï¥µ¥Ý¡¼¥È¤·¤Ê¤¤¡£
162 Level 1 ¤ÈƱÍͤÀ¤¬¡¢¹çÀ®Ê¸»ú¤òɬ¿Ü¤È¤¹¤ë¸À¸ì¤Î¤¿¤á¤Îʸ»ú
163 (Î㤨¤Ð¡¢¥¿¥¤Ê¸»ú¡¦¥é¥ª¥¹Ê¸»ú¡¦¥Ø¥Ö¥é¥¤Ê¸»ú¡¦¥¢¥é¥Ó¥¢Ê¸»ú¡¦
164 ¥Ç¡¼¥ô¥¡¥Ê¡¼¥¬¥ê¡¼Ê¸»ú¡¦¥Þ¥ì¥ä¡¼¥é¥àʸ»ú¤Ê¤É) ¤Ï»È¤¨¤ë¡£
171 .B ¥æ¥Ë¥³¡¼¥É¡¦¥³¥ó¥½¡¼¥·¥¢¥à (Unicode Consortium)
173 .B Unicode 3.0 Standard
174 ¤Ï¡¢ISO 10646-1:2000 ¤Ëµ½Ò¤µ¤ì¤¿
175 .B UCS Basic Multilingual Plane
176 ¤Î level 3 ¼ÂÁõ¤ÈÁ´¤¯Æ±¤¸¤Ç¤¢¤ë¡£
178 ¤Ç¤Ï ISO 10646-2 ¤ÎÊä½õÌ̤¬Äɲ䵤ì¤Æ¤¤¤ë¡£
179 Unicode Consortium ¤«¤éȯ¹Ô¤µ¤ì¤ë Unicode µ¬³Ê¤Èµ»½Ñ¥ì¥Ý¡¼¥È¤Ë¤è¤ê¡¢
180 ¤¤¤í¤¤¤í¤Êʸ»ú¤Î°ÕÌ£¤È¿ä¾©¤µ¤ì¤ë»ÈÍÑË¡¤Ë¤Ä¤¤¤Æ¤Î¹¹¤Ê¤ë¾ðÊó¤¬ÆÀ¤é¤ì¤ë¡£
181 ¤³¤ì¤é¤Îµ¬³Ê½ñ¤äµ»½Ñ¥ì¥Ý¡¼¥È¤Ç¡¢Unicode ʸ»úÎó¤ò
182 ÊÔ½¸¡¦Ê¤ÙÂؤ¨¡¦Èæ³Ó¡¦Àµµ¬²½¡¦ÊÑ´¹¡¦É½¼¨¤¹¤ë¤¿¤á¤Î
183 ¥¬¥¤¥É¥é¥¤¥ó¤È¥¢¥ë¥´¥ê¥º¥à¤¬Ê¬¤«¤ë¡£
184 .SS "Linux ¤Ë¤ª¤±¤ë Unicode"
185 GNU/Linux ¤Ç¤Ï¡¢C ¸À¸ì¤Î·¿
187 ¤ÏÉä¹æÉÕ¤ 32 ¥Ó¥Ã¥ÈÀ°¿ô·¿¤Ç¤¢¤ë¡£
188 ¤½¤ÎÃÍ¤Ï C ¥é¥¤¥Ö¥é¥ê¤Ë¤è¤ê (¤¹¤Ù¤Æ¤Î¥í¥±¡¼¥ë¤Ë¤ª¤¤¤Æ) ¾ï¤Ë
190 ¥³¡¼¥É¤ÎÃͤȤ·¤Æ²ò¼á¤µ¤ì¤ë¡£
191 ¤³¤ì¤ò GNU C ¥é¥¤¥Ö¥é¥ê¤¬¥¢¥×¥ê¥±¡¼¥·¥ç¥ó¤ËÃΤ餻¤ë¤¿¤á¤Îµ¬Ìó¤È¤·¤Æ¡¢
193 .B __STDC_ISO_10646__
195 ¤³¤ì¤Ï ISO C99 µ¬³Ê¤Ç»ØÄꤵ¤ì¤Æ¤¤¤ë¡£
199 ¥Þ¥ë¥Á¥Ð¥¤¥È¥¨¥ó¥³¡¼¥É¤Ç¤Ï¡¢Æþ½ÐÎÏ¥¹¥È¥ê¡¼¥à¡¦Ã¼ËöÄÌ¿®¡¦
200 ¥×¥ì¡¼¥ó¥Æ¥¥¹¥È¥Õ¥¡¥¤¥ë¡¦¥Õ¥¡¥¤¥ë̾¡¦´Ä¶ÊÑ¿ô¤Ë¤ª¤¤¤Æ¡¢
201 UCS/Unicode ¤ò ASCII ¤Î¤è¤¦¤Ë»È¤¦¤³¤È¤¬¤Ç¤¤ë¡£
202 UTF-8 ¤òʸ»ú¥¨¥ó¥³¡¼¥É¤È¤·¤Æ»È¤¦¤³¤È¤ò
203 Á´¤Æ¤Î¥¢¥×¥ê¥±¡¼¥·¥ç¥ó¤ËÃΤ餻¤ë¤¿¤á¤Ë¤Ï¡¢
204 ("LANG=en_GB.UTF-8" ¤Î¤è¤¦¤Ë) ´Ä¶ÊÑ¿ô¤ò»È¤Ã¤ÆŬÀÚ¤Ê
206 ¤òÁªÂò¤·¤Ê¤±¤ì¤Ð¤Ê¤é¤Ê¤¤¡£
208 .B nl_langinfo(CODESET)
209 ´Ø¿ô¤ÏÁªÂò¤µ¤ì¤¿¥¨¥ó¥³¡¼¥É¤Î̾Á°¤òÊÖ¤¹¡£
212 ʸ»ú¤äʸ»úÎó¤ò¥·¥¹¥Æ¥àʸ»úÎ󥨥󥳡¼¥É¤ËÊÑ´¹ (µÕÊÑ´¹) ¤¹¤ë¤Î¤Ë»È¤ï¤ì¤ë
218 ¤È¤¤¤Ã¤¿¥é¥¤¥Ö¥é¥ê´Ø¿ô¤Ï¡¢
219 ʸ»ú½ÐÎϤǤɤì¤À¤±¥«¡¼¥½¥ë¤¬¿Ê¤ó¤À¤« (0\(en2) ¤òÊÖ¤¹¡£
221 °ìÈÌŪ¤Ë¸À¤¦¤È¡¢Linux ¤Ç¤Ï¸½ºß¤Î¤È¤³¤í
222 BMP ¤Î level 1 ¼ÂÁõ¤Î¤ß¤ò»È¤¦¤Ù¤¤Ç¤¢¤ë¡£
223 ¤¢¤ë¸À¸ì¤Îʸ»ú (¤È¤¯¤Ë¥¿¥¤Ê¸»ú) ¤Ç¤Ï¡¢
224 ¥Ù¡¼¥¹Ê¸»úÅö¤¿¤ê 2 ¤Ä¤Þ¤Ç¤Î¹çÀ®Ê¸»ú¤ò»È¤¦¤³¤È¤¬
225 UTF-8 üËö¥¨¥ß¥å¥ì¡¼¥¿¤È ISO 10646 ¥Õ¥©¥ó¥È (level 2) ¤Ç¥µ¥Ý¡¼¥È¤µ¤ì¤Æ¤¤¤ë¡£
226 ¤·¤«¤·°ìÈÌŪ¤Ë¸À¤¨¤Ð¡¢¤â¤·²Äǽ¤Ê¤é¤Ð¤¢¤é¤«¤¸¤á¹çÀ®¤·¤¿Ê¸»ú¤ò»È¤¦¤Ù¤¤Ç¤¢¤ë
227 (Unicode ¤Ç¤Ï¡¢¤³¤ì¤ò
228 .B "Normalization Form C (¹çÀ®Ê¸»ú¤ÎÀµµ¬²½·Á¼°)"
230 .SS ¥×¥é¥¤¥Ù¡¼¥È¡¦¥¨¥ê¥¢
232 ¤Î 0xe000 ¡Á 0xf8ff ¤ÎÈϰϤϡ¢µ¬³Ê¤Ç¤Ï¤¤¤«¤Ê¤ëʸ»ú¤â³ä¤êÅö¤Æ¤º¡¢
233 »äŪ¤Ê»ÈÍѤΤ¿¤á¤ËͽÌ󤵤ì¤Æ¤¤¤ë¡£
234 Linux ¥³¥ß¥å¥Ë¥Æ¥£¤Ç¤Ï¡¢
235 ¤³¤Î¥×¥é¥¤¥Ù¡¼¥È¡¦¥¨¥ê¥¢¤ò¤µ¤é¤ËºÙ¤«¤¯Ê¬³ä¤·¤Æ»ÈÍѤ¹¤ë¡£
236 0xe000 ¡Á 0xefff ¤ÎÈϰϤϥ¨¥ó¥É¡¦¥æ¡¼¥¶¡¼¤¬¸Ä¡¹¤Ë»ÈÍѤ¹¤ë¤³¤È¤¬¤Ç¤¤ë¡£
237 0xf000 ¡Á 0xf8ff ¤ÎÈÏ°Ï¤Ï Linux Zone ¤Ç
238 Á´¤Æ¤Î Linux ¥æ¡¼¥¶¡¼¤Ç¶¦Ä̤˻ÈÍѤ¹¤ë¡£
239 Linux Zone ¤Ø¤Îʸ»ú³ä¤êÅö¤Æ¤ÎÅÐÏ¿¤Ï¡¢
240 ¸½ºß H. Peter Anvin <Peter.Anvin@linux.org> ¤Ë¤è¤Ã¤Æ´ÉÍý¤µ¤ì¤Æ¤¤¤ë¡£
244 Information technology \(em Universal Multiple-Octet Coded Character
245 Set (UCS) \(em Part 1: Architecture and Basic Multilingual Plane.
246 International Standard ISO/IEC 10646-1, International Organization
247 for Standardization, Geneva, 2000.
252 http://www.iso.ch/ ¤«¤éÃíʸ¤Ç¤¤ë CD-ROM ¤Ç PDF ¥Õ¥¡¥¤¥ë¤È¤·¤ÆÆþ¼ê¤Ç¤¤ë¡£
255 The Unicode Standard, Version 3.0.
256 The Unicode Consortium, Addison-Wesley,
257 Reading, MA, 2000, ISBN 0-201-61633-5.
260 S. Harbison, G. Steele. C: A Reference Manual. Fourth edition,
261 Prentice Hall, Englewood Cliffs, 1995, ISBN 0-13-326224-3.
263 C ¥×¥í¥°¥é¥à¸À¸ì¤Ë¤Ä¤¤¤Æ¤Î¤È¤Æ¤âÎɤ¤»²¹Í½ñ¤Ç¤¢¤ë¡£
264 Âè»ÍÈǤǤϡ¢¥ï¥¤¥Éʸ»ú¤ä¥Þ¥ë¥Á¥Ð¥¤¥Èʸ»ú¥¨¥ó¥³¡¼¥É¤ò°·¤¦¤¿¤á¤Î
265 ¿¤¯¤Î¿·¤·¤¤ C ¥é¥¤¥Ö¥é¥ê´Ø¿ô¤¬
266 ²Ã¤¨¤é¤ì¤¿ ISO C90 µ¬³Ê¤Î 1994 Amendment 1 ¤ò¥«¥Ð¡¼¤·¤Æ¤¤¤ë¡£
267 ¤·¤«¤·¡¢¥ï¥¤¥Éʸ»ú¤ä¥Þ¥ë¥Á¥Ð¥¤¥Èʸ»ú¤Î¥µ¥Ý¡¼¥È¤ò
268 ¹¹¤Ë²þÁ±¤·¤¿ ISO C99 ¤Ï¡¢¤Þ¤À¥«¥Ð¡¼¤·¤Æ¤¤¤Ê¤¤¡£
271 Unicode µ»½Ñ¥ì¥Ý¡¼¥È¡£
273 http://www.unicode.org/unicode/reports/
277 Markus Kuhn: Unix/Linux ¤Î¤¿¤á¤Î UTF-8 ¤È Unicode ¤Î FAQ¡£
279 http://www.cl.cam.ac.uk/~mgk25/unicode.html
282 ¥á¡¼¥ê¥ó¥°¥ê¥¹¥È¤ò¹ØÆɤ¹¤ë¤¿¤á¤Î¾ðÊ󤬤¢¤ë¡£
283 Linux ¤Ç Unicode ¤ò»È¤¦¾ì¹ç¤Î¥¢¥É¥Ð¥¤¥¹¤òõ¤¹¤Î¤Ë°ìÈÖÎɤ¤¾ì½ê¤Ç¤¢¤ë¡£
287 Bruno Haible: Unicode HOWTO.
289 ftp://ftp.ilog.fr/pub/Users/haible/utf8/Unicode-HOWTO.html
292 ¤³¤Î¥Þ¥Ë¥å¥¢¥ë¡¦¥Ú¡¼¥¸¤òºÇ¸å¤Ë²þÄû¤·¤¿»þÅÀ¤Ç¡¢
295 ¥µ¥Ý¡¼¥È¤Ï´°À®¤·¤Æ¤¤¤ë¡£
296 XFree86 ¤Ë¤è¤ë¥µ¥Ý¡¼¥È¤Ï¿Ê¹ÔÃæ¤Ç¤¢¤ë¡£
298 ¥í¥±¡¼¥ë¤Ç²÷Ŭ¤Ë»È¤¨¤ë¥¢¥×¥ê¥±¡¼¥·¥ç¥ó
299 (¿¤¯¤ÎÍ̾¤Ê¥¨¥Ç¥£¥¿) ¤ÎºîÀ®¤Ï¡¢¤Þ¤À¿Ê¹ÔÃæ¤Ç¤¢¤ë¡£
302 ¥µ¥Ý¡¼¥È¤Ç¤ÏÄ̾ï CJK ¤Î 2 ¥ï¥¤¥Éʸ»ú¤¬Ä󶡤µ¤ì¤ë¡£
303 ñ½ã¤Ê½Å¤ÍÂǤÁ¤Ë¤è¤ë¹çÀ®Ê¸»ú¤¬Ä󶡤µ¤ì¤ë¾ì¹ç¤â¤¢¤ë¡£
304 ¤·¤«¤·¡¢±¦¤«¤éº¸¤Ø½ñ¤¯Ê¸»ú¤ä¥Ø¥Ö¥é¥¤Ê¸»ú¡¦¥¢¥é¥Ó¥¢Ê¸»ú¡¦¥¤¥ó¥É¸ì·Ïʸ»ú¤Ê¤É¤Î
305 ¹ç»ú¤ÎÃÖ¤´¹¤¨¤òɬÍפȤ¹¤ëʸ»ú¤Ï¥µ¥Ý¡¼¥È¤µ¤ì¤Æ¤¤¤Ê¤¤¡£
306 ¸½ºß¡¢¤³¤ì¤é¤Îʸ»ú¤ÏÀöÎý¤µ¤ì¤¿¥Æ¥¥¹¥ÈÉÁ²è¥¨¥ó¥¸¥ó¤òÈ÷¤¨¤¿
307 GUI ¥¢¥×¥ê¥±¡¼¥·¥ç¥ó (HTML ¥Ó¥å¡¼¥¢¡¦¥ï¡¼¥É¥×¥í¥»¥Ã¥µ) ¤Ç¤Î¤ß
310 .\" Markus Kuhn <mgk25@cl.cam.ac.uk>