The Unicode Standard Version 10.0 Core Specification

Size: px
Start display at page:

Download "The Unicode Standard Version 10.0 Core Specification"

Transcription

1 The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trademark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at For information about the Unicode terms of use, please see The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. Version Includes bibliographical references and index. ISBN ( 1. Unicode (Computer character set) I. Unicode Consortium. QA268.U ISBN Published in Mountain View, CA June 2017

2 987 I Index The index covers the contents of this core specification. To find topics in the Unicode Standard Annexes, Unicode Technical Standards, and Unicode Technical Reports, use the search feature on the Unicode website. For definitions of terms used, see the glossary on the Unicode website. To find the code points for specific characters or the code ranges for particular scripts, use the Character Index on the Unicode website. (See Section B.3, Other Unicode Online Resources.) A abbreviation, Coptic abjads , 363 abstract character sequences definition abstract characters definition abugidas , 262, 445, 615 accent marks see diacritics accented characters encoding Latin normalization accounting numbers, ideographic acrophonic numerals , 311 Adlam reference materials Aegean numbers Africa scripts of Afrikaans Ahom reference materials Ainu Aiton Alchemical Symbols reference materials Algonquian Ali Gali aliases character name , 183 informative normative property property value allocation areas allocation of encoded characters Alphabetic (informative property) alphabets European mathematical alternate format characters (deprecated). 194, Americas scripts of Amharic Anatolian hieroglyphs reference materials Ancient Symbols angle brackets (U+2329 and U+232A) deprecated for technical publication Annexes, Unicode Standard (UAX) xxxiii, 901 as components of Unicode Standard conformance list of annotation characters use in plain text discouraged ANSI/ISO C wchar_t and Unicode apostrophe (U+0027) Arabic digits Arabic-Indic digits signs used with ArabicShaping.txt , 384, 399 Aramaic , 445, 528, 555, 560 areas of the Unicode Standard ARIB Armenian arrows ASCII characters with multiple semantics transparency of UTF Unicode modeled on zero extension , 913 Assamese assigned code points , 30 Athapascan

3 Index 988 atomic character boundaries Avestan reference materials B Balinese reference materials Bamum reference materials Bangla base characters definition multiple ordered before combining marks , 330 Basic Multilingual Plane (BMP) , 44 allocation areas representation in UTF Basque Bassa Vah reference materials Batak reference materials benefits of Unicode Bengali Bhaiksuki reference materials Bidi Class (normative property) Bidi Mirrored (normative property) Bidi Mirroring Glyph (informative property) BidiMirroring.txt Bidirectional Algorithm, Unicode , 84 bidirectional ordering controls bidirectional text , 84 Middle Eastern scripts nonspacing marks in punctuation in big-endian definition Bihari binary comparison and sort order caution for UTF UTF differences , 235 UTF block , 90, 259, 875 headers BMP see Basic Multilingual Plane BNF (Backus-Naur Form) BOCU-1 see UTN #6, BOCU-1 MIME-Compatible Unicode Compression Bodhi Bodo BOM (U+FEFF) , 67, , Bopomofo boundaries, text , 191, , 230 see also UAX #14, Unicode Line Breaking Algorithm see also UAX #29, Unicode Text Segmentation boustrophedon , 353 box drawing symbols Brahmi , 555, , 560, 617 reference materials Braille Breton Buginese Buhid Bulgarian bullets numeric Burmese see Myanmar Byelorussian byte order mark (BOM) (U+FEFF). 40, 67, , byte ordering changing conformance byte serialization , 67 Byzantine Musical Symbols C C language wchar_t and Unicode C0 and C1 control codes , 189, 840 Cambodian see Khmer Canadian Aboriginal Syllabics reference materials candrabindu , 590 canonical composite characters see canonical decomposable characters canonical composition algorithm canonical decomposable characters definition canonical decomposition definition mappings canonical equivalence definition nonspacing marks canonical equivalent character sequences conformance canonical mappings see canonical decomposition mappings canonical ordering algorithm canonical precomposed characters see canonical decomposable characters Cantonese

4 Index 989 capital letters , 238, 291 Carian reference materials carriage return (U+000D) (CR) , 841 carriage return and line feed (CRLF) case and text processes beyond ASCII camelcase case folding case operations (conformance) , case operations and normalization case operations, reversibility cased (definition) case-insensitive comparison , 233, 242 casing context (definition) conversion detection European alphabets exceptional Latin pairs , 299 Georgian lowercase , 238, 291 mapping tables mappings , 168, mappings noted in code charts titlecase , 238 Turkish I , 295 uppercase , 238, 291 see also default case Case (normative property) , 238 CaseFolding.txt , 242 caseless letters Catalan Caucasian Albanian reference materials cedilla CEF see character encoding forms CES see character encoding schemes Chakma reference materials Cham reference materials character encoding forms (CEF) , 913 see also Unicode encoding forms character encoding model , 42 see also UTR #17, Unicode Character Encoding Model character encoding schemes (CES) see also Unicode encoding schemes character encoding standards coverage by Unicode Character Index character literals, Unicode code point notation U character names , , 917 aliases , 183 conventions for CJK ideographs for control codes , 189 in code charts matching character properties see properties see also individual properties, e.g. Combining Class character semantics , 80, 87 88, 918 as Unicode design principle ASCII definition character sequences abstract see abstract character sequences canonical equivalent see canonical equivalent character sequences compatibility equivalent see compatibility equivalent character sequences conformance named character sequences, combining character shaping selectors (deprecated) character tabulation (U+0009) characters abstract see abstract characters arrangement in Unicode assigned , 30 boundaries canonical decomposable see canonical decomposable characters classes code charts , 902 coded see encoded characters combining see combining characters compatibility decomposable see compatibility decomposable characters composite see decomposable characters concept of , 60 conformance definitions confusable conversion decomposable see decomposable characters deprecated see deprecated characters encoded see encoded characters encoding forms see encoding forms encoding schemes see encoding schemes end-user perceived format control , 68, 267, glyphs, relationship to graphic identity (definition) ignored in processing

5 Index 990 interpretation layout control , modification names list names see character names not encoded in Unicode number encoded in Version precomposed see decomposable characters properties see properties semantics see character semantics special , supplementary see supplementary characters transcoding unsupported characters, not glyphs in spoofing Unicode principle charsets IANA registered names charts, character code see code charts Cherokee reference materials Chinese Cantonese Hakka Mandarin Minnan (Hokkien/Fujian, incl. Taiwanese). 706 simplified and traditional Chu hán Chu Nôm citations for properties Unicode algorithms Unicode Standard CJK ideographs , accounting numbers CJK Compatibility Ideographs CJK Compatibility Supplement CJK Strokes , 931 CJK Unified Ideographs CJK Unified Ideographs Extension A CJK Unified Ideographs Extension B CJK Unified Ideographs Extension C CJK Unified Ideographs Extension D CJK Unified Ideographs Extension E CJK Unified Ideographs Extension F code charts compatibility ideographs in Plane component structure encoding blocks ideographic description sequences ideographic variation mark (U+303E) KangXi radicals , names numbers numeric values , 207 order of encoding radicals source standards unknown or unavailable Vietnamese CJK Miscellaneous Area CJK punctuation and symbols compatibility forms overscores and underscores quotation marks sesame dots vertical forms CJK-JRG (Chinese/Japanese/Korean Joint Research Group) CJKV Ideographs Area CLDR (Unicode Common Locale Data Repository). 903 cluster boundaries code charts , 902 representative glyphs code point sequences notation code points , 29 assigned , 30 assignment categories default ignorable , 254 definition designated notation number in Unicode Standard private-use see private-use code points reserved see reserved code points semantics surrogate see surrogates unassigned see unassigned code points undesignated code positions see code points code set independence code unit sequences definition ill-formed (definition) notation well-formed (definition) code units definition isolated code values see code units coded character representations see coded character sequences coded character sequences definition coded characters see encoded characters

6 Index 991 codespace see Unicode codespace coeng , 634 Collation Algorithm, Unicode (UCA) collation see sorting collation tables combining character sequences , 106 defective definition Latin line breaking matching order of base character and marks , 330 rendering selection truncation combining characters , , blocking reordering canonical ordering , 138, 170 combining marks definition dependence display order keyboard input ligatures multiple multiple base characters normalization of ordering conventions rendering of marks reordrant script-specific split strikethrough subjoined typographical interaction , 170 vertical stacking see also diacritics Combining Class (normative property) combining classes , 170, class zero characters definition combining grapheme joiner (U+034F) combining half marks , 338 combining marks see combining characters comma below Compatibility and Specials Area , 50 compatibility characters compatibility composite characters see compatibility decomposable characters compatibility decomposable characters definition compatibility decomposition definition compatibility decomposition mappings compatibility equivalence definition compatibility equivalent character sequences conformance compatibility mappings see compatibility decomposition mappings compatibility precomposed characters see compatibility decomposable characters compatibility variants mapping composite characters see decomposable characters Composition Exclusion (normative property) compression see also UTS #6, A Standard Compression Scheme for Unicode (SCSU) conferences conformance definitions examples ISO/IEC implementations requirements confusables conjunct consonants Indic , 451 Myanmar selection of clusters contextual shaping apostrophe Arabic not used for Hebrew final forms quotation marks Syriac contour tones control codes , 68, 840 graphics for names properties semantics , 841 specified in Unicode control sequences conversion of characters , , 256 convertibility as Unicode design principle Coptic , reference materials Coptic Epact numbers corporate use subarea corrigenda CR (U+000D carriage return) , 841 CRLF (carriage return and line feed) Croatian digraphs culturally expected sorting , 232

7 Index 992 Cuneiform Old Persian Sumero-Akkadian Ugaritic Cuneiform and Hieroglyphic Area Cuneiform and Hieroglyphs currency symbols block currency symbols encoded in other blocks currency symbols, other dollar sign, form and usage euro sign lari sign lira sign, compatibility usage lira sign, Turkish peso signs, usage ruble sign rupee signs, Indian, usage yen and yuan signs, usage cursive joining Arabic control characters for , , 531, 844 Mandaic Mongolian N Ko Phags-pa Syriac transparency cursive scripts Cypriot reference materials see also Linear B Cyrillic Czech D danda, in Devanagari block Danish dashes Database, Unicode Character see Unicode Character Database (UCD) dead consonants, Indic dead keys decomposable characters definition normalization of decomposition , canonical see canonical decomposition compatibility see compatibility decomposition definition in normalization mapping, definition mappings noted in code charts default case algorithms , conversion detection folding default caseless matching default grapheme clusters see also UAX #29, Unicode Text Segmentation Default Ignorable Code Point (property) default ignorable code points , 254 default property values definition defective combining character sequences definition dependent vowel signs Indic Khmer Philippine scripts deprecated characters , 879 alternate format , definition Derived Age (property) derived properties definition DerivedCoreProperties.txt , 166, 254 DerivedNormalizationProps.txt Deseret reference materials design goals of Unicode design principles of Unicode designated code points Devanagari Dhivehi diacritics , 330 alternative glyphs , 330 Czech display in isolation , 269, 331 double , 192, 332 German dialectology Greek , 310 Latin Latvian mathematical on i and j rendering Slovak spacing clones of , 332 symbol , 337 see also combining characters dictionary symbols digit form names digits Arabic Arabic-Indic

8 Index 993 compatibility decimal glyph variants hexadecimal Myanmar national shapes Shan superscript and subscript Tai Laing Tai Tham digraphs , 301, 303 dingbats directionality , 53 East Asian scripts Middle Eastern scripts Mongolian musical symbols normative property Ogham Old Italic Philippine scripts Runic discussion list for Unicode Dogri Domino Tiles dotless i , 295 dotted circle in code charts , 331 in fallback rendering to indicate diacritic to indicate vowel sign placement double diacritics , 192, 332 Duployan reference materials Dutch , 298 dynamic composition as Unicode design principle Dzongkha E East Asian scripts writing direction see also CJK ideographs Eastern Arabic-Indic digits EBCDIC newline function editing, text boundaries for efficiency as Unicode design principle Egyptian hieroglyphs reference materials Elbasan reference materials ellipsis discussion list for Unicode emoji , 902 animal symbols charts cultural symbols zodiacal symbols emoji modifiers emoticons Enclosed Alphanumerics enclosing marks definition encoded characters , 29 allocation definition encoding form conversion definition encoding forms ISO/IEC definitions encoding forms, Unicode see Unicode encoding forms encoding model for Unicode characters , 42 see also UTR #17, Unicode Character Encoding Model encoding schemes encoding schemes, Unicode see Unicode encoding schemes endian ordering see byte order mark (BOM) (U+FEFF) end-user subarea English equivalent sequences as Unicode design principle case-insensitivity , 242 combining characters in matching conformance Hangul syllables in sorting and searching language-specific security implications see also canonical equivalence see also compatibility equivalence see also encoding forms, encoding schemes errata xxxvi, 76, 903 escape sequences not used in Unicode , 4 Esperanto Estonian Ethiopic reference materials Etruscan European scripts ancient eyelash-ra

9 Index 994 F fallback rendering of nonspacing marks FAQ (Frequently Asked Questions) Faroese Farsi , 374 featural syllabaries FF (U+000C form feed) , 841 file separator (U+001C) Finnish Finno-Ugric Transcription (FUT) see Uralic Phonetic Alphabet (UPA) fixed-width Unicode encoding form (UTF-32)... 35, 124 flat tables Flemish fleurons fonts and Unicode characters for mathematical alphabets style variation for symbols form feed (U+000C) (FF) , 841 format control characters , 68, 267, deprecated prefixed , 334 stateful fraction characters fraction slash (U+2044) , 801 French Frisian FTP site, Unicode Consortium fullwidth forms in East Asian encodings futhark G Garshuni Ge ez General Category (normative property) list of values general punctuation General Scripts Area geometrical symbols Georgian German geta mark (U+3013) Glagolitic reference materials Glossary glyph selection tables glyphs , 15 characters, relationship to diacritics alternative , 330 Greek alternative Latin alternative mathematical alternative missing representative in code charts standardized variants symbols alternative golden numbers Gothic reference materials Grantha reference materials grapheme base definition grapheme clusters , see also UAX #29, Unicode Text Segmentation default definition grapheme extender definition grapheme joiner, combining (U+034F) graphic characters Greek acrophonic numerals , 311 alternative glyphs ancient musical notation editorial marks letters as symbols , 808 see also Cypriot, Linear B Greek editorial marks reference materials Greenlandic group separator (U+001D) guillemets Gujarati Gurmukhi H Hakka halant see also virama half marks, combining , 338 half-consonants, Indic halfwidth forms in East Asian encodings Han ideographs see CJK ideographs Han unification and language tags history language usage source separation rule , 690 source standards hand symbols Hangul Area

10 Index 995 Hangul syllables , and combining marks as grapheme clusters canonical decomposition collation composition conjoining jamo equivalent sequences Hangul Compatibility Jamo Hangul Jamo Hangul Syllables block Johab set name generation normalization standard Hangzhou numerals Hanja see CJK ideographs Hanunóo Hanzi see CJK ideographs harakat hasant hash tables Hatran reference materials Hebrew hentaigana hieroglyphs Anatolian Egyptian Meroitic high surrogate definition high-surrogate code points , 862 high-surrogate code units higher-level protocols definition Hindi Hiragana horizontal tab (U+0009) HTML newline function Hungarian hyphenation as a text process hyphens , 844 I I Ching symbols IANA charset names Icelandic identifiers see also UAX #31, Unicode Identifier and Pattern Syntax Ideographic (informative property) ideographic description sequences Ideographic Rapporteur Group (IRG) ideographs see also CJK ideographs IICore , 928 ill-formed definition Imperial Aramaic reference materials implementation guidelines in a Unicode encoding form definition in-band mechanisms India Official scripts Indian rupee signs, usage Indic scripts principles, in terms of Devanagari relation to ISCII standard Indonesia and Oceania scripts of Indonesian industry character sets covered in Unicode information separators (U+001C..U+001F) informative properties definition Inscriptional Pahlavi Inscriptional Parthian inside-out rule interchange restrictions International Phonetic Alphabet (IPA) 260, reference materials Spacing Modifier Letters see also phonetic alphabets internationalization Internationalization & Unicode Conference Internet protocols UTF-8 as preferred encoding Inuktitut invisible operators iota subscript IPA see International Phonetic Alphabet IRG (Ideographic Rapporteur Group) Irish , 360 ISCII standard and Unicode ISO/IEC conformance of Unicode implementations..918 encoding forms synchrony with Unicode Standard timeline compared to Unicode versions Italian ITC Zapf Dingbats IUC see Internationalization & Unicode Conference

11 Index 996 J jamos see Hangul syllables Japanese Javanese reference materials Jawi jihvamuliya , 590 Johab joiners combining grapheme joiner (U+034F) word joiner (U+2060) zero width joiner (U+200D) , 846 justification K Kaithi reference materials Kana (Hiragana and Katakana) Kanbun KangXi radicals , Kanji see CJK ideographs Kannada Kashmiri Katakana Kawi , 667 Kayah Li reference materials KC (normalization form) see Normalization Form KC KD (normalization form) see Normalization Form KD keytop labels Khamti Shan Kharoshthi reference materials Khmer characters not recommended syllable components, order of Khojki reference materials Khudawadi reference materials killer Batak Brahmi Meetei Mayek Myanmar (asat) see also virama Konkani Korean Hangul see Hangul Kurdish L Ladino language tags , and Han unification use strongly discouraged , 873 Lanna Lao last-resort glyphs Latin alternative glyphs Basic Latin encoding blocks IPA Extensions Latin Extended Additional Latin Extended-A Latin Extended-B Latin Extended-C Latin Extended-D Latin Extended-E Latin Ligatures Latin-1 Supplement Phonetic Extensions Latvian , 305 cedilla layout control characters , leading surrogates see high-surrogate code units legibility criterion for plain text Lepcha reference materials letter spacing letterlike symbols LF (U+000A line feed) , 841 ligatures Arabic combining characters on control characters for for nonspacing marks Latin selection Syriac Limbu reference materials line breaking , control characters in South Asian scripts , 627, 641 recommendations see also UAX #14, Unicode Line Breaking Algorithm line feed (U+000A) (LF) , 841 line separator (U+2028) (LS) , 845 line tabulation (U+000B) (VT)

12 Index 997 Linear A reference materials Linear B reference materials see also Cypriot linear boundaries Lisu reference materials Lithuanian little-endian definition logical order as Unicode design principle exceptions to logograph logosyllabaries low surrogate definition low-surrogate code points , 862 low-surrogate code units lowercase , 238, 291 LS (U+2028 line separator) , 845 Lycian reference materials Lydian reference materials M MacOS newline function Mahajani reference materials Mahjong Tiles mail discussion list for Unicode Maithili major version Malay Malay, Patani Malayalam Suriyani , 502 Maltese Manchu Mandaic reference materials Mandarin Manden Manichaean reference materials map symbols mapping tables see tables of character data Marathi , 456, 463 Marchen reference materials markup languages and Unicode conformance line breaking Masaram Gondi reference materials Mathematical (informative property) mathematical expression format characters see also UTR #25, Unicode Support for Mathematics mathematical symbols alphabets alphanumeric fonts format characters fragments for typesetting invisible operators operators reference materials standardized variants MathML matras , 449 Meetei Mayek reference materials Mende Kikakui reference materials Meroitic cursive hieroglyphs reference materials Miao reference materials Middle Eastern scripts ancient Min Minnan (Hokkien/Fujian, incl. Taiwanese) minor version minus sign commercial (U+2052) mirrored property see Bidi Mirrored (normative property) mirroring of paired punctuation Miscellaneous Symbols missing glyphs Modi reference materials modifier letters Modifier Letters, Spacing Mongolian , 568 writing direction Mro reference materials , 965 Multani reference materials

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 6.1 Core Specification

The Unicode Standard Version 6.1 Core Specification The Unicode Standard Version 6.1 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Unicode definition list

Unicode definition list abstract character D3 3.3 2 abstract character sequence D4 3.3 2 accent mark alphabet alphabetic property 4.10 2 alphabetic sorting annotation ANSI Arabic digit 1 Arabic-Indic digit 3.12 1 ASCII assigned

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

(URW) ++ UNICODE APERÇU 1. Nimbus Sans Block Name. Regular. Bold. Light Vers Regular. Regular. Bold. Medium. Vers Vers Vers. 4.

(URW) ++ UNICODE APERÇU 1. Nimbus Sans Block Name. Regular. Bold. Light Vers Regular. Regular. Bold. Medium. Vers Vers Vers. 4. UNICODE APERÇU 1 Unicode Code points (Plane, Plane 2) 93+9 HKSCS Alternates 8498 8498 31 425 1 Latin Extended-A 5 U+2FF U+52F U+4FF U+F U+5 U+5FF U+7 U+74F U+6FF U+77F U+7 U+7BF U+ U+97F U+7FF U+9FF U+A7F

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS

Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS ISO/IEC JTC1/SC2/WG2 N2316 Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 2001-01-09 Action: For confirmation

More information

The Unicode Standard Version 12.0 Core Specification

The Unicode Standard Version 12.0 Core Specification The Unicode Standard Version 12.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Title: Graphic representation of the Roadmap to the BMP of the UCS

Title: Graphic representation of the Roadmap to the BMP of the UCS ISO/IEC JTC1/SC2/WG2 N2045 Title: Graphic representation of the Roadmap to the BMP of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 1999-08-15 Action: For confirmation by ISO/IEC

More information

Thu Jun :48:11 Canada/Eastern

Thu Jun :48:11 Canada/Eastern Roadmaps to Unicode Thu Jun 24 2004 17:48:11 Canada/Eastern Home Site Map Search Tables Roadmap Introduction Roadmap to the BMP (Plane 0) Roadmap to the SMP (Plane 1) Roadmap to the SIP (Plane 2) Roadmap

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

To the BMP and beyond!

To the BMP and beyond! To the BMP and beyond! Eric Muller Adobe Systems Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 1 Content 1. Why Unicode 2. Character model 3. Principles of the Abstract Character Set 4.

More information

The Unicode Standard. Version 3.0. The Unicode Consortium ADDISON-WESLEY. An Imprint of Addison Wesley Longman, Inc.

The Unicode Standard. Version 3.0. The Unicode Consortium ADDISON-WESLEY. An Imprint of Addison Wesley Longman, Inc. The Unicode Standard Version 3.0 The Unicode Consortium ADDISON-WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts Harlow, England Menlo Park, California Berkeley, California Don

More information

ISO/IEC JTC 1/SC 2 N 3426

ISO/IEC JTC 1/SC 2 N 3426 ISO/IEC JTC 1/SC 2 N 3426 Date: 2000-04-04 Supersedes SC 2 N 2830 ISO/IEC JTC 1/SC 2 CODED CHARACTER SETS SECRETARIAT: JAPAN (JISC) DOC TYPE: TITLE: Other document Graphic representation of the Roadmap

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 6.0 Core Specification

The Unicode Standard Version 6.0 Core Specification The Unicode Standard Version 6.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 6.2 Core Specification

The Unicode Standard Version 6.2 Core Specification The Unicode Standard Version 6.2 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

JAVA.LANG.CHARACTER.UNICODEBLOCK CLASS

JAVA.LANG.CHARACTER.UNICODEBLOCK CLASS JAVA.LANG.CHARACTER.UNICODEBLOCK CLASS http://www.tutorialspoint.com/java/lang/java_lang_character.unicodehtm Copyright tutorialspoint.com Introduction The java.lang.character.unicodeblock class is a family

More information

The Unicode Standard Version 12.0 Core Specification

The Unicode Standard Version 12.0 Core Specification The Unicode Standard Version 12.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 7.0 Core Specification

The Unicode Standard Version 7.0 Core Specification The Unicode Standard Version 7.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

2011 Martin v. Löwis. Data-centric XML. Character Sets

2011 Martin v. Löwis. Data-centric XML. Character Sets Data-centric XML Character Sets Character Sets: Rationale Computer stores data in sequences of bytes each byte represents a value in range 0..255 Text data are intended to denote characters, not numbers

More information

2007 Martin v. Löwis. Data-centric XML. Character Sets

2007 Martin v. Löwis. Data-centric XML. Character Sets Data-centric XML Character Sets Character Sets: Rationale Computer stores data in sequences of bytes each byte represents a value in range 0..255 Text data are intended to denote characters, not numbers

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

The Unicode Standard Version 6.1 Core Specification

The Unicode Standard Version 6.1 Core Specification The Unicode Standard Version 6.1 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

Glossary. The Unicode Standard

Glossary. The Unicode Standard G Abstract Character. A unit of information used for the organization, control, or representation of textual data. (See Definition D3 in Section 3.3, Characters and Coded Representations.) Accent Mark.

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 6.2 Core Specification

The Unicode Standard Version 6.2 Core Specification The Unicode Standard Version 6.2 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

General Structure 2. Chapter Architectural Context

General Structure 2. Chapter Architectural Context This PDF file is an excerpt from The Unicode Standard, Version 5.2, issued and published by the Unicode Consortium. The PDF files have not been modified to reflect the corrections found on the Updates

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

The Unicode Standard Version 7.0 Core Specification

The Unicode Standard Version 7.0 Core Specification The Unicode Standard Version 7.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 9.0 Core Specification

The Unicode Standard Version 9.0 Core Specification The Unicode Standard Version 9.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Code Charts 17. Chapter Character Names List. Disclaimer

Code Charts 17. Chapter Character Names List. Disclaimer This PDF file is an excerpt from The Unicode Standard, Version 5.2, issued and published by the Unicode Consortium. The PDF files have not been modified to reflect the corrections found on the Updates

More information

Introduction 1. Chapter 1

Introduction 1. Chapter 1 This PDF file is an excerpt from The Unicode Standard, Version 5.2, issued and published by the Unicode Consortium. The PDF files have not been modified to reflect the corrections found on the Updates

More information

3494 Date: Supersedes SC 2 N 3426

3494 Date: Supersedes SC 2 N 3426 ISO/IEC JTC 1/SC 2 N 3494 3494 Date: 2000-10-06 Supersedes SC 2 N 3426 ISO/IEC JTC 1/SC 2 CODED CHARACTER SETS SECRETARIAT: JAPAN (JISC) DOC TYPE: Other document TITLE: ISO/IEC 10646 Roadmap [WG 2 N2313,

More information

Unicode: What is it and how do I use it?

Unicode: What is it and how do I use it? Abstract: The rationale for Unicode and its design goals and detailed design principles are presented. The correspondence between Unicode and ISO/IEC 10646 is discussed, the scripts included or planned

More information

Information, Characters, Unicode

Information, Characters, Unicode Information, Characters, Unicode Information Characters In modern computing, natural-language text is very important information. ( Number-crunching is less important.) Characters of text are represented

More information

The Unicode Standard Version 6.0 Core Specification

The Unicode Standard Version 6.0 Core Specification The Unicode Standard Version 6.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard. Unicode Summary description. Unicode character database (UCD)

The Unicode Standard. Unicode Summary description. Unicode character database (UCD) http://www.unicode.org/versions/beta-7.0.0.html 1 of 7 The Unicode Standard Home Site Map Search Contents Related Unicode Technical Standards Review and Feedback Notable Issues for Beta Reviewers 7.0.0

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Conformance 3. Chapter Versions of the Unicode Standard

Conformance 3. Chapter Versions of the Unicode Standard This PDF file is an excerpt from The Unicode Standard, Version 5.2, issued and published by the Unicode Consortium. The PDF files have not been modified to reflect the corrections found on the Updates

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

UNICODE SCRIPT NAMES PROPERTY

UNICODE SCRIPT NAMES PROPERTY 1 of 10 1/29/2008 10:29 AM Technical Reports Proposed Update to Unicode Standard Annex #24 UNICODE SCRIPT NAMES PROPERTY Version Unicode 5.1.0 draft2 Authors Mark Davis (mark.davis@google.com), Ken Whistler

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

RomanCyrillic Std v. 7

RomanCyrillic Std v. 7 https://doi.org/10.20378/irbo-52591 RomanCyrillic Std v. 7 Online Documentation incl. support for Unicode v. 9, 10, and 11 (2016 2018) UNi code A З PDF! Ѿ Sebastian Kempgen 2018 RomanCyrillic Std: new

More information

Google Search Appliance

Google Search Appliance Google Search Appliance Search Appliance Internationalization Google Search Appliance software version 7.2 and later Google, Inc. 1600 Amphitheatre Parkway Mountain View, CA 94043 www.google.com GSA-INTL_200.01

More information

Multimedia Data. Multimedia Data. Text Vector Graphics 3-D Vector Graphics. Raster Graphics Digital Image Voxel. Audio Digital Video

Multimedia Data. Multimedia Data. Text Vector Graphics 3-D Vector Graphics. Raster Graphics Digital Image Voxel. Audio Digital Video Multimedia Data Multimedia Data Text Vector Graphics 3-D Vector Graphics Raster Graphics Digital Image Voxel Audio Digital Video 1 Text There are three types of text that are used to produce pages of documents

More information

Proposed Update Unicode Standard Annex #34

Proposed Update Unicode Standard Annex #34 Technical Reports Proposed Update Unicode Standard Annex #34 Version Unicode 6.3.0 (draft 1) Editors Addison Phillips Date 2013-03-29 This Version Previous Version Latest Version Latest Proposed Update

More information

The Unicode Standard Version 6.0 Core Specification

The Unicode Standard Version 6.0 Core Specification The Unicode Standard Version 6.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Unicode and Standardized Notation. Anthony Aristar

Unicode and Standardized Notation. Anthony Aristar Data Management and Archiving University of California at Santa Barbara, June 24-27, 2008 Unicode and Standardized Notation Anthony Aristar Once upon a time There were people who decided to invent computers.

More information

The Unicode Standard Version 6.1 Core Specification

The Unicode Standard Version 6.1 Core Specification The Unicode Standard Version 6.1 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Proposed Update. Unicode Standard Annex #11

Proposed Update. Unicode Standard Annex #11 1 of 12 5/8/2010 9:14 AM Technical Reports Proposed Update Unicode Standard Annex #11 Version Unicode 6.0.0 draft 2 Authors Asmus Freytag (asmus@unicode.org) Date 2010-03-04 This Version Previous http://www.unicode.org/reports/tr11/tr11-19.html

More information

Roadmap to the SMP. Michael Everson, Rick McGowan, Ken Whistler.

Roadmap to the SMP. Michael Everson, Rick McGowan, Ken Whistler. SMP Home Site Map Search Tables Roadmap Introduction BMP (Plane 0) SMP (Plane 1) SIP (Plane 2) SSP (Plane 14) Not the Roadmap More Information The Unicode Standard, Version 3.0 Proposed characters Submitting

More information

ISO/TC46/SC4/WG1 N 240, ISO/TC46/SC4/WG1 N

ISO/TC46/SC4/WG1 N 240, ISO/TC46/SC4/WG1 N L2/00-220 Title: Finalized Mapping between Characters of ISO 5426 and ISO/IEC 10646-1 (UCS) Source: The Research Libraries Group, Inc. Status: L2 Member Contribution References: ISO/TC46/SC4/WG1 N 240,

More information

This document is to be used together with N2285 and N2281.

This document is to be used together with N2285 and N2281. ISO/IEC JTC1/SC2/WG2 N2291 2000-09-25 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation internationale de normalisation еждународная организация по

More information

Domain Names in Pakistani Languages. IDNs for Pakistani Languages

Domain Names in Pakistani Languages. IDNs for Pakistani Languages ا ہ 6 5 a ز @ ں ب Domain Names in Pakistani Languages س a ی س a ب او اور را < ہ ر @ س a آف ا ر ا 6 ب 1 Domain name Domain name is the address of the web page pg on which the content is located 2 Internationalized

More information

NRSI: Computers & Writing Systems

NRSI: Computers & Writing Systems NRSI: Computers & Writing Systems SIL HOME CONTACT US Search You are here: Encoding > Unicode Search Home Contact us General Initiative B@bel WSI Guidelines Encoding Principles Unicode Tutorials PUA Character

More information

General Structure 2. Chapter Architectural Context

General Structure 2. Chapter Architectural Context Chapter 2 General Structure 2 This chapter discusses the fundamental principles governing the design of the Unicode Standard and presents an informal overview of its main features. The chapter starts by

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet

Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet A. Sean Pue South Asia Language Resource Center Pre-SASLI Workshop 6/7/09 1 Objectives To understand how

More information

UNICODE IDENTIFIER AND PATTERN SYNTAX

UNICODE IDENTIFIER AND PATTERN SYNTAX Technical Reports Proposed Update Unicode Standard Annex #31 UNICODE IDENTIFIER AND PATTERN SYNTAX Version Unicode 11.0.0 (draft 1) Editors Mark Davis (markdavis@google.com) Date 2018-04-10 This Version

More information

Request for Comments: 3536 Category: Informational May Terminology Used in Internationalization in the IETF

Request for Comments: 3536 Category: Informational May Terminology Used in Internationalization in the IETF Network Working Group P. Hoffman Request for Comments: 3536 IMC & VPNC Category: Informational May 2003 Status of this Memo Terminology Used in Internationalization in the IETF This memo provides information

More information

Two distinct code points: DECIMAL SEPARATOR and FULL STOP

Two distinct code points: DECIMAL SEPARATOR and FULL STOP Two distinct code points: DECIMAL SEPARATOR and FULL STOP Dario Schiavon, 207-09-08 Introduction Unicode, being an extension of ASCII, inherited a great historical mistake, namely the use of the same code

More information

Proposed Update Unicode Standard Annex #11 EAST ASIAN WIDTH

Proposed Update Unicode Standard Annex #11 EAST ASIAN WIDTH Page 1 of 10 Technical Reports Proposed Update Unicode Standard Annex #11 EAST ASIAN WIDTH Version Authors Summary This annex presents the specifications of an informative property for Unicode characters

More information

Title: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS

Title: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS ISO/IEC JTC1/SC2 N3427 ISO/IEC JTC1/SC2/WG2 N2214 Title: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 2000-03-28

More information

ISO/IEC INTERNATIONAL STANDARD

ISO/IEC INTERNATIONAL STANDARD INTERNATIONAL STANDARD Provläsningsexemplar / Preview ISO/IEC 10646 First edition 2003-12-15 AMENDMENT 3 2008-02-15 Information technology Universal Multiple-Octet Coded Character Set (UCS) AMENDMENT 3:

More information

PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02

PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02 PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02 TABLE OF CONTENTS Introduction... 2 Input Requirements... 2 Input file... 2 Input File Processing... 2 Commands... 3 Categories of Commands... 4 Formatting Commands...

More information

Leaks in the Unicode pipeline: script, script, script

Leaks in the Unicode pipeline: script, script, script Michael Everson, Everson Typography, www.evertype.com Some 52 scripts are currently allocated in the Unicode Standard. This reflects an enormous amount of work on the part of a great many people. An examination

More information

Michael Everson, Rick McGowan, Ken Whistler

Michael Everson, Rick McGowan, Ken Whistler Roadmaps to Unicode Fri May 27 2005 23:35:41 Europe/Dublin Home Site Map Search Tables Roadmap Introduction Roadmap to the BMP (Plane 0) Roadmap to the SMP (Plane 1) Roadmap to the SIP (Plane 2) Roadmap

More information

UNICODE IDENTIFIER AND PATTERN SYNTAX

UNICODE IDENTIFIER AND PATTERN SYNTAX 1 of 21 1/29/2008 10:32 AM Technical Reports Proposed Update to Unicode Standard Annex #31 UNICODE IDENTIFIER AND PATTERN SYNTAX Version Unicode 5.1 (draft 6) Authors Mark Davis (mark.davis@google.com)

More information

Michael Everson, Rick McGowan, Ken Whistler

Michael Everson, Rick McGowan, Ken Whistler Roadmaps to Unicode Mon Dec 13 2004 12:29:02 Europe/Dublin Home Site Map Search Tables Roadmap Introduction Roadmap to the BMP (Plane 0) Roadmap to the SMP (Plane 1) Roadmap to the SIP (Plane 2) Roadmap

More information

Transliteration of Tamil and Other Indic Scripts. Ram Viswanadha Unicode Software Engineer IBM Globalization Center of Competency, California, USA

Transliteration of Tamil and Other Indic Scripts. Ram Viswanadha Unicode Software Engineer IBM Globalization Center of Competency, California, USA Transliteration of Tamil and Other Indic Scripts Ram Viswanadha Unicode Software Engineer IBM Globalization Center of Competency, California, USA Main points of Powerpoint presentation This talk gives

More information

ISO/IEC JTC 1/SC 2/WG 2 N2895 L2/ Date:

ISO/IEC JTC 1/SC 2/WG 2 N2895 L2/ Date: ISO International Organization for Standardization Organisation Internationale de Normalisation ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS) ISO/IEC JTC 1/SC 2/WG 2 N2895

More information

Network Working Group. Category: Informational July 1995

Network Working Group. Category: Informational July 1995 Network Working Group M. Ohta Request For Comments: 1815 Tokyo Institute of Technology Category: Informational July 1995 Status of this Memo Character Sets ISO-10646 and ISO-10646-J-1 This memo provides

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 6.1 Core Specification

The Unicode Standard Version 6.1 Core Specification The Unicode Standard Version 6.1 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Consent docket re WG2 Resolutions at its Meeting #35 as amended. For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R.

Consent docket re WG2 Resolutions at its Meeting #35 as amended. For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R. L2/98-389R Consent docket re WG2 Resolutions at its Meeting #35 as amended For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R. RESOLUTION M35.4 (PDAM-24 on Thaana): Unanimous to prepare

More information

Licensed Program Specifications

Licensed Program Specifications AFP Font Collection for MVS, OS/390, VM, and VSE Program Number 5648-B33 Licensed Program Specifications AFP Font Collection for MVS, OS/390, VM, and VSE, hereafter referred to as AFP Font Collection,

More information

Extract of Action items, section 16 from document N minutes from meeting M48 (for review at meeting M49, Tokyo, Japan; /29)

Extract of Action items, section 16 from document N minutes from meeting M48 (for review at meeting M49, Tokyo, Japan; /29) ISO/IEC JTC 1/SC 2/WG 2 N3103-A 2006-08-25 Extract of Action items, section 16 from document N3103 - minutes from meeting M48 (for review at meeting M49, Tokyo, Japan; 2006-09-25/29) All action items recorded

More information

UNICODE IDNA COMPATIBLE PREPROCESSSING

UNICODE IDNA COMPATIBLE PREPROCESSSING 1 of 12 1/23/2009 2:51 PM Technical Reports Proposed Draft Unicode Technical Standard #46 UNICODE IDNA COMPATIBLE PREPROCESSSING Version 1 (draft 1) Authors Mark Davis (markdavis@google.com), Michel Suignard

More information

Proposed Update Unicode Technical Standard #10

Proposed Update Unicode Technical Standard #10 of 69 7/14/2010 12:04 PM Technical Reports Proposed Update Unicode Technical Standard #10 Version 6.0.0 draft 5 Authors Editors Mark Davis (markdavis@google.com), Ken Whistler (ken@unicode.org) Date 2010-07-09

More information

Information technology Universal Multiple-Octet Coded Character Set (UCS) AMENDMENT 2: N Ko, Phags-pa, Phoenician and other characters

Information technology Universal Multiple-Octet Coded Character Set (UCS) AMENDMENT 2: N Ko, Phags-pa, Phoenician and other characters Information technology Universal Multiple-Octet Coded Character Set (UCS) AMENDMENT 2: N Ko, Phags-pa, Phoenician and other characters Page 1, Clause 1 Scope In the note, update the Unicode Standard version

More information

Talk2You User Manual Smartphone / Tablet

Talk2You User Manual Smartphone / Tablet Talk2You User Manual Smartphone / Tablet Don t Translate it. Lingmo It! language translation technology for the global market The World s First Translating Voice Messaging Software Communicate with cross-border

More information

COSC 243 (Computer Architecture)

COSC 243 (Computer Architecture) COSC 243 Computer Architecture And Operating Systems 1 Dr. Andrew Trotman Instructors Office: 123A, Owheo Phone: 479-7842 Email: andrew@cs.otago.ac.nz Dr. Zhiyi Huang (course coordinator) Office: 126,

More information

108_GILLAM.index.fm Page 817 Monday, August 19, :35 PM. Index

108_GILLAM.index.fm Page 817 Monday, August 19, :35 PM. Index 108_GILLAM.index.fm Page 817 Monday, August 19, 2002 3:35 PM Index A AAT (Apple Advanced Typography), 675 baseline adjustment, 681 caret positioning, 681 682 glyphs compound, 680 selection/placement, 678

More information

draft-hoffman-i18n-terms-02.txt July 18, 2001 Expires in six months Terminology Used in Internationalization in the IETF Status of this memo

draft-hoffman-i18n-terms-02.txt July 18, 2001 Expires in six months Terminology Used in Internationalization in the IETF Status of this memo Internet Draft draft-hoffman-i18n-terms-02.txt July 18, 2001 Expires in six months Paul Hoffman IMC & VPNC Status of this memo Terminology Used in Internationalization in the IETF This document is an Internet-Draft

More information

Google 1 April A Generalized Unified Character Code: Western European and CJK Sections

Google 1 April A Generalized Unified Character Code: Western European and CJK Sections Network Working Group Request for Comments: 5242 Category: Informational J. Klensin H. Alvestrand Google 1 April 2008 A Generalized Unified Character Code: Western European and CJK Sections Status of This

More information

FileMaker 15 Specific Features

FileMaker 15 Specific Features FileMaker 15 Specific Features FileMaker Pro and FileMaker Pro Advanced Specific Features for the Middle East and India FileMaker Pro 15 and FileMaker Pro 15 Advanced is an enhanced version of the #1-selling

More information

Proposed Update Unicode Standard Annex #9

Proposed Update Unicode Standard Annex #9 Technical Reports Proposed Update Unicode Standard Annex #9 Version Unicode 6.2.1 (draft 3) Editors Date 2012-10-26 This Version Previous Version Latest Version Latest Proposed Update Revision 28 Summary

More information

Unicode Standard Annex #9

Unicode Standard Annex #9 http://www.unicode.org/reports/tr9/tr9-24.html 1 of 30 Technical Reports Unicode Standard Annex #9 Version Unicode 6..0 Editors Date This Version Previous Version Latest Version Latest Proposed Update

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Draft. Unicode Technical Report #49

Draft. Unicode Technical Report #49 1 of 9 Technical Reports Draft Unicode Technical Report #49 Editors Ken Whistler Date 2011-07-12 This Version http://www.unicode.org/reports/tr49/tr49-2.html Previous Version http://www.unicode.org/reports/tr49/tr49-1.html

More information