The Unicode Standard Version 10.0 Core Specification
To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/.

Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trademark claim, the designations have been printed with initial capital letters or in all capitals.

Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries.

The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein.

The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided.

Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at For information about the Unicode terms of use, please see

The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. Version Includes bibliographical references and index. ISBN ( 1. Unicode (Computer character set) I. Unicode Consortium. QA268.U ISBN

Published in Mountain View, CA, June 2017
Tables

Table 2-1. The 10 Unicode Design Principles
Table 2-2. User-Perceived Characters with Multiple Code Points
Table 2-3. Types of Code Points
Table 2-4. The Seven Unicode Encoding Schemes
Table 2-5. Interaction of Combining Characters
Table 2-6. Nondefault Stacking
Table 3-1. Named Unicode Algorithms
Table 3-2. Normative Character Properties
Table 3-3. Informative Character Properties
Table 3-4. Examples of Unicode Encoding Forms
Table 3-5. UTF-16 Bit Distribution
Table 3-6. UTF-8 Bit Distribution
Table 3-7. Well-Formed UTF-8 Byte Sequences
Table 3-8. Use of U+FFFD in UTF-8 Conversion
Table 3-9. Summary of UTF-16BE, UTF-16LE, and UTF
Table Summary of UTF-32BE, UTF-32LE, and UTF
Table Combining Marks and Starter Status
Table Reorderable Pairs
Table Hangul Characters Used in Examples
Table Context Specification for Casing
Table Case Detection Examples
Table 4-1. Relationship of Casing Definitions
Table 4-2. Case Function Values for Strings
Table 4-3. Sources for Case Mapping Information
Table 4-4. General Category
Table 4-5. Primary Numeric Ideographs
Table 4-6. Ideographs Used as Accounting Numbers
Table 4-7. Types of Character Name Aliases
Table 4-8. Name Derivation Rule Prefix Strings
Table 4-9. Construction of Code Point Labels
Table Unusual Properties
Table 5-1. Hex Values for Acronyms
Table 5-2. NLF Platform Correlations
Table 5-3. Typing Order Differing from Canonical Order
Table 5-4. Permuting Combining Class Weights
Table 5-5. Casing and Normalization in Strings
Table 6-1. Typology of Scripts in the Unicode Standard
Table 6-2. Unicode Space Characters
Table 6-3. Unicode Dash Characters
Table 6-4. Models of Visual Relationship between Quote Glyphs
Table 6-5. East Asian Quotation Marks
Table 6-6. Opening and Closing Forms
Table 6-7. Names for 280
Table 6-8. Unicode Danda Characters
Table 7-1. Preferred Rendering of Cedilla versus Comma Below
Table 7-2. Nonspacing Marks Used with Greek
Table 7-3. Greek Spacing and Nonspacing Pairs
Table 7-4. Typicon Kavyka Symbols
Table 8-1. Similar Characters in Linear B and Cypriot
Table 8-2. Combining Marks Used in Old Permic
Table 9-1. Arabic Digit Names
Table 9-2. Glyph Variation in Eastern Arabic-Indic Digits
Table 9-3. Primary Arabic Joining Types
Table 9-4. Derived Arabic Joining Types
Table 9-5. Arabic Glyph Types
Table 9-6. Arabic Obligatory Ligature Joining Groups
Table 9-7. Arabic Ligature Notation
Table 9-8. Dual-Joining Arabic Characters
Table 9-9. Right-Joining Arabic Characters
Table Forms of the Arabic Letter yeh
Table Arabic Letters With Hamza Above
Table Miscellaneous Syriac Diacritic Use
Table Syriac Final Alaph Glyph Types
Table Dual-Joining Syriac Characters
Table Right-Joining Syriac Characters
Table Syriac Alaph Glyph Forms
Table Syriac Ligatures
Table Samaritan Performative Punctuation Marks
Table Dual-Joining Mandaic Characters
Table Right-Joining Mandaic Characters
Table Old South Arabian Numeric Characters
Table Number Formation in Old South Arabian
Table Number Formation in Aramaic
Table Dual-Joining Manichaean Letters
Table Right-Joining Manichaean Letters
Table Left-Joining Manichaean Letters
Table Non-Joining Manichaean Letters
Table Manichaean Ligatures
Table Inscriptional Parthian Shaping Behavior
Table Avestan Shaping Behavior
Table Cuneiform Script Usage
Table Hieroglyphic Character Sequence
Table Devanagari Vowel Letters
Table Sample Devanagari Half-Forms
Table Sample Devanagari Ligatures
Table RA + Vocalic Letter Ligature Forms
Table Sample Devanagari Half-Ligature Forms
Table Marathi and Nepali Allographs
Table Devanagari Vowels Used in Bihari Languages
Table Prishthamatra Orthography
Table Bengali Vowel Letters
Table Diphthong Vowel Letters in Kokborok
Table Assamese Consonant-Vowel Combinations
Table Bengali Consonant-Vowel Combinations
Table Use of Apostrophe in Bangla
Table Gurmukhi Vowel Letters
Table Gurmukhi Conjuncts
Table Additional Pairin and Addha Forms in Gurmukhi
Table Use of Joiners in Gurmukhi
Table Gujarati Vowel Letters
Table Gujarati Conjuncts
Table Oriya Vowel Letters
Table Oriya Conjuncts
Table Oriya Vowel Placement
Table Ligation for the Syllable om
Table Tamil Ligatures with u
Table Tamil Vowels, Consonants, and Syllables
Table Telugu Vowel Letters
Table Rendering of Telugu na + virama
Table Kannada Vowel Letters
Table Rendering of Kannada na + virama
Table Malayalam Vowel Letters
Table Malayalam Orthographic Reform
Table Malayalam Conjuncts
Table Candrakkala Examples
Table Use of Joiners in Malayalam
Table Malayalam /rara/ and /uua/
Table Malayalam /nr/ and /nt/
Table Atomic Encoding of Malayalam Chillus
Table Thaana Glyph Placement
Table Sinhala Vowel Letters
Table Murmured Resonants in Nepal Bhasa
Table Positions of Limbu Combining Characters
Table Lepcha Syllabic Structure
Table Various Signs in Masaram Gondi
Table Brahmi Vowel Letters
Table Brahmi Positional Digits
Table Kharoshthi Vowel Signs
Table Kharoshthi Vowel Modifiers
Table Kharoshthi Consonant Modifiers
Table Examples of Kharoshthi Virama
Table Phags-pa Positional Forms of I, U, E, and O
Table Contextual Glyph Mirroring in Phags-pa
Table Phags-pa Standardized Variants
Table Takri Vowel Letters
Table Siddham Punctuation Characters
Table Khudawadi Vowel Letters
Table Representation of Arabic Sounds in Khudawadi
Table Tirhuta Vowel Letters
Table Modi Vowel Letters
Table Rendering of Explicit Virama Forms in Grantha
Table Additional Svara Marks used in Grantha
Table Glyph Positions in Thai Syllables
Table Glyph Positions in Lao Syllables
Table Modern Burmese Syllabic Structure
Table Khamti Shan Tone Marks
Table Independent Khmer Vowel Characters
Table Two Registers of Khmer Consonants
Table Khmer Subscript Consonant Signs
Table Khmer Composite Dependent Vowel Signs with Nikahit
Table Khmer Subscript Independent Vowel Signs
Table Tai Le Tone Marks
Table Myanmar Digits in Tai Le
Table New Tai Lue Vowel Placement
Table New Tai Lue Registers and Tones
Table Tai Viet Symbols and Punctuation
Table Cham Syllabic Structure
Table Hanunóo and Buhid Vowel Sign Combinations
Table Balinese Base Consonants and Conjunct Forms
Table Sasak Extensions for Balinese
Table Balinese Consonant Clusters with u and u:
Table Modern Sundanese Syllabic Structure
Table Blocks Containing Han Ideographs
Table Small Extensions to the URO
Table Common Han Characters
Table Source Encoding for Sword Variants
Table Ideographs Not Unified
Table Ideographs Unified
Table Han Ideograph Arrangement
Table Mandarin Tone Marks
Table Minnan and Hakka Tone Marks
Table Separating Jamo Characters
Table Line-Based Placement of Jungseong
Table Lisu Tone Letters
Table Punctuation Adopted in Lisu Orthography
Table Labialized Forms in Ethiopic -WAA
Table Labialized Forms in Ethiopic -WE
Table N'Ko Diacritic Usage
Table N'Ko Tone Diacritics on Vowels
Table N'Ko Letter Shaping
Table Number Formation in Mende Kikakui
Table Combining Marks used in Osage
Table IPA Transcription of Deseret
Table Examples of Ornamentation
Table Representation of Ancient Greek Vocal and Instrumental Notation
Table Currency Symbols Encoded in Other Blocks
Table Mathematical Alphanumeric Symbols
Table Script-Specific Decimal Digits
Table Compatibility Digits
Table Mathematical Operators Disunified from Punctuation
Table Use of Mathematical Symbol Pieces
Table Geometric Shape Collections
Table Japanese Era Names
Table Control Codes Specified in the Unicode Standard
Table Letter Spacing
Table Bidirectional Ordering Controls
Table Paired Stateful Controls
Table Paired Stateful Controls (Deprecated)
Table Unicode Encoding Scheme Signatures
Table U+FEFF Signature in Other Charsets
Table IRG Sources
Table A-1. Extended BNF
Table A-2. Character Class Examples
Table A-3. Operators
Table C-1. Timeline
Table C-2. Zero Extending
Table D-1. Versions of Unicode and ISO/IEC
Table D-2. Allocation of Code Points by Type (Versions to 3.0)
Table D-3. Allocation of Code Points by Type (Versions 3.1 to 5.1)
Table D-4. Allocation of Code Points by Type (Versions 5.2 to 7.0)
Table D-5. Allocation of Code Points by Type (Versions 8.0 to 10.0)
Table F-1. CJK Strokes
ISO/IEC JTC1/SC2/WG2 N2291 2000-09-25 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation internationale de normalisation еждународная организация по
More informationConsent docket re WG2 Resolutions at its Meeting #35 as amended. For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R.
L2/98-389R Consent docket re WG2 Resolutions at its Meeting #35 as amended For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R. RESOLUTION M35.4 (PDAM-24 on Thaana): Unanimous to prepare
More informationOpenType Font by Harsha Wijayawardhana UCSC
OpenType Font by Harsha Wijayawardhana UCSC Introduction The OpenType font format is an extension of the TrueType font format, adding support for PostScript font data. The OpenType font format was developed
More informationGONDI and GUNJALA GONDI CHARACTER NAMES Vowels EE and OO. Comment on GONDI (L2/15-005) and GUNJALA GONDI (L2/ ) proposals
GONDI and GUNJALA GONDI CHARACTER NAMES Vowels EE and OO Comment on GONDI (L2/15-005) and GUNJALA GONDI (L2/15-086 ) proposals Naga Ganesan (naa.ganesan@gmail.com) Abstract: This document requests naming
More informationRendering in Dzongkha
Rendering in Dzongkha Pema Geyleg Department of Information Technology pema.geyleg@gmail.com Abstract The basic layout engine for Dzongkha script was created with the help of Mr. Karunakar. Here the layout
More informationTO: UTC L2/15 XXX FROM: Deborah Anderson, Ken Whistler, Rick McGowan, Roozbeh Pournader, Anshuman Pandey,
TO: UTC L2/15 XXX FROM: Deborah Anderson, Ken Whistler, Rick McGowan, Roozbeh Pournader, Anshuman Pandey, and Andrew Glass SUBJECT: Recommendations to UTC #143 May 2015 on Script Proposals DATE: 3 May
More informationThis PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.
This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however
More informationResolutions from the SC2/WG2 meeting in London, September 21-25, 1998 with comments from Ken Whistler, September 29, 1998
L2/98-312 Resolutions from the SC2/WG2 meeting in London, September 21-25, 1998 with comments from Ken Whistler, September 29, 1998 Resolution M35.1 (FPDAM-18 on Symbols and Other characters including
More informationJoiners (ZWJ/ZWNJ) with Semantic content for words in Indian subcontinent languages
Joiners (ZWJ/ZWNJ) with Semantic content for words in Indian subcontinent languages N. Ganesan This document gives examples of Unicode joiners, ZWJ and ZWNJ where the meanings of words differ substantially
More information108_GILLAM.index.fm Page 817 Monday, August 19, :35 PM. Index
108_GILLAM.index.fm Page 817 Monday, August 19, 2002 3:35 PM Index A AAT (Apple Advanced Typography), 675 baseline adjustment, 681 caret positioning, 681 682 glyphs compound, 680 selection/placement, 678
More informationThe Unicode Standard Version 10.0 Core Specification
The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers
More informationNRSI: Computers & Writing Systems
NRSI: Computers & Writing Systems SIL HOME CONTACT US Search You are here: Encoding > Unicode Search Home Contact us General Initiative B@bel WSI Guidelines Encoding Principles Unicode Tutorials PUA Character
More informationTitle: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS
ISO/IEC JTC1/SC2 N3427 ISO/IEC JTC1/SC2/WG2 N2214 Title: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 2000-03-28
More informationFileMaker 15 Specific Features
FileMaker 15 Specific Features FileMaker Pro and FileMaker Pro Advanced Specific Features for the Middle East and India FileMaker Pro 15 and FileMaker Pro 15 Advanced is an enhanced version of the #1-selling
More informationISO/TC46/SC4/WG1 N 240, ISO/TC46/SC4/WG1 N
L2/00-220 Title: Finalized Mapping between Characters of ISO 5426 and ISO/IEC 10646-1 (UCS) Source: The Research Libraries Group, Inc. Status: L2 Member Contribution References: ISO/TC46/SC4/WG1 N 240,
More informationBuilding Apps Last updated: 12 June 2017
Building Apps Last updated: 12 June 2017 Contents 1. Preparing content for your app... 3 1.1. Preparing your lexicon file... 3 1.2. Preparing images... 3 1.3. Preparing audio... 3 2. How to build your
More information