(URW) ++ UNICODE APERÇU 1. Nimbus Sans Block Name. Regular. Bold. Light Vers Regular. Regular. Bold. Medium. Vers Vers Vers. 4.

Size: px
Start display at page:

Download "(URW) ++ UNICODE APERÇU 1. Nimbus Sans Block Name. Regular. Bold. Light Vers Regular. Regular. Bold. Medium. Vers Vers Vers. 4."

Transcription

1 UNICODE APERÇU 1 Unicode Code points (Plane, Plane 2) 93+9 HKSCS Alternates Latin Extended-A 5 U+2FF U+52F U+4FF U+F U+5 U+5FF U+7 U+74F U+6FF U+77F U+7 U+7BF U+ U+97F U+7FF U+9FF U+A7F U+A U+AFF U+B U+BFF U+B7F U+C7F U+C U+CFF U+D U+DFF U+E U+17F U+ U+D U+3FF U+C 3 U+ U+B U+36F U+A 474 U+ U+9 U+2AF U+7C U+2 U+7 U+24F U U+1 U+5 Basic Latin U+FF U+ U+7F U+ U+2B Taiwan Alternates U+1 Unassigned Glyphs Japanese Alternates U+ U+D7F U+E7F Latin-1 Supplement Latin Extended-B IPA Extensions Spacing Modifier Letters 28 Combining Diacritical Marks 2 Cyrillic Greek and Coptic Cyrillic Supplement Arabic Arabic Supplement 59 Syriac Thaana NKo 77 Devanagari 7 Gurmukhi 79 Bengali Gujarati Oriya Tamil Telugu Kannada Malayalam Sinhala Thai 4 Hebrew 15 Armenian

2 UNICODE APERÇU 2 U+EFF Lao U+1 U+19F Myanmar U+F U+FFF U+1A U+1FF U+ U+1F U+ U+ U+FF U+F U+A U+FF U+ U+9F U+1 U+7F U+A U+FF U+1 U+173F U+ U+17 U+176 U+17 U+171F U+75F 65 U+177F U+17FF U+1 U+18AF U+19 U+197F U+1 U+1F U+19 U+19DF U+1A U+1A1F U+1B U+1BBF U+1C U+1C7F U+19E U+1B U+1C U+1D U+19FF U+1B7F U+1C4F U+1D7F U+1D U+1DBF U+1E U+1EFF U+1DC U+1F U+2 U+27 U+1DFF U+1FFF U+26F U+29F Limbu Balinese 1 Lepcha 74 Phonetic Extensions Cobining Diacritical Marks Supplement 42 Phonetic Extensions Supplement 64 Latin Extended Additional 247 General Punctuation Greek Extended Superscripts And Subscripts 26 4 Ol Chiki Khmer Sundanese Buginese 9 Runic Khmer Symbols 6 New Tai Lue 4 Unified Canadian Aboriginal Syllabics Tai Le 85 Mongolian 653 Cherokee Tagbanwa 356 Buhid Ethiopic Hanunoo Tapalog Georgian Ogham 25 Ethiopic Supplement Tibetan Hangul Jamo U+E

3 UNICODE APERÇU U+2A U+2CF Currency Symbols 25 4 U+21 U+2F Letterlike Symbols 8 U+2D U+21 U+2FF U+218F U+21 U+21FF U+2 U+23FF U+ U+2 U+24 U+FF U+F U+245F U+ U+24FF U+25 U+259F U+2 U+2F U+25A U+25FF U+ U+BF U+26 U+26FF U+C U+EF U+ U+FF U+F U+2 U+29 U+FF U+297F U+29FF U+2A U+2AFF U+2C U+2C5F U+2B U+2C6 U+2BFF U+2C7F U+2C U+2CFF U+2D U+2D7F U+2D U+2D2F U+2D U+2DDF U+2E U+2E7F U+2DE U+2E U+2F U+2FF U+2DFF U+2EFF U+2FDF U+2FFF Combining Diacritical Marks For Symbols Number Forms Arrows Miscellaneous Technical Mathematical Operators Control Pictures Optical Charakter Recognition 39 Enclosed Alphanumerics Block Elements Box Drawing Geometric Shapes Miscellaneous Mathematical Symbols-A 44 Supplemental Arrows-A Braille Patterns Miscellaneous Mathematical Symbols-B Supplemental Arrows-B Supplemental Mathematical Operators Miscellaneous Symbols and Arrows Glagolitic Latin Extended-C 6 Cyrillic Extended-A Supplemental Punctuation CJ K Radicals Supplement Kangxi Radicals Ideographic Description Charakters Ethiopic Extended Tifinagh Coptic Georgian Supplement Miscellaneous Symbols Dingbats

4 UNICODE APERÇU U+ U+3F CJ K Symbols And Punctuation U+A U+FF Katakana U+31 U+318F U+ U+31 U+31 U+9F U+3F Bopomofo U+319F Kanbun U+31A U+31BF U+31F U+31FF U+31C U+ U+3 U+31EF U+FF U+33FF U+3 U+4DBF U+4E U+9FFF U+4DC U+A U+4DFF CJ K Strokes Katakana Phonetic Extensions CJ K Unified Ideographs Extension A CJ K Unified Ideographs CJ K Compatibility Yijing Hexagram Symbols U+AF Syloti Nagri 44 Yi Radicals Cyrillic Extended-B Latin Extended-D 4 Phags-pa 56 U+A U+A8DF Saurashtra U+A9 U+AF Rejang U+AA5F Enclosed CJ K Letters And Month U+A92F U+AF Modifier Tone Letters U+A8 U+A71F U+A7FF U+A U+AA Vai U+A69F U+A U+A63F U+A6 U+A Bopomofo Extended Yi Syllables U+A4CF U+A7 Hangul Compatibility Jamo 93 U+AF U+A4 U+A Hiragana Kayah Li Cham U+AC U+D7AF Hangul Syllables U+E U+F8FF Private Use Area (plane ) U+D U+DFFF U+F U+FAFF U+FB U+FDFF U+FB U+FE Non-Plane (see OpenType spec.) CJ K Compatibility Ideographs U+FB4F Alphabetic Presentation Forms U+FEF Variation Selectors Arabic Presentation Forms-A

5 UNICODE APERÇU U+FE1 U+FE1F Vertical Forms 1 U+FE U+FE4F CJ K Compatibility 21 U+FE7 U+FEFF Arabic Presentation Forms-B U+FFF U+FFFF 5 1 U+FE2 U+FE U+FF U+1 U+FE2F U+FE6F U+FFEF U+17F U+1 U+1FF U+11 U+118F U+11 U+1F U+11 U+11CF U+1 U+129F U+11D U+11FF U+12A U+12DF U+13 U+134F U+1 U+13 U+1F U+139F U+13A U+13DF U+14 U+147F U+1 U+144F U+1 U+14AF U+1 U+191F U+1 U+192 U+1F U+193F U+1A U+1A5F U+ U+47F U+ U+1D U+3FF U+1DFF Cobining Half Marks Small Form Variants Halfwidth And Fullwidth Forms Specials Linear B Syllabary 75 Ancient Symbols Phaistos Disc Lycian Carian Old Italic Gothic Ugaritic Old Persian Deseret Shavian Osmanya Cypriot Syllabary Phoenician Lydian Kharoshthi Cuneiform 9 Byzantine Musical Symbols Cuneiform Numbers and Punctuation U+1D U+1D35F Tai Xuan Jing Symbols U+1DF Ancient Greek Numbers Aegean Numbers Musical Symbols U+1D U+1D1FF U+1D24F 26 Linear B Ideograms U+1D1 U+1D2 7 Ancient Greek Musical Notation Counting Rod Numerals

6 UNICODE APERÇU 6 U+1D U+1D7FF U+1F U+1F9F Mathematical Alphanumeric Symbols 9 Domino Tiles 1 U+1F2F Mahjong Tiles U+2 U+2A6DF CJ K Unified Ideographs Extension B U+E U+E7F U+1F U+2F U+E1 U+F U+1 U+2FA1F U+E1EF U+FFFFD U+1FFFD CJ K Compatibility Ideographs Supplemt. Tags Variation Selctors Supplement Private Use (plane 15) Private Use (plane )

JAVA.LANG.CHARACTER.UNICODEBLOCK CLASS

JAVA.LANG.CHARACTER.UNICODEBLOCK CLASS JAVA.LANG.CHARACTER.UNICODEBLOCK CLASS http://www.tutorialspoint.com/java/lang/java_lang_character.unicodehtm Copyright tutorialspoint.com Introduction The java.lang.character.unicodeblock class is a family

More information

Thu Jun :48:11 Canada/Eastern

Thu Jun :48:11 Canada/Eastern Roadmaps to Unicode Thu Jun 24 2004 17:48:11 Canada/Eastern Home Site Map Search Tables Roadmap Introduction Roadmap to the BMP (Plane 0) Roadmap to the SMP (Plane 1) Roadmap to the SIP (Plane 2) Roadmap

More information

The Unicode Standard. Version 3.0. The Unicode Consortium ADDISON-WESLEY. An Imprint of Addison Wesley Longman, Inc.

The Unicode Standard. Version 3.0. The Unicode Consortium ADDISON-WESLEY. An Imprint of Addison Wesley Longman, Inc. The Unicode Standard Version 3.0 The Unicode Consortium ADDISON-WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts Harlow, England Menlo Park, California Berkeley, California Don

More information

Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS

Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS ISO/IEC JTC1/SC2/WG2 N2316 Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 2001-01-09 Action: For confirmation

More information

Title: Graphic representation of the Roadmap to the BMP of the UCS

Title: Graphic representation of the Roadmap to the BMP of the UCS ISO/IEC JTC1/SC2/WG2 N2045 Title: Graphic representation of the Roadmap to the BMP of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 1999-08-15 Action: For confirmation by ISO/IEC

More information

ISO/IEC JTC 1/SC 2 N 3426

ISO/IEC JTC 1/SC 2 N 3426 ISO/IEC JTC 1/SC 2 N 3426 Date: 2000-04-04 Supersedes SC 2 N 2830 ISO/IEC JTC 1/SC 2 CODED CHARACTER SETS SECRETARIAT: JAPAN (JISC) DOC TYPE: TITLE: Other document Graphic representation of the Roadmap

More information

To the BMP and beyond!

To the BMP and beyond! To the BMP and beyond! Eric Muller Adobe Systems Adobe Systems - To the BMP and beyond! July 20, 2006 - Slide 1 Content 1. Why Unicode 2. Character model 3. Principles of the Abstract Character Set 4.

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

3494 Date: Supersedes SC 2 N 3426

3494 Date: Supersedes SC 2 N 3426 ISO/IEC JTC 1/SC 2 N 3494 3494 Date: 2000-10-06 Supersedes SC 2 N 3426 ISO/IEC JTC 1/SC 2 CODED CHARACTER SETS SECRETARIAT: JAPAN (JISC) DOC TYPE: Other document TITLE: ISO/IEC 10646 Roadmap [WG 2 N2313,

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

UNICODE IDENTIFIER AND PATTERN SYNTAX

UNICODE IDENTIFIER AND PATTERN SYNTAX 1 of 21 1/29/2008 10:32 AM Technical Reports Proposed Update to Unicode Standard Annex #31 UNICODE IDENTIFIER AND PATTERN SYNTAX Version Unicode 5.1 (draft 6) Authors Mark Davis (mark.davis@google.com)

More information

The Unicode Standard Version 6.0 Core Specification

The Unicode Standard Version 6.0 Core Specification The Unicode Standard Version 6.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Multimedia Data. Multimedia Data. Text Vector Graphics 3-D Vector Graphics. Raster Graphics Digital Image Voxel. Audio Digital Video

Multimedia Data. Multimedia Data. Text Vector Graphics 3-D Vector Graphics. Raster Graphics Digital Image Voxel. Audio Digital Video Multimedia Data Multimedia Data Text Vector Graphics 3-D Vector Graphics Raster Graphics Digital Image Voxel Audio Digital Video 1 Text There are three types of text that are used to produce pages of documents

More information

Language Processing with Perl and Prolog

Language Processing with Perl and Prolog Language Processing with Perl and Prolog Pierre Nugues Lund University Pierre.Nugues@cs.lth.se http://cs.lth.se/pierre_nugues/ Pierre Nugues Language Processing with Perl and Prolog 1 / 29 Character Sets

More information

ISO/IEC INTERNATIONAL STANDARD

ISO/IEC INTERNATIONAL STANDARD INTERNATIONAL STANDARD Provläsningsexemplar / Preview ISO/IEC 10646 First edition 2003-12-15 AMENDMENT 3 2008-02-15 Information technology Universal Multiple-Octet Coded Character Set (UCS) AMENDMENT 3:

More information

Domain Names in Pakistani Languages. IDNs for Pakistani Languages

Domain Names in Pakistani Languages. IDNs for Pakistani Languages ا ہ 6 5 a ز @ ں ب Domain Names in Pakistani Languages س a ی س a ب او اور را < ہ ر @ س a آف ا ر ا 6 ب 1 Domain name Domain name is the address of the web page pg on which the content is located 2 Internationalized

More information

The Unicode Standard Version 6.2 Core Specification

The Unicode Standard Version 6.2 Core Specification The Unicode Standard Version 6.2 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Unicode and Standardized Notation. Anthony Aristar

Unicode and Standardized Notation. Anthony Aristar Data Management and Archiving University of California at Santa Barbara, June 24-27, 2008 Unicode and Standardized Notation Anthony Aristar Once upon a time There were people who decided to invent computers.

More information

The Unicode Standard Version 6.1 Core Specification

The Unicode Standard Version 6.1 Core Specification The Unicode Standard Version 6.1 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Basis Technology Unicode 対応ライブラリスペックシート

Basis Technology Unicode 対応ライブラリスペックシート Adobe-Standard-Encoding Adobe-Symbol-Encoding cshppsmath Adobe-Zapf-Dingbats-Encoding cszapfdingbats Arabic ISO-8859-6, csisolatinarabic, iso-ir-127, ECMA-114, ASMO-708 ASCII US-ASCII, ANSI_X3.4-1968,

More information

Aspects of Computer Architecture

Aspects of Computer Architecture T V Atkinson, Ph D Senior Academic Specialist Department of Chemistry Michigan State University East Lansing, MI 48824 Table of Contents List of Tables...3 List of Figures...3. Introduction...6.. Why should

More information

Unicode: What is it and how do I use it?

Unicode: What is it and how do I use it? Abstract: The rationale for Unicode and its design goals and detailed design principles are presented. The correspondence between Unicode and ISO/IEC 10646 is discussed, the scripts included or planned

More information

Information, Characters, Unicode

Information, Characters, Unicode Information, Characters, Unicode Information Characters In modern computing, natural-language text is very important information. ( Number-crunching is less important.) Characters of text are represented

More information

The Unicode Standard Version 7.0 Core Specification

The Unicode Standard Version 7.0 Core Specification The Unicode Standard Version 7.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

This document is to be used together with N2285 and N2281.

This document is to be used together with N2285 and N2281. ISO/IEC JTC1/SC2/WG2 N2291 2000-09-25 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation internationale de normalisation еждународная организация по

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

NRSI: Computers & Writing Systems

NRSI: Computers & Writing Systems NRSI: Computers & Writing Systems SIL HOME CONTACT US Search You are here: Encoding > Unicode Search Home Contact us General Initiative B@bel WSI Guidelines Encoding Principles Unicode Tutorials PUA Character

More information

RomanCyrillic Std v. 7

RomanCyrillic Std v. 7 https://doi.org/10.20378/irbo-52591 RomanCyrillic Std v. 7 Online Documentation incl. support for Unicode v. 9, 10, and 11 (2016 2018) UNi code A З PDF! Ѿ Sebastian Kempgen 2018 RomanCyrillic Std: new

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

[MS-ISO10646]: Microsoft Universal Multiple-Octet Coded Character Set (UCS) Standards Support Document

[MS-ISO10646]: Microsoft Universal Multiple-Octet Coded Character Set (UCS) Standards Support Document [MS-ISO10646]: Microsoft Universal Multiple-Octet Coded Character Set (UCS) Standards Support Document Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation.

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

The Unicode Standard Version 6.1 Core Specification

The Unicode Standard Version 6.1 Core Specification The Unicode Standard Version 6.1 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Information technology Universal Multiple-Octet Coded Character Set (UCS) AMENDMENT 2: N Ko, Phags-pa, Phoenician and other characters

Information technology Universal Multiple-Octet Coded Character Set (UCS) AMENDMENT 2: N Ko, Phags-pa, Phoenician and other characters Information technology Universal Multiple-Octet Coded Character Set (UCS) AMENDMENT 2: N Ko, Phags-pa, Phoenician and other characters Page 1, Clause 1 Scope In the note, update the Unicode Standard version

More information

Michael Everson, Rick McGowan, Ken Whistler

Michael Everson, Rick McGowan, Ken Whistler Roadmaps to Unicode Mon Dec 13 2004 12:29:02 Europe/Dublin Home Site Map Search Tables Roadmap Introduction Roadmap to the BMP (Plane 0) Roadmap to the SMP (Plane 1) Roadmap to the SIP (Plane 2) Roadmap

More information

Michael Everson, Rick McGowan, Ken Whistler

Michael Everson, Rick McGowan, Ken Whistler Roadmaps to Unicode Fri May 27 2005 23:35:41 Europe/Dublin Home Site Map Search Tables Roadmap Introduction Roadmap to the BMP (Plane 0) Roadmap to the SMP (Plane 1) Roadmap to the SIP (Plane 2) Roadmap

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet

Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet A. Sean Pue South Asia Language Resource Center Pre-SASLI Workshop 6/7/09 1 Objectives To understand how

More information

ISO/IEC JTC 1/SC 2/WG 2 N2953A DATE: Extract of Section 14 - Action Items from N2953

ISO/IEC JTC 1/SC 2/WG 2 N2953A DATE: Extract of Section 14 - Action Items from N2953 ISO/IEC JTC 1/SC 2/WG 2 N2953A DATE: 2006-02-16 Extract of Section 14 - Action Items from N2953 14 Action Items All action items recorded in the minutes of the previous meetings from M25 to M42, M44 and

More information

Roadmap to the SMP. Michael Everson, Rick McGowan, Ken Whistler.

Roadmap to the SMP. Michael Everson, Rick McGowan, Ken Whistler. SMP Home Site Map Search Tables Roadmap Introduction BMP (Plane 0) SMP (Plane 1) SIP (Plane 2) SSP (Plane 14) Not the Roadmap More Information The Unicode Standard, Version 3.0 Proposed characters Submitting

More information

Integration Panel: Maximal Starting Repertoire MSR-2 Overview and Rationale

Integration Panel: Maximal Starting Repertoire MSR-2 Overview and Rationale Integration Panel: Maximal Starting Repertoire MSR-2 REVISION December 4, 2014 Table of Contents 1 Overview 3 2 Maximal Starting Repertoire (MSR-2) 3 2.1 Files 3 2.2 Determining the Contents of the MSR

More information

Title: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS

Title: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS ISO/IEC JTC1/SC2 N3427 ISO/IEC JTC1/SC2/WG2 N2214 Title: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 2000-03-28

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Character Properties 4

Character Properties 4 Chapter 4 Character Properties 4 Disclaimer The content of all character property tables has been verified as far as possible by the Unicode Consortium. However, the Unicode Consortium does not guarantee

More information

108_GILLAM.index.fm Page 817 Monday, August 19, :35 PM. Index

108_GILLAM.index.fm Page 817 Monday, August 19, :35 PM. Index 108_GILLAM.index.fm Page 817 Monday, August 19, 2002 3:35 PM Index A AAT (Apple Advanced Typography), 675 baseline adjustment, 681 caret positioning, 681 682 glyphs compound, 680 selection/placement, 678

More information

Corso di Biblioteche Digitali

Corso di Biblioteche Digitali Corso di Biblioteche Digitali Vittore Casarosa casarosa@isti.cnr.it tel. 050-621 3115 cell. 348-397 2168 Skype vittore1201 Ricevimento dopo la lezione o per appuntamento Valutazione finale 70% esame orale

More information

EDAN20 Language Technology Chapter 3: Encoding and Annotation Schemes

EDAN20 Language Technology   Chapter 3: Encoding and Annotation Schemes EDAN20 http://cs.lth.se/edan20/ Pierre Nugues Lund University Pierre.Nugues@cs.lth.se http://cs.lth.se/pierre_nugues/ August 31, 2017 Pierre Nugues EDAN20 http://cs.lth.se/edan20/ August 31, 2017 1/34

More information

UNICODE IDENTIFIER AND PATTERN SYNTAX

UNICODE IDENTIFIER AND PATTERN SYNTAX Technical Reports Proposed Update Unicode Standard Annex #31 UNICODE IDENTIFIER AND PATTERN SYNTAX Version Unicode 11.0.0 (draft 1) Editors Mark Davis (markdavis@google.com) Date 2018-04-10 This Version

More information

The Unicode Standard Version 6.0 Core Specification

The Unicode Standard Version 6.0 Core Specification The Unicode Standard Version 6.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 6.0 Core Specification

The Unicode Standard Version 6.0 Core Specification The Unicode Standard Version 6.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Integration Panel: Maximal Starting Repertoire MSR-3 Overview and Rationale

Integration Panel: Maximal Starting Repertoire MSR-3 Overview and Rationale Integration Panel: Maximal Starting Repertoire MSR-3 REVISION March 28, 2018 Table of Contents 1 Overview 3 2 Maximal Starting Repertoire (MSR-3) 3 2.1 Files 3 2.1.1 Overview 3 2.1.2 Normative Definition

More information

Integration Panel: Maximal Starting Repertoire MSR-4 Overview and Rationale

Integration Panel: Maximal Starting Repertoire MSR-4 Overview and Rationale Integration Panel: Maximal Starting Repertoire MSR-4 REVISION November 09, 2018 Table of Contents 1 Overview 3 2 Maximal Starting Repertoire (MSR-4) 3 2.1 Files 3 2.1.1 Overview 3 2.1.2 Normative Definition

More information

Proposed Update Unicode Standard Annex #14

Proposed Update Unicode Standard Annex #14 Technical Reports Proposed Update Unicode Standard Annex #14 Version Unicode 8.0.0 Editors Date 2014-09-03 This Version Previous Version Latest Version Latest Proposed Update Revision 34 Summary Andy Heninger

More information

UNICODE LINE BREAKING ALGORITHM

UNICODE LINE BREAKING ALGORITHM Page 1 of 53 Technical Reports Summary Proposed Update Unicode Standard Annex #14 UNICODE LINE BREAKING ALGORITHM Version Unicode 6.2.0 (draft 2) Editors Date 2012-06-01 This Version This annex presents

More information

Leaks in the Unicode pipeline: script, script, script

Leaks in the Unicode pipeline: script, script, script Michael Everson, Everson Typography, www.evertype.com Some 52 scripts are currently allocated in the Unicode Standard. This reflects an enormous amount of work on the part of a great many people. An examination

More information

SC2/WG2 N2753A - Action items

SC2/WG2 N2753A - Action items SC2/WG2 N2753A - Action items 16 Action items (Post M-45, Pre-M46) All action items recorded in the minutes of the previous meetings from M25 to M42 have been either completed or dropped. of outstanding

More information

Building Apps Last updated: 12 June 2017

Building Apps Last updated: 12 June 2017 Building Apps Last updated: 12 June 2017 Contents 1. Preparing content for your app... 3 1.1. Preparing your lexicon file... 3 1.2. Preparing images... 3 1.3. Preparing audio... 3 2. How to build your

More information

L2/ Re: Proposal for v10.1 of UTS #39 From: Mark Davis Date: Draft: link

L2/ Re: Proposal for v10.1 of UTS #39 From: Mark Davis Date: Draft: link Re: Proposal for v10.1 of UTS #39 From: Mark Davis Date: 2017-05-10 Draft: link L2/17-166 It has become clear that we need to enhance some of the data and text in UTS #39, especially in light of recent

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Proposed Overhaul of kzvariant Data in the Unihan Database

Proposed Overhaul of kzvariant Data in the Unihan Database Proposed Overhaul of kzvariant Data in the Unihan Database John H. Jenkins 26 October 2015 The kzvariant data in the Unihan database is known to be of uneven quality. I recommend we resolve this problem

More information

Extract of Action items, section 16 from document N minutes from meeting M48 (for review at meeting M49, Tokyo, Japan; /29)

Extract of Action items, section 16 from document N minutes from meeting M48 (for review at meeting M49, Tokyo, Japan; /29) ISO/IEC JTC 1/SC 2/WG 2 N3103-A 2006-08-25 Extract of Action items, section 16 from document N3103 - minutes from meeting M48 (for review at meeting M49, Tokyo, Japan; 2006-09-25/29) All action items recorded

More information

COSC 243 (Computer Architecture)

COSC 243 (Computer Architecture) COSC 243 Computer Architecture And Operating Systems 1 Dr. Andrew Trotman Instructors Office: 123A, Owheo Phone: 479-7842 Email: andrew@cs.otago.ac.nz Dr. Zhiyi Huang (course coordinator) Office: 126,

More information

UNICODE SUPPORT FOR MATHEMATICS

UNICODE SUPPORT FOR MATHEMATICS Technical Reports UTC-Review: Unicode Technical Report #25 UNICODE SUPPORT FOR MATHEMATICS Version 1.0 Authors Date This Version Previous Version Latest Version Barbara Beeton (bnb@ams.org), Asmus Freytag

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

ISO/IEC JTC1/SC2/WG2 N3244

ISO/IEC JTC1/SC2/WG2 N3244 Page 1 of 6 ISO/IEC JTC1/SC2/WG2 N3244 Title Review of CJK-C Repertoire Source UK National Body Document Type National Body Contribution Date 2007-04-14, revised 2007-04-20 The UK national body has carried

More information

ISO/IEC INTERNATIONAL STANDARD

ISO/IEC INTERNATIONAL STANDARD INTERNATIONAL STANDARD ISO/IEC 14651 Second edition 2007-12-01 AMENDMENT 1 2008-10-24 Information technology International string ordering and comparison Method for comparing character strings and description

More information

UNIEDIT USER S GUIDE DUKE UNIVERSITY MULTILINGUAL TEXT EDITOR HUMANITIES COMPUTING FACILITY

UNIEDIT USER S GUIDE DUKE UNIVERSITY MULTILINGUAL TEXT EDITOR HUMANITIES COMPUTING FACILITY UNIEDIT MULTILINGUAL TEXT EDITOR USER S GUIDE HUMANITIES COMPUTING FACILITY DUKE UNIVERSITY Copyright Information COPYRIGHT 1998 BY THE HUMANITIES COMPUTING FACILITY, DUKE UNIVERSITY. ALL RIGHTS RESERVED.

More information

Resolutions from the SC2/WG2 meeting in London, September 21-25, 1998 with comments from Ken Whistler, September 29, 1998

Resolutions from the SC2/WG2 meeting in London, September 21-25, 1998 with comments from Ken Whistler, September 29, 1998 L2/98-312 Resolutions from the SC2/WG2 meeting in London, September 21-25, 1998 with comments from Ken Whistler, September 29, 1998 Resolution M35.1 (FPDAM-18 on Symbols and Other characters including

More information

ISO/IEC TR Information technology. An operational model for characters and glyphs. Version: 20 July, 1998

ISO/IEC TR Information technology. An operational model for characters and glyphs. Version: 20 July, 1998 ISO/IEC TR 15285 Information technology An operational model for characters and glyphs Technologies de l information Modèle pour l utilisation de caractères graphiques et de glyphes Version: 20 July, 1998

More information

General Structure 2. Chapter Architectural Context

General Structure 2. Chapter Architectural Context This PDF file is an excerpt from The Unicode Standard, Version 5.2, issued and published by the Unicode Consortium. The PDF files have not been modified to reflect the corrections found on the Updates

More information

LINE BREAKING PROPERTIES

LINE BREAKING PROPERTIES Page 1 of 46 Technical Reports Proposed Update Unicode Standard Annex #14 LINE BREAKING PROPERTIES Version Authors Summary This annex presents the specification of line breaking properties for Unicode

More information

ISO/IEC JTC 1/SC 2 N 3891 DATE:

ISO/IEC JTC 1/SC 2 N 3891 DATE: ISO/IEC JTC 1/SC 2 N 3891 DATE: 2006-09-08 ISO/IEC JTC 1/SC 2 Coded Character Sets Secretariat: Japan (JISC) DOC. TYPE TITLE SOURCE PROJECT STATUS ACTION ID Summary of Voting/Table of Replies Summary of

More information

Consent docket re WG2 Resolutions at its Meeting #35 as amended. For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R.

Consent docket re WG2 Resolutions at its Meeting #35 as amended. For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R. L2/98-389R Consent docket re WG2 Resolutions at its Meeting #35 as amended For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R. RESOLUTION M35.4 (PDAM-24 on Thaana): Unanimous to prepare

More information

The Unicode Standard Version 6.1 Core Specification

The Unicode Standard Version 6.1 Core Specification The Unicode Standard Version 6.1 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

The Unicode Standard Version 7.0 Core Specification

The Unicode Standard Version 7.0 Core Specification The Unicode Standard Version 7.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC Secretariat: ANSI

ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC Secretariat: ANSI ISO/IEC JTC 1/SC 2/WG 2 N3103 DATE: 2006-08-25 ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC 10646 Secretariat: ANSI DOC TYPE: Meeting Minutes TITLE: Unconfirmed

More information

Dictionary App Builder: Building Apps

Dictionary App Builder: Building Apps Building Apps Dictionary App Builder: Building Apps 2018, SIL International Last updated: 13 March 2018 You are free to print this manual for personal use and for training workshops. The latest version

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Unicode definition list

Unicode definition list abstract character D3 3.3 2 abstract character sequence D4 3.3 2 accent mark alphabet alphabetic property 4.10 2 alphabetic sorting annotation ANSI Arabic digit 1 Arabic-Indic digit 3.12 1 ASCII assigned

More information

TUTORIAL: INTERNET LANGUAGES, CHARACTER SETS AND ENCODINGS

TUTORIAL: INTERNET LANGUAGES, CHARACTER SETS AND ENCODINGS TUTORIAL: INTERNET LANGUAGES, CHARACTER SETS AND ENCODINGS by Michael K. Bergman BrightPlanet Corporation March 23, 2006 Broad-scale, international open source harvesting from the Internet poses many challenges

More information

ISO INTERNATIONAL STANDARD. Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters

ISO INTERNATIONAL STANDARD. Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters INTERNATIONAL STANDARD ISO 15919 First edition 2001-10-01 Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters Information et documentation Translittération

More information

Overview of Unicode and Indian Scripts

Overview of Unicode and Indian Scripts CHAPTER: 2 Overview of Unicode and Indian Scripts Introduction History and Development of Human Languages History and Development of Scripts Character Representation in Computers Brief History of Character

More information

2011 Martin v. Löwis. Data-centric XML. Character Sets

2011 Martin v. Löwis. Data-centric XML. Character Sets Data-centric XML Character Sets Character Sets: Rationale Computer stores data in sequences of bytes each byte represents a value in range 0..255 Text data are intended to denote characters, not numbers

More information

2007 Martin v. Löwis. Data-centric XML. Character Sets

2007 Martin v. Löwis. Data-centric XML. Character Sets Data-centric XML Character Sets Character Sets: Rationale Computer stores data in sequences of bytes each byte represents a value in range 0..255 Text data are intended to denote characters, not numbers

More information

UNICODE IDNA COMPATIBLE PREPROCESSSING

UNICODE IDNA COMPATIBLE PREPROCESSSING 1 of 12 1/23/2009 2:51 PM Technical Reports Proposed Draft Unicode Technical Standard #46 UNICODE IDNA COMPATIBLE PREPROCESSSING Version 1 (draft 1) Authors Mark Davis (markdavis@google.com), Michel Suignard

More information

Comments on the Proposals to Encode Tamil Symbols and Fractions by ICTA Sri Lanka. The document

Comments on the Proposals to Encode Tamil Symbols and Fractions by ICTA Sri Lanka. The document TO: UTC L2/14 170 FROM: Deborah Anderson, Ken Whistler, Rick McGowan, Roozbeh Pournader, and Laurentiu Iancu SUBJECT: Recommendations to UTC #140 August 2014 on Script Proposals DATE: 28 July 2014 The

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Code Charts 17. Chapter Character Names List. Disclaimer

Code Charts 17. Chapter Character Names List. Disclaimer This PDF file is an excerpt from The Unicode Standard, Version 5.2, issued and published by the Unicode Consortium. The PDF files have not been modified to reflect the corrections found on the Updates

More information

The Unicode Standard Version 7.0 Core Specification

The Unicode Standard Version 7.0 Core Specification The Unicode Standard Version 7.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

SC22/WG20 N532R December 14, 1997

SC22/WG20 N532R December 14, 1997 Disposition of comments against DTR 10176 SC22/WG20 N532R December 14, 1997 Technical Comments: (1) Annex A: (Denmark, Japan, Netherlands, U.S.A.) - Add notes: (a) The character repertoire listed in this

More information

German National Body comment on SC 2 N4052 Date: Document: WG2 N3592-Germany

German National Body comment on SC 2 N4052 Date: Document: WG2 N3592-Germany German National Body on SC N405 Date: 009-03-11 Document: WG N359-Germany 1 (3) 4 5 (6) (7) DE te (1) Kana on each submitted Germany recommends the addition the character U+1B000 KATAKANA LETTER ARCHAIC

More information

ISO/TC46/SC4/WG1 N 240, ISO/TC46/SC4/WG1 N

ISO/TC46/SC4/WG1 N 240, ISO/TC46/SC4/WG1 N L2/00-220 Title: Finalized Mapping between Characters of ISO 5426 and ISO/IEC 10646-1 (UCS) Source: The Research Libraries Group, Inc. Status: L2 Member Contribution References: ISO/TC46/SC4/WG1 N 240,

More information

Glossary. The Unicode Standard

Glossary. The Unicode Standard G Abstract Character. A unit of information used for the organization, control, or representation of textual data. (See Definition D3 in Section 3.3, Characters and Coded Representations.) Accent Mark.

More information

Recommendations: We recommend the UTC approve this character, after discussion.

Recommendations: We recommend the UTC approve this character, after discussion. TO: UTC L2/15 204 FROM: Deborah Anderson, Ken Whistler, Rick McGowan, Roozbeh Pournader, and Laurentiu Iancu SUBJECT: Recommendations to UTC #144 July 2015 on Script Proposals DATE: 25 July 2015 The recommendations

More information

The Unicode Standard Version 6.0 Core Specification

The Unicode Standard Version 6.0 Core Specification The Unicode Standard Version 6.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

ISO/IEC JTC 1/SC 2/WG 2. Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC Secretariat: ANSI

ISO/IEC JTC 1/SC 2/WG 2. Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC Secretariat: ANSI ISO/IEC JTC 1/SC 2/WG 2 N2550 Date: 2002-12-12 ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC 10646 Secretariat: ANSI Title: Source: SC2/WG2 partial document register

More information

Package utf8latex. December 26, 2016

Package utf8latex. December 26, 2016 Type Package Package utf8latex December 26, 2016 Title Importing, Exporting and Converting Between Datasets and LaTeX Version 1.0.4 Encoding UTF-8 Author c(person(given = ``Jose'', family = ``Gama'', role

More information

ISO/IEC JTC1/SC2/WG2 N 2490

ISO/IEC JTC1/SC2/WG2 N 2490 ISO/IEC JTC1/SC2/WG2 N 2490 Date: 2002-05-21 ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC) Doc. Type: Disposition of comments Title: Proposed Disposition of comments on SC2 N 3585

More information