Structure Vowel signs are used in a manner similar to that employed by other Brahmi-derived scripts. Consonants have an inherent /a/ vowel sound.

Similar documents
B. Technical General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters) Yes.

097B Ä DEVANAGARI LETTER GGA 097C Å DEVANAGARI LETTER JJA 097E Ç DEVANAGARI LETTER DDDA 097F É DEVANAGARI LETTER BBA

ä + ñ ISO/IEC JTC1/SC2/WG2 N

ISO/IEC JTC1/SC2/WG2 N2641

Doc Type: Working Group Document Title: Recommendations for encoding Xiàngqí game symbols Source: Michael Everson (

ISO International Organization for Standardization Organisation Internationale de Normalisation

0CE2 Ä KANNADA VOWEL SIGN VOCALIC L 0CE3 Å KANNADA VOWEL SIGN VOCALIC LL 0CE4 Ç KANNADA DANDA 0CE5 É KANNADA DOUBLE DANDA

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC

Introduction. Acknowledgements

5c. Are the character shapes attached in a legible form suitable for review?

Form number: N2352-F (Original ; Revised , , , , , , ) N2352-F Page 1 of 7

****This proposal has not been submitted**** ***This document is displayed for initial feedback only*** ***This proposal is currently incomplete***

transcribing Urdu or Arabic words. Accordingly, the KHHA and GHHA should be considered atomic, as Tibetan TSA, TSHA, and DZA are.

5 U+1F641 SLIGHTLY FROWNING FACE

Proposal to encode the DOGRA VOWEL SIGN VOCALIC RR

5 U+1F641 SLIGHTLY FROWNING FACE

also represented by combnining vowel matras with ē, and ō: ayɯ, eyi, ayi;

1. Introduction.

ISO/IEC JTC1/SC2/WG2 N4445

JTC1/SC2/WG

Proposal to encode three Arabic characters for Arwi

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS. Yes (or) More information will be provided later:

Proposal to Add Four SENĆOŦEN Latin Charaters

JTC1/SC2/WG2 N

PHOENICIAN NUMBER ONE. More recent research into Imperial Aramaic (N3273R2, L2/07-197R2, 2007-

ARABIC LETTER BEH WITH SMALL ARABIC LETTER MEEM ABOVE MBEH (BEH WITH MEEM ABOVE) Represents mba sound

Ꞑ A790 LATIN CAPITAL LETTER A WITH SPIRITUS LENIS ꞑ A791 LATIN SMALL LETTER A WITH SPIRITUS LENIS

Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation

ISO/IEC JTC1/SC2/WG2. Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC Secretariat: ANSI

Proposal to Encode Oriya Fraction Signs in ISO/IEC 10646

Proposal to encode the END OF TEXT MARK for Malayalam

Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation

Proposal to encode the SANDHI MARK for Newa

1. Introduction 2. TAMIL DIGIT ZERO JTC1/SC2/WG2 N Character proposed in this document About INFITT and INFITT WG

Yes. Form number: N2652-F (Original ; Revised , , , , , , , )

Revised proposal to encode NEPTUNE FORM TWO. Eduardo Marin Silva 02/08/2017

Although the International Phonetic Alphabet deprecated the letters. Unicode Character Properties. Character properties are proposed here.

John H. Jenkins If available now, identify source(s) for the font (include address, , ftp-site, etc.) and indicate the tools used:

A B Ɓ C D Ɗ Dz E F G H I J K Ƙ L M N Ɲ Ŋ O P R S T Ts Ts' U W X Y Z

1. Introduction. 2. Proposed Characters Block: Latin Extended-E Historic letters for Sakha (Yakut)

1. Introduction. 2. Encoding Considerations

1 ISO/IEC JTC1/SC2/WG2 N

Title: Application to include Arabic alphabet shapes to Arabic 0600 Unicode character set

Proposal to Encode the Ganda Currency Mark for Bengali in the BMP of the UCS

JTC1/SC2/WG2 N4945R

Proposal to encode Devanagari Sign High Spacing Dot

1. Introduction 2. TAMIL LETTER SHA Character proposed in this document About INFITT and INFITT WG

ISO/IEC JTC/SC2/WG Universal Multiple Octet Coded Character Set (UCS)

Proposal to encode the Prishthamatra for Nandinagari

ꟸ A7F8 LATIN SUBSCRIPT SMALL LETTER S ꟹ A7F9 LATIN SUBSCRIPT SMALL LETTER T ꟺ A7FA LATIN LETTER SMALL CAPITAL TURNED M

JTC1/SC2/WG2 N3514. Proposal to encode a SOCCER BALL symbol Karl Pentzlin, U+26xx SOCCER BALL

Proposal to Encode An Outstanding Early Cyrillic Character in Unicode

JTC1/SC2/WG2 N3048. Form number: N2352-F (Original ; Revised , , , , , , ) Page 1 of 5

Üù àõ [tai 2 l 6] (in older orthography Üù àõ»). Tai Le orthography is simple and straightforward:

Proposal to encode MALAYALAM LETTER VEDIC ANUSVARA

ISO/IEC JTC1/SC2/WG2 N4078

ISO/IEC JTC 1/SC 2/WG 2/N2789 L2/04-224

ISO/IEC JTC 1/SC 2/WG 2 Proposal summary form N2652-F accompanies this document.

ISO/IEC JTC 1/SC 2/WG 2 N3086 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1

ISO/IEC JTC1/SC2/WG2 N2817

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC

Fig. 2 of E.161 Fig. 3 of E.161 Fig. 4 of E.161

Proposal to encode fourteen Pakistani Quranic marks

Yes 11) 1 Form number: N2652-F (Original ; Revised , , , , , , , 2003-

If this proposal is adopted, the following three characters would exist: with the following properties (including a change from Ll to Lo for U+0294):

UC Irvine Unicode Project

L2/ ISO/IEC JTC1/SC2/WG2 N4671

ISO/IEC JTC1/SC2/WG2 N3734 L2/09-144R

The solid pentagram, as found in the top row of the figure above, is also used in mysticism and occult contexts (see figures in section 6).

L2/ ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC

Proposal to Encode an Abbreviation Sign for Bengali. Srinidhi A and Sridatta A. Tumakuru, Karnataka, India

Proposal to encode the TELUGU SIGN SIDDHAM

ISO/IEC JTC1/SC2/WG2 N4210

1.1 The digit forms propagated by the Dozenal Society of Great Britain

Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation

Andrew Glass and Shriramana Sharma. anglass-at-microsoft-dot-com jamadagni-at-gmail-dot-com November-2

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC

Introduction. Requests. Background. New Arabic block. The missing characters

ISO/IEC JTC1/SC2/WG2 N4157 L2/12-121

Proposal to encode the TAKRI VOWEL SIGN VOCALIC R

Cibu Johny, 2014-Dec-26

See also:

Request for encoding GRANTHA LENGTH MARK

Two distinct code points: DECIMAL SEPARATOR and FULL STOP

This proposal is limited to the addition and rearrangement of some of the Korean character part of ISO/IEC (UCS2).

L2/ Proposal to encode archaic vowel signs O OO for Kannada. 1. Thanks. 2. Introduction

ISO/IEC/JTC 1/SC2/WG2 N 4066

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC A.

YES (or) More information will be provided later:

Proposal to encode Kannada Sign Spacing Candrabindu

Form number: N2652-F (Original ; Revised , , , , , , , )

ISO/IEC JTC/1 SC/2 WG/2 N2095

2. Requester's name: Urdu and Regional Language Software Development Forum, Ministry of Science and Technology, Government of Pakistan

Proposal to Encode Some Outstanding Early Cyrillic Characters in Unicode

Proposal to encode. 1. Discussion

ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS)

Proposal to encode MALAYALAM SIGN PARA

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1

Title: A proposal to encode the Akarmatrik music notation symbols in UCS. Author: Chandan Misra. Submission Date:

ISO/IEC JTC1 SC2 WG2 N3997

Transcription:

ISO/IEC JTC1/SC2/WG2 N3023 L2/06-003 2006-01-11 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Preliminary proposal for encoding the Rejang script in the BMP of the UCS Source: Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2006-01-11 This is a preliminary proposal to encode the Rejang script in the BMP of the UCS. Rejang is spoken by a million people living in Indonesia on the island of Sumatra in the southwest highlands, north Bengkulu Province, around Argamakmur, Muaraaman, Curuo, and Kapahiang. Most of its users live in fairly remote rural areas, of whom slightly less than half of them are literate. The traditional Rejang corpus consists chiefly of ritual texts, medical incantations, and poetry. Structure Vowel signs are used in a manner similar to that employed by other Brahmi-derived scripts. Consonants have an inherent /a/ vowel sound. Ordering An ordering based on the traditional order of the Brahmic script, though with some differences. The Javanese ha na ca ra ka ordering is not used. A. Administrative 1. Title Proposal for encoding the Rejang script in the UCS. 2. Requester s name 3. Requester type (Member body/liaison/individual contribution) Individual contribution. 4. Submission date 2006-01-11 5. Requester s reference (if applicable) 6. Choose one of the following: 6a. This is a complete proposal 6b. More information will be provided later B. Technical General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters) Proposed name of script Rejang. 1b. The proposal is for addition of character(s) to an existing block 1

1c. Name of the existing block 2. Number of characters in proposal 37 3. Proposed category (see section II, Character Categories) Category B.1. 4a. Proposed Level of Implementation (1, 2 or 3) (see clause 14, ISO/IEC 10646-1: 2000) Level 2 4b. Is a rationale provided for the choice? 4c. If YES, reference Rejang requires Level 2 implementation as other Brahmic scripts do. 5a. Is a repertoire including character names provided? 5b. If YES, are the names in accordance with the character naming guidelines in Annex L of ISO/IEC 10646-1: 2000? 5c. Are the character shapes attached in a legible form suitable for review? 6a. Who will provide the appropriate computerized font (ordered preference: True Type, or PostScript format) for publishing the standard?. 6b. If available now, identify source(s) for the font (include address, e-mail, ftp-site, etc.) and indicate the tools used:, Fontographer. 7a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? 7b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? 8. Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? 9. Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database http://www.unicode.org/public/unidata/ UnicodeCharacterDatabase.html and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard. To be done. C. Technical Justification 1. Has this proposal for addition of character(s) been submitted before? If YES, explain. 2a. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? 2b. If YES, with whom? 2c. If YES, available relevant documents 3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Rejang is used on the island of Bali in Indonesia. 4a. The context of use for the proposed characters (type of use; common or rare) Used to write the Rejang language. 4b. Reference 5a. Are the proposed characters in current use by the user community? 5b. If YES, where? In Bali. 6a. After giving due considerations to the principles in Principles and Procedures document (a WG 2 standing document) must the proposed characters be entirely in the BMP? 2

Positions 1B00-1B7F are proposed. 6b. If YES, is a rationale provided? 6c. If YES, reference Contemporary use and accordance with the Roadmap. 7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? 8a. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? 8b. If YES, is a rationale for its inclusion provided? 8c. If YES, reference 9a. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? The nuktated extensions for Arabic can all be composed of base characters and the proposed BALINESE SIGN NUKTA. 9b. If YES, is a rationale for its inclusion provided? This is a preliminary proposal. One question which would need to be dealt with is the interaction of the nukta with the other vowel matras. Another is the repertoire: Rejang seems to have six nuktated consonants for Arabic, but Javanese appears to have a repertoire of eight. 9c. If YES, reference 10a. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? 10b. If YES, is a rationale for its inclusion provided? 10c. If YES, reference 11a. Does the proposal include use of combining characters and/or use of composite sequences (see clauses 4.12 and 4.14 in ISO/IEC 10646-1: 2000)? 11b. If YES, is a rationale for such use provided? 11c. If YES, reference Brahmic vowels. 12a. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? 12b. If YES, reference 13a. Does the proposal contain characters with any special properties such as control function or similar semantics? 13b. If YES, describe in detail (include attachment if necessary) 14a. Does the proposal contain any Ideographic compatibility character(s)? 14b. If YES, is the equivalent corresponding unified ideographic character(s) identified? 3

Proposal for encoding the Rejang script in the UCS TABLE XX - Row A9: REJANG A03 A94 A95 0 1 2 3 4 5 6 7 8 9 A B C D E F Ä ê ˇ Å ë ˇ Ç í ˇ É ì ˇ Ñ î Ö ï Ü ñ á ˇó à ˇò â ˇô ä ˇö ã ˇõ å ˇú ç ˇù é ˇû è ˇü G = 00 P = 00 4

Proposal for encoding the Rejang script in the UCS TABLE XX - Row A9: REJANG hex Name hex Name 30 31 32 33 34 35 36 37 38 39 3A 3B 3C 3D 3E 3F 40 41 42 43 44 45 46 47 48 49 4A 4B 4C 4D 4E 4F 50 51 52 53 54 55 56 57 58 59 5A 5B 5C 5D 5E 5F REJANG LETTER KA REJANG LETTER GA REJANG LETTER NGA REJANG LETTER TA REJANG LETTER DA REJANG LETTER NA REJANG LETTER PA REJANG LETTER BA REJANG LETTER MA REJANG LETTER CA REJANG LETTER JA REJANG LETTER NYA REJANG LETTER SA REJANG LETTER RA REJANG LETTER LA REJANG LETTER YA REJANG LETTER WA REJANG LETTER HA REJANG LETTER MBA REJANG LETTER NGGA REJANG LETTER NDA REJANG LETTER NYJA REJANG LETTER A REJANG KATULANG (ng) REJANG DUO DEATAS (n) REJANG KAJUNJUNG (r) REJANG VISARGA REJANG VOWEL SIGN KALUAN (i) REJANG VOWEL SIGN KAMITAN (u) REJANG VOWEL SIGN KAMICA (e) REJANG VOWEL SIGN KATILING (ai) REJANG VOWEL SIGN O REJANG VOWEL SIGN KATULUNG (au) REJANG VOWEL SIGN EU REJANG VOWEL SIGN KAJINA (ea) REJANG VIRAMA REJANG SECTION MARK Group 00 Plane 00 Row A9 5