Proposal to encode MALAYALAM SIGN PARA

Similar documents
Cibu Johny, 2014-Dec-26

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC

Proposal to encode MALAYALAM LETTER VEDIC ANUSVARA

Proposal to encode the DOGRA VOWEL SIGN VOCALIC RR

Proposal to encode three Arabic characters for Arwi

Proposal to encode the END OF TEXT MARK for Malayalam

Revised proposal to encode NEPTUNE FORM TWO. Eduardo Marin Silva 02/08/2017

Form number: N2352-F (Original ; Revised , , , , , , ) N2352-F Page 1 of 7

Proposal to encode the SANDHI MARK for Newa

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS. Yes (or) More information will be provided later:

Proposal to Add Four SENĆOŦEN Latin Charaters

1. Character to be encoded. 2. Background

****This proposal has not been submitted**** ***This document is displayed for initial feedback only*** ***This proposal is currently incomplete***

Title: Application to include Arabic alphabet shapes to Arabic 0600 Unicode character set

L2/ Proposal to encode archaic vowel signs O OO for Kannada. 1. Thanks. 2. Introduction

5 U+1F641 SLIGHTLY FROWNING FACE

Proposal to encode fourteen Pakistani Quranic marks

Introduction. Acknowledgements

Proposal to Encode an Abbreviation Sign for Bengali. Srinidhi A and Sridatta A. Tumakuru, Karnataka, India

Proposal to encode the Prishthamatra for Nandinagari

Andrew Glass and Shriramana Sharma. anglass-at-microsoft-dot-com jamadagni-at-gmail-dot-com November-2

A B Ɓ C D Ɗ Dz E F G H I J K Ƙ L M N Ɲ Ŋ O P R S T Ts Ts' U W X Y Z

1. Introduction 2. TAMIL LETTER SHA Character proposed in this document About INFITT and INFITT WG

ISO/IEC JTC1/SC2/WG2. Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC Secretariat: ANSI

5 U+1F641 SLIGHTLY FROWNING FACE

John H. Jenkins If available now, identify source(s) for the font (include address, , ftp-site, etc.) and indicate the tools used:

ISO/IEC JTC 1/SC 2/WG 2 Proposal summary form N2652-F accompanies this document.

ä + ñ ISO/IEC JTC1/SC2/WG2 N

ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS)

1. Introduction 2. TAMIL DIGIT ZERO JTC1/SC2/WG2 N Character proposed in this document About INFITT and INFITT WG

Request for encoding 1CF3 ROTATED ARDHAVISARGA

Proposal to encode Devanagari Sign High Spacing Dot

JTC1/SC2/WG

Proposal to Encode An Outstanding Early Cyrillic Character in Unicode

Proposal to Encode Oriya Fraction Signs in ISO/IEC 10646

1. Introduction.

ISO/IEC JTC1/SC2/WG2 N4445

Yes. Form number: N2652-F (Original ; Revised , , , , , , , )

Proposal to Encode the Ganda Currency Mark for Bengali in the BMP of the UCS

Proposal to encode the TELUGU SIGN SIDDHAM

Request for encoding GRANTHA LENGTH MARK

JTC1/SC2/WG2 N3514. Proposal to encode a SOCCER BALL symbol Karl Pentzlin, U+26xx SOCCER BALL

ISO/IEC JTC1/SC2/WG2 N4157 L2/12-121

ISO/IEC JTC1/SC2/WG2 N2641

Doc Type: Working Group Document Title: Recommendations for encoding Xiàngqí game symbols Source: Michael Everson (

Two distinct code points: DECIMAL SEPARATOR and FULL STOP

transcribing Urdu or Arabic words. Accordingly, the KHHA and GHHA should be considered atomic, as Tibetan TSA, TSHA, and DZA are.

Proposal to encode. 1. Discussion

See also:

ISO/IEC JTC1/SC2/WG2 N4599 L2/

L2/ ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC

1. Introduction. 2. Proposed Characters Block: Latin Extended-E Historic letters for Sakha (Yakut)

ISO/IEC JTC 1/SC 2/WG 2/N2789 L2/04-224

1 ISO/IEC JTC1/SC2/WG2 N

Proposal to encode Kannada Sign Spacing Candrabindu

0CE2 Ä KANNADA VOWEL SIGN VOCALIC L 0CE3 Å KANNADA VOWEL SIGN VOCALIC LL 0CE4 Ç KANNADA DANDA 0CE5 É KANNADA DOUBLE DANDA

Ꞑ A790 LATIN CAPITAL LETTER A WITH SPIRITUS LENIS ꞑ A791 LATIN SMALL LETTER A WITH SPIRITUS LENIS

Request for encoding 1CF4 VEDIC TONE CANDRA ABOVE

B. Technical General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters) Yes.

ISO/IEC JTC1/SC2/WG2 N3734 L2/09-144R

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC

YES (or) More information will be provided later:

Introduction. Requests. Background. New Arabic block. The missing characters

PHOENICIAN NUMBER ONE. More recent research into Imperial Aramaic (N3273R2, L2/07-197R2, 2007-

ISO International Organization for Standardization Organisation Internationale de Normalisation

Title: A proposal to encode the Akarmatrik music notation symbols in UCS. Author: Chandan Misra. Submission Date:

JTC1/SC2/WG2 N

ISO/IEC JTC/SC2/WG Universal Multiple Octet Coded Character Set (UCS)

L2/ ISO/IEC JTC1/SC2/WG2 N4671

5c. Are the character shapes attached in a legible form suitable for review?

Yes 11) 1 Form number: N2652-F (Original ; Revised , , , , , , , 2003-

Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation

Proposal to encode the TAKRI VOWEL SIGN VOCALIC R

Proposal to Encode Some Outstanding Early Cyrillic Characters in Unicode

Although the International Phonetic Alphabet deprecated the letters. Unicode Character Properties. Character properties are proposed here.

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC A.

The solid pentagram, as found in the top row of the figure above, is also used in mysticism and occult contexts (see figures in section 6).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC

Structure Vowel signs are used in a manner similar to that employed by other Brahmi-derived scripts. Consonants have an inherent /a/ vowel sound.

JTC1/SC2/WG2 N3048. Form number: N2352-F (Original ; Revised , , , , , , ) Page 1 of 5

ꟸ A7F8 LATIN SUBSCRIPT SMALL LETTER S ꟹ A7F9 LATIN SUBSCRIPT SMALL LETTER T ꟺ A7FA LATIN LETTER SMALL CAPITAL TURNED M

ISO/IEC JTC1 SC2 WG2 N3997

097B Ä DEVANAGARI LETTER GGA 097C Å DEVANAGARI LETTER JJA 097E Ç DEVANAGARI LETTER DDDA 097F É DEVANAGARI LETTER BBA

ISO/IEC JTC 1/SC 2/WG 2 N3086 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1

ISO/IEC JTC1/SC2/WG2 N4078

1. Introduction. 2. Encoding Considerations

Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation

1. Character to be encoded. 2. Background

JTC1/SC2/WG2 N4945R

1.1 The digit forms propagated by the Dozenal Society of Great Britain

ISO/IEC JTC/1 SC/2 WG/2 N2095

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1

Form number: N2652-F (Original ; Revised , , , , , , , )

UC Irvine Unicode Project

ISO/IEC JTC1/SC2/WG2 N2817

ARABIC LETTER BEH WITH SMALL ARABIC LETTER MEEM ABOVE MBEH (BEH WITH MEEM ABOVE) Represents mba sound

ISO/IEC JTC1/SC2/WG2 N4210

2. Requester's name: Urdu and Regional Language Software Development Forum, Ministry of Science and Technology, Government of Pakistan

L2/18 143R2. Re: ESC Recommendations for 2018Q2 UTC From: ESC Date:

ISO/IEC/JTC 1/SC2/WG2 N 4066

Transcription:

Proposal to encode MALAYALAM SIGN PARA Introduction Cibu Johny, cibu@google.com 2014-Jan-16 Historically Paṟa has been an important measurement unit in Kerala, for measuring rice grain. The word also described the area of a paddy field by the amount of grain in Paṟa it produces. So this unit is often used in historical agricultural land transactions. Image 1: The vessel used to measure one Paṟa of grain is also called a Paṟa. Shape of the sign looks like this: This sign is used to indicate a Para is typically preceded or followed by a number. Its usage period seems to be in the later 19th century from its attestations below. Paṟa sign needs to be encoded to record the documents from its above usage period. Confusability In theory, this letter could be confused with the traditional rendering of <NNNA(ഩ), VIRAMA, RA(ര)>. However, the symbol for NNNA was largely a theoretical one proposed by the 1

1 grammar Kērala Pāṇinīyaṃ by Rājarāja Varma. So this ligature is never attested in historical documents and fonts are unlikely to have it. Anyway, to avoid security issues, this character should be excluded from IDNs since it is not needed for contemporary Malayalam use. Attestations 2 Elements of South-Indian Palæography (1871) by A.C.Burnell This book reproduces some pages of Zamorin s Granthavari (book of counterparts of leases etc.). Image 2: Paṟa signs used before the Malayalam digits in this running text. It reads as: <para>5000 <para>2577. Malayalam selections: with translations, grammatical analyses and 3 vocabulary (1851) by A.J.Arbuthnot This book reproduces many pages of communication between collector of Malabar and various regional administrators in Kerala. In the book, each communication is translated and then and describes in English. Image 3: Paṟa signs used after the digits. It reads as: 33<para>. Image 4: Arbuthnot describes the Paṟa sign. Please note that there is variation in the glyph used in the Malayalam only text, from that in the English description the tail reaches the top and and makes a loop. Arbuthnot did not bother to describe this variation and we can assume that this is a glyph variation until contrary evidences emerge. 1 See N3295 in L2/07-283: http://www.unicode.org/l2/l2007/07283-n3295-malayalam.pdf 2 http://books.google.com/books?id=phmiaaaaqaaj&pg=pa141 page 141. 3 http://books.google.com/books?id=vgm-aaaacaaj&pg=pt134 page 118 and 119. 2

Choice of codepoint The proposed codepoint is 0D65 which is the closest location available to fractions and digits. In Devanagari chart corresponding location of 0965 is assigned to Double Danda. Today Devanagari Dandas are used for Malayalam like in rest of the Indic. In the unlikely event of future attestation of a Malayalam specific Double Danda, that Danda would need to be placed elsewhere. Legend White: Published as of Unicode 6.3. Green: Accepted and at advanced stage not permitting changes. Yellow: Fractions, accepted. Orange: Chillu LLL accepted but preferably to be moved to group with Chillu Y. Blue: New location proposed for Chillu LLL. Purple: Locations of ongoing proposals of Anusvara Above, Chillus M and Y, and Vertical Bar and Circular Viramas. Red: Location proposed for Paṟa. 3

Proposed character with properties 0D65;MALAYALAM SIGN PARA;So;0;L;;;;;N;;;;; Official Proposal Summary Form (Based on N3902-F) A. Administrative 1. Title Proposal to encode MALAYALAM SIGN PARA 2. Requester s name Cibu Johny 3. Requester type (Member body/liaison/individual contribution) Individual contribution 4. Submission date 2014-Jan-16 5. Requester s reference (if applicable) 6. Choose one of the following: This is a complete proposal (or) More information will be provided later This is a complete proposal. B. Technical General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters), Proposed name of script 1b. The proposal is for addition of character(s) to an existing block, Name of the existing block, Malayalam 2. Number of characters in proposal 1 (one) 3. Proposed category Category B1, specialized small (for this character) 4. Is a repertoire including character names provided? 4a. If YES, are the names in accordance with the character naming guidelines in Annex L of P&P document? 4b. Are the character shapes attached in a legible form suitable for review? 5. Fonts related: a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Michael Everson b. Identify the party granting a license for use of the font by the editors (include address, e-mail etc.) Michael Everson, everson@evertype.com 6a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? 6b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? 7. Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? 4

8. Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. See detailed proposal. C. Technical Justification 1. Has this proposal for addition of character(s) been submitted before? If YES, explain. 2a. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? 2b. If YES, with whom? Sunil V.S., Kevin Siji and Biju C.P. in a public Google+ post. The author is himself one of the user community. 2c. If YES, available relevant documents ne specifically. The matter was discussed via email and social media like Google+. 3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Those (scholars and enthusiasts) who desire to store old Malayalam documents using this character. 4a. The context of use for the proposed characters (type of use; common or rare) Rare 4b. Reference See detailed proposal. 5a. Are the proposed characters in current use by the user community?. This character was in historical use. 5b. If YES, where? 6a. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? 6b. If YES, is a rationale provided? It belongs in the Malayalam block which is in the BMP. 6c. If YES, reference 7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Only one character is proposed. 8a. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? 8b. If YES, is a rationale for its inclusion provided? 8c. If YES, reference 9a. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? 9b. If YES, is a rationale for its inclusion provided? 9c. If YES, reference 10a. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? 10b. If YES, is a rationale for its inclusion provided? 10c. If YES, reference 11a. Does the proposal include use of combining characters and/or use of composite sequences? 11b. If YES, is a rationale for such use provided? 11c. If YES, reference 11d. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? 5

12a. Does the proposal contain characters with any special properties such as control function or similar semantics?. 12b. If YES, describe in detail (include attachment if necessary) 13a. Does the proposal contain any Ideographic compatibility character(s)? 13b. If YES, is the equivalent corresponding unified ideographic character(s) identified? 13c. If YES, reference: 6