Because of these dispositions, Ireland changed its vote to Yes, leaving only one Negative vote (Japan)

Similar documents
ISO/IEC JTC1/SC2/WG2 N 4346R

ISO/IEC JTC1/SC2/WG2 N 3476

ISO/IEC JTC1/SC2/WG2 N 2490

ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC)

ISO/IEC JTC 1/SC 2 N 3332/WG2 N 2057

Consent docket re WG2 Resolutions at its Meeting #35 as amended. For the complete text of Resolutions of WG2 Meeting #35, see L2/98-306R.

This document is to be used together with N2285 and N2281.

Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS

Title: Graphic representation of the Roadmap to the BMP of the UCS

Template for comments and secretariat observations Date: Document: ISO/IEC 10646:2014 PDAM2

ISO/IEC JTC 1/SC 2 N 3363/WG2 N2124

ISO/IEC JTC 1/SC 2 N 3354

ISO/IEC JTC/1 SC/2 WG/2 N2095

L2/ ISO/IEC JTC1/SC2/WG2 N4671

ISO/IEC JTC 1/SC 2 N 3426

****This proposal has not been submitted**** ***This document is displayed for initial feedback only*** ***This proposal is currently incomplete***

Introduction. Acknowledgements

N4115 an alternative encoding for geometric shapes

ä + ñ ISO/IEC JTC1/SC2/WG2 N

The Unicode Standard Version 6.1 Core Specification

Introduction. Requests. Background. New Arabic block. The missing characters

ISO/IEC JTC 1/SC 32 N 2432

ISO/IEC JTC1/SC2/WG2 N4599 L2/

Form number: N2352-F (Original ; Revised , , , , , , ) N2352-F Page 1 of 7

ISO/IEC JTC 1/SC 2 N 3194

ISO/IEC JTC 1/SC 2 N 4004 DATE:

1. Introduction 2. TAMIL DIGIT ZERO JTC1/SC2/WG2 N Character proposed in this document About INFITT and INFITT WG

5 U+1F641 SLIGHTLY FROWNING FACE

ISO/IEC JTC 1/SC 32 N 2334

Below are several requests for N Ko additions. Feedback from the UTC is requested on items 2 6. (Item 1 is included as input for font creators.

P, O and L Members of ISO/IEC JTC 1/SC 2 ; ISO/IEC JTC 1 Secretariat; ISO/IEC ITTF Def

ISO/IEC/JTC 1/SC 2/WG 2/IRG Ideographic Rapporteur Group (IRG) IRG (Special) Meeting #44 Recommendations and Action Items

5 U+1F641 SLIGHTLY FROWNING FACE

ISO/IEC JTC 1/ SC 2 /WG2 Universal Multiple-Octet Coded Character Set (UCS) -ISO/IEC Secretariat: ANSI

Thu Jun :48:11 Canada/Eastern

L2/ Date:

Yes. Form number: N2652-F (Original ; Revised , , , , , , , )

Title: Graphic representation of the Roadmap to the SMP, Plane 1 of the UCS

JTC1/SC2/WG2 N

Two distinct code points: DECIMAL SEPARATOR and FULL STOP

ISO/IEC JTC/SC2/WG Universal Multiple Octet Coded Character Set (UCS)

Title: Application to include Arabic alphabet shapes to Arabic 0600 Unicode character set

ISO/IEC JTC 1/SC 2/WG 2 N3086 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1

Proposal to encode the DOGRA VOWEL SIGN VOCALIC RR

INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 2/WG 2

ISO/IEC JTC 1/SC 32 N 2324

ISO/IEC JTC 1/SC 32 N 2482

UC Irvine Unicode Project

ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC)

Doc Type: Working Group Document Title: Recommendations for encoding Xiàngqí game symbols Source: Michael Everson (

POSIX-Ada BINDING RAPPORTEUR GROUP PROCEDURES

ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC)

Fig. 2 of E.161 Fig. 3 of E.161 Fig. 4 of E.161

INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 2/WG 2/IRG

Reference ISO/IEC FDIS (Ed 2) Committee ISO/IEC JTC 1/SC 35 Edition number 2

The Unicode Standard Version 12.0 Core Specification

Proposal to Encode Some Outstanding Early Cyrillic Characters in Unicode

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS. Yes (or) More information will be provided later:

Proposal to encode MALAYALAM LETTER VEDIC ANUSVARA

A B Ɓ C D Ɗ Dz E F G H I J K Ƙ L M N Ɲ Ŋ O P R S T Ts Ts' U W X Y Z

ISO/IEC JTC 1/SC 2 N WG2 N2436 DATE:

INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 2/WG 2/IRG

Ꞑ A790 LATIN CAPITAL LETTER A WITH SPIRITUS LENIS ꞑ A791 LATIN SMALL LETTER A WITH SPIRITUS LENIS

Comments on the Proposals to Encode Tamil Symbols and Fractions by ICTA Sri Lanka. The document

Proposal to encode three Arabic characters for Arwi

Kannada 2. L2/ Representation of Jihvamuliya and Upadhmaniya in Kannada Srinidhi

1. Introduction 2. TAMIL LETTER SHA Character proposed in this document About INFITT and INFITT WG

ISO/IEC JTC 1/SC 2 N 3891 DATE:

Request for encoding 1CF4 VEDIC TONE CANDRA ABOVE

Page 2 of 6 We therefore propose to unify characters with different glyph forms in different source dictionaries where the glyph variants are not used

ISO/IEC JTC 1/SC 2/WG 2/N2789 L2/04-224

ISO/IEC JTC 1/SC 2 N 3713 DATE: WG2 N 2706

ISO/IEC JTC 1/SC 32 N 2602

164 1 Scope Figure 1 Ed Three instances of 'specifie' should be 'specifies'. Make the corrections. Accepted END

Proposal to encode the SANDHI MARK for Newa

INTERNATIONAL ORGANIZATION FOR STANDARDIZATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC 1/SC 2/WG 2/IRG

Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation

JTC1/SC2/WG

ISO/IEC INTERNATIONAL STANDARD

ISO/IEC JTC 1/SC 17 N 4103

ISO/IEC JTC 1/SC 2 Coded Character Sets Secretariat: Japan (JISC)

ISO/IEC JTC 1/SC 2 N 3771/WG2 N2876 DATE: Corrected:

Proposal to add U+2B95 Rightwards Black Arrow to Unicode Emoji

ISO/IEC JTC1/SC2/WG2 N4157 L2/12-121

German National Body comment on SC 2 N4052 Date: Document: WG2 N3592-Germany

ISO/IEC JTC 1/SC 2 N 3840 DATE:

Proposed Update Unicode Standard Annex #34

National Body Comments ISO/IEC PDTS C++ Extensions for Transactional Memory

Proposal to encode fourteen Pakistani Quranic marks

ISO/IEC JTC1/SC2/WG2 N4445

1. Introduction. 2. Encoding Considerations

Interdisciplinary Journal of Best Practices in Global Development Final Manuscript Preparation Guidelines

ISO/IEC JTC 1/SC 2 N 3514 WG 2 N2328 L2/01-114

Roadmap to the SMP. Michael Everson, Rick McGowan, Ken Whistler.

The solid pentagram, as found in the top row of the figure above, is also used in mysticism and occult contexts (see figures in section 6).

1.1 The digit forms propagated by the Dozenal Society of Great Britain

ISO/IEC JTC 1/SC 32 N 2406

Reported by Jim Moore, The MITRE Corporation, ,

ISO/IEC JTC 1/SC 32 N 2325

Information technology Keyboard layouts for text and office systems. Part 9: Multi-lingual, multiscript keyboard layouts

Transcription:

ISO/IEC JTC1/SC2/WG2 N 4871 Date: 2017-09-28 ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC) Doc. Type: Disposition of comments Title: Disposition of comments on PDAM2 to ISO/IEC 10646 5 th edition Source: Michel Suignard (project editor) Project: JTC1 02.10646.00.02.00.05 Status: For review by WG2 Date: 2017-09-28 Distribution: WG2 Reference: SC2 N4547 WG2 N4858 Medium: Paper, PDF file Comments were received from the following members: Ireland, Japan, and USA. The following document is the disposition of those comments. The disposition is organized per country. Note With some minor exceptions, the full content of the ballot comments has been included in this document to facilitate the reading. The dispositions are inserted in between these comments and are marked in Underlined Bold Serif text, with explanatory text in italicized serif. Because of these dispositions, Ireland changed its vote to Yes, leaving only one Negative vote (Japan) Page 1

Results of the dispositions in terms of code point allocation Character added: A7C0 LATIN CAPITAL LETTER THORN WITH DIAGONAL STROKE A7C1 LATIN SMALL LETTER THORN WITH DIAGONAL STROKE A7C2 LATIN CAPITAL LETTER ANGLICANA W A7C3 LATIN SMALL LETTER ANGLICANA W Characters removed: 1B132 HIRAGANA LETTER SMALL KO 1B155 KATAKANA LETTER SMALL KO Names changes 13432 EGYPTIAN HIEROGLYPH START AT TOP 13433 EGYPTIAN HIEROGLYPH START AT BOTTOM 13434 EGYPTIAN HIEROGLYPH END AT TOP 13435 EGYPTIAN HIEROGLYPH END AT BOTTOM 13437 EGYPTIAN HIEROGLYPH BEGIN SEGMENT 13438 EGYPTIAN HIEROGLYPH END SEGMENT 1FA08 NEUTRAL CHESS KNIGHT ROTATED FORTY-FIVE DEGREES Block name change Egyptian Hieroglyphs Format Controls to Egyptian Hieroglyph Format Controls Page 2

Ireland: Negative Ireland disapproves the draft with the editorial comments given below. Acceptance of these comments and appropriate changes to the text will change our vote to approval. Technical comments: T1. Page 8, Row A72: Latin Extended-D With regard to N4836 Proposal to add LATIN LETTER THORN WITH DIAGONAL STROKE to the UCS, Ireland requests the addition of LATIN CAPITAL LETTER THORN WITH DIAGONAL STROKE and LATIN SMALL LETTER THORN WITH DIAGONAL STROKE at A7C0 and A7C1 respectively. Ireland notes that other Latin letters, such as 0244 Ʉ LATIN CAPITAL LETTER U WITH STROKE and 0289 ʉ LATIN SMALL LETTER U BAR alongside A7B8 LATIN CAPITAL LETTER U WITH DIAGONAL STROKE and A7B9 LATIN SMALL LETTER U WITH DIAGONAL STROKE, are similarly distinguished for use in various languages. We consider that the arguments given in N4836 are sufficient to demonstrate significant differences both in the original manuscripts manuscript and in a good three centuries of printing traditions of the Anglicist and Nordicist communities, justifying the encoding of these characters. This was also discussed in the context of the agenda item related to N4836 in meeting WG2 #66.. T2. Page 8, Row A72: Latin Extended-D With regard to N4838 Proposal to add LATIN LETTER ANGLICANA W to the UCS, Ireland requests the addition of LATIN CAPITAL LETTER ANGLICANA W and LATIN SMALL LETTER ANGLICANA W at A7C2 and A7C3 respectively. Contrastive used in Middle Cornish and Middle English manuscripts has been demonstrated, and we are informed that a forthcoming series of editions of the Cornish corpus require these letters urgently. This was also discussed in the context of the agenda item related to N4838. T3. Page 17, Row 1343: Egyptian Hieroglyphs Format Controls Ireland believes that the character names chosen for some of these controls has not been harmonized with the character names of similar characters in the standard, such as those in the Musical Symbols, Anatolian Hieroglyphs, and Modifier Letters block. We observe that BEGIN and END are typically used, for instance, and that INSERT has not been used for any control character. START has been used only for the C0 controls (which are not formal character names). Omitting the Symbols for Control Characters, the UCS contains the following characters with START, BEGIN, and END in their names: 0001 START OF HEADING 0002 START OF TEXT 0003 END OF TEXT 0004 END OF TRANSMISSION 0017 END OF TRANSMISSION BLOCK Page 3 0019 END OF MEDIUM 0086 START OF SELECTED AREA 0087 END OF SELECTED AREA 0096 START OF GUARDED AREA 0097 END OF GUARDED AREA

0098 START OF STRING 02F9 MODIFIER LETTER BEGIN HIGH TONE 02FA MODIFIER LETTER END HIGH TONE 02FB MODIFIER LETTER BEGIN LOW TONE 02FC MODIFIER LETTER END LOW TONE 06DD ARABIC END OF AYAH 06DE ARABIC START OF RUB EL HIZB 0700 SYRIAC END OF PARAGRAPH 1A1F BUGINESE END OF SECTION 220E END OF PROOF 1D173 MUSICAL SYMBOL BEGIN BEAM 1D174 MUSICAL SYMBOL END BEAM 1D175 MUSICAL SYMBOL BEGIN TIE 1D176 MUSICAL SYMBOL END TIE 1D177 MUSICAL SYMBOL BEGIN SLUR 1D178 MUSICAL SYMBOL END SLUR 1D179 MUSICAL SYMBOL BEGIN PHRASE 1D17A MUSICAL SYMBOL END PHRASE 1D1A8 MUSICAL SYMBOL END OF STIMME Clearly some of these are just punctuation characters marking the ends of paragraphs and sections, but there are already in the UCS paired START/END and BEGIN/END characters. It appears that both kinds of functions are used for Egyptian. The insert function specifies the position of a character at the front or at the back of a cluster, while the segment delimiters function more like the Musical Symbol characters. We do not believe that the verb INSERT is needed in these names; START AT and END AT (location) are simpler and shorter, and yield simpler abbreviations; see our comment E2 below. Ireland recommends the following names be used: 13432 EGYPTIAN HIEROGLYPH START AT TOP 13433 EGYPTIAN HIEROGLYPH START AT BOTTOM 13434 EGYPTIAN HIEROGLYPH END AT TOP 13435 EGYPTIAN HIEROGLYPH END AT BOTTOM 13437 EGYPTIAN HIEROGLYPH BEGIN SEGMENT 13438 EGYPTIAN HIEROGLYPH END SEGMENT See also comment TE2 from USA and E2 from Ireland. T4. Page 17, Row 1343: Egyptian Hieroglyphs Format Controls Ireland prefers the name for this block to be Egyptian Hieroglyph Format Controls. The first plural is not necessary in the title, and is not used in the character names in the block. Page 4

Editorial comments: (All accepted glyph changes are based on receiving a font from Ireland with the appropriate glyphs.) E1. Page 5, Row 1C9: Georgian Extended Ireland requests a glyph change for two Georgian characters, 1CBA GEORGIAN MTAVRULI CAPITAL LETTER AIN used in this block, and 10FA GEORGIAN LETTER AIN in the Georgian block. The reason for this is that the glyph as shown in 1CBA can look identical to the glyph of 1C9A GEORGIAN MTAVRULI CAPITAL LETTER LAS in some font styles where the letter has one hump instead of three. This presents a security issue, because the letter LAS is extremely common, while AIN is used only in transcriptions of the Batsbi language. though a glyph change should solve the problem. We attach below some samples of the shapes AIN usually takes. On the left, example from Давит Кадагидзе & Нико Кадагидзе. 1984. Цова-тушинско-грузинско-русский словарь. Тбилиси: Мецниереба. On the right, example from Kote Chrelashvili. 2002. Tsova-Tush language. Tbilisi. Withdrawn E2. Page 17, Row 1343: Egyptian Hieroglyphs Format Controls. Ireland requests that the glyphs for these characters be revised. Control characters in the UCS typically have identifying letter abbreviations in their dotted-box glyphs. The exception to this is the Ideographic Description Characters, which are intended to be visibly displayed graphic characters, not invisible composition controls as the annotation to the code chart states. The glyphs of the Egyptian characters in this PDAM are hardly distinguishable from those of the Ideographic Description Characters, and we believe this could be confusing to users and implementers. We propose that the glyphs below be used. Page 5

Not accepted The new proposed glyphs do not seem to present an improvement over the current glyphs. The abbreviations do not bring benefit and the currently proposed glyphs are easy to map to the symbols used in N4818 to express the concepts. Based on these dispositions, Ireland changed its vote to Positive Page 6

Japan: Negative Technical comment: T.1. Page 18, Small Kana Extension 1B130-1B16F We understand it is reasonable to reserve the coding space for allocating small Kana characters corresponding to all Kana characters except the ones already encoded. However, 9 small Kana characters allocated in this PDAM are not acceptable. As shown in the contribution that Japan NB submitted [ISO/IEC JTC 1/SC 2 N4523], we found the evidence of usage for more small Kana characters. And furthermore more small Kana characters would be found through the further investigation. Therefore, Japan NB proposes to have the discussion at next WG2 meeting. By the way, it is strange that there is no document in SC2 and WG2 document registry which proposes to encode 9 small Kana characters.. Proposed change by Japan: Delete 9 small Kana characters on this PDAM and assign the set of small Kana characters based on the result of discussion at the next WG2 meeting. in principle There are two WG2 document supporting these 9 additions: N3986: Proposal to encode Hiragana and Katakana Small Letter KO and N4818 Feedback for the two proposals to encode additional small kana characters L2/16-334 and L2/16-354. The second document is a support document and as such does not contains all the references of the original Unicode UTC documents: L2/16-334 and L2/16-354. This could have been improved by adding these two UTC documents added to the WG2 registry. After discussion at WG2 meeting #66, it was decided to remove the two KO characters and to keep the remaining 7 characters. This will be further investigated for future additions. Editorial comment: ED.1. Clause 9.2 UTF-8 Table 2 (This is not for the part of PDAM2 text, but for the existing text of ISO/IEC 10646 standard.) Proposed change by Japan: 3rd row of Scalar value column should be changed from zzzzyyyyyyxxxxx to zzzzyyyyyyxxxxxx. One x is missing. Page 7

USA: Positive with comments Comments (GE for General, TE for Technical, ED for Editorial): GE.1. Georgian Extended We strongly support the inclusion of Georgian Extended. The characters are in modern use and the user community strongly supports them. None. Noted TE.1. Chess symbols (1FA00-1FA6F) There is a typo in name U+1FA08 NEUTRAL CHESS KNIGHT ROTATED FORTYFIVE DEGREES. Change to: NEUTRAL CHESS KNIGHT ROTATED FORTY-FIVE DEGREES. TE.2. Egyptian Hieroglyphs Format Controls (13430-1343F) To be able to render Egyptian quadrat structures fully, we request the addition of U+13436 EGYPTIAN HIEROGLYPH OVERLAY MIDDLE from WG2 N4818 (=UTC L2/17-112.). Add the character U+13436 EGYPTIAN HIEROGLYPH OVERLAY MIDDLE to the Amendment See also comments T3 and E2 from Ireland. ED.1. Enclosed Alphanumeric Supplement (1F100-1F1FF) The character U+1F16C RAISED MR SIGN can also appear in a circled glyph; add a note to indicate this. Add a note to U+1F16C RAISED MR SIGN "Can appear either as isolated letters or circled letters". Not accepted This proposed annotation is not supported by evidence. ED.2. Chess symbols (1FA00-1FA6F) There is a typo in the informative aliases for U+1FA22 WHITE CHESS TURNED KNIGHT, U+1FA28 BLACK CHESS TURNED KNIGHT, and U+1FA2E NEUTRAL CHESS TURNED KNIGHT; the aliases have knightrider instead of nightrider. Change knightrider to nightrider. ED.3. Vedic Extensions (1CD0-1CFF) Remove extraneous as in note to U+1CFA VEDIC SIGN DOUBLE ANUSVARA ANTARGOMUKHA ( used as as a base for a combining nasal sign ). Make the change as described. Page 8