Indic NLP Library Documentation

Size: px
Start display at page:

Download "Indic NLP Library Documentation"

Transcription

1 Indic NLP Library Documentation Release 0.2 Anoop Kunchukuttan Apr 27, 2018

2

3 Contents 1 indicnlp Package common Module langinfo Module loader Module Subpackages Indices and tables 13 Python Module Index 15 i

4 ii

5 Contents: Contents 1

6 2 Contents

7 CHAPTER 1 indicnlp Package 1.1 common Module exception indicnlp.common.indicnlpexception(msg) Bases: exceptions.exception Exceptions thrown by Indic NLP Library components are instances of this class. msg attribute contains exception details. indicnlp.common.get_resources_path() Get the path to the Indic NLP Resources directory indicnlp.common.init() Initialize the module. The following actions are performed: Checks of INDIC_RESOURCES_PATH variable is set. If not, checks if it can beb initialized from INDIC_RESOURCES_PATH environment variable. If that fails, an exception is raised indicnlp.common.set_resources_path(resources_path) Set the path to the Indic NLP Resources directory 1.2 langinfo Module indicnlp.langinfo.get_offset(c, lang) Applicable to Brahmi derived Indic scripts indicnlp.langinfo.in_coordinated_range(c_offset) Applicable to Brahmi derived Indic scripts indicnlp.langinfo.is_approximant(c, lang) Is the character an approximant consonant indicnlp.langinfo.is_aspirated(c, lang) Is the character a aspirated consonant 3

8 indicnlp.langinfo.is_aum(c, lang) Is the character a vowel sign (maatraa) indicnlp.langinfo.is_consonant(c, lang) Is the character a consonant indicnlp.langinfo.is_dental(c, lang) Is the character a dental indicnlp.langinfo.is_fricative(c, lang) Is the character a fricative consonant indicnlp.langinfo.is_halanta(c, lang) Is the character the halanta character indicnlp.langinfo.is_indiclang_char(c, lang) Applicable to Brahmi derived Indic scripts indicnlp.langinfo.is_labial(c, lang) Is the character a labial indicnlp.langinfo.is_nasal(c, lang) Is the character a nasal consonant indicnlp.langinfo.is_nukta(c, lang) Is the character the halanta character indicnlp.langinfo.is_number(c, lang) Is the character a number indicnlp.langinfo.is_palatal(c, lang) Is the character a palatal indicnlp.langinfo.is_retroflex(c, lang) Is the character a retroflex indicnlp.langinfo.is_unaspirated(c, lang) Is the character a unaspirated consonant indicnlp.langinfo.is_unvoiced(c, lang) Is the character a unvoiced consonant indicnlp.langinfo.is_velar(c, lang) Is the character a velar indicnlp.langinfo.is_voiced(c, lang) Is the character a voiced consonant indicnlp.langinfo.is_vowel(c, lang) Is the character a vowel indicnlp.langinfo.is_vowel_sign(c, lang) Is the character a vowel sign (maatraa) indicnlp.langinfo.offset_to_char(c, lang) Applicable to Brahmi derived Indic scripts 4 Chapter 1. indicnlp Package

9 1.3 loader Module 1.4 Subpackages morph Package unsupervised_morph Module normalize Package indic_normalize Module class indicnlp.normalize.indic_normalize.bengalinormalizer(remove_nuktas=false) Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Bengali script. In addition to basic normalization by the super class, Replaces the composite characters containing nuktas by their decomposed form Replace the reserved character for poorna virama (if used) with the recommended generic Indic scripts poorna virama Canonicalize two part dependent vowels replace pipe character by poorna virama character replace colon : by visarga if the colon follows a charcter in this script NUKTA = u'\u09bc' normalize(text) class indicnlp.normalize.indic_normalize.devanagarinormalizer(remove_nuktas=false) Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Devanagari script. In addition to basic normalization by the super class, Replaces the composite characters containing nuktas by their decomposed form replace pipe character by poorna virama character replace colon : by visarga if the colon follows a charcter in this script NUKTA = u'\u093c' get_char_stats(text) normalize(text) class indicnlp.normalize.indic_normalize.gujaratinormalizer(remove_nuktas=false) Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Gujarati script. In addition to basic normalization by the super class, Replace the reserved character for poorna virama (if used) with the recommended generic Indic scripts poorna virama replace colon : by visarga if the colon follows a charcter in this script NUKTA = u'\u0abc' normalize(text) 1.3. loader Module 5

10 class indicnlp.normalize.indic_normalize.gurmukhinormalizer(remove_nuktas=false) Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Gurmukhi script. In addition to basic normalization by the super class, Replaces the composite characters containing nuktas by their decomposed form Replace the reserved character for poorna virama (if used) with the recommended generic Indic scripts poorna virama replace pipe character by poorna virama character replace colon : by visarga if the colon follows a charcter in this script NUKTA = u'\u0a3c' normalize(text) class indicnlp.normalize.indic_normalize.indicnormalizerfactory Bases: object Factory class to create language specific normalizers. get_normalizer(language, remove_nuktas=false) Call the get_normalizer function to get the language specific normalizer Paramters: language: language code remove_nuktas: boolean, should the normalizer remove nukta characters is_language_supported(language) Is the language supported? class indicnlp.normalize.indic_normalize.kannadanormalizer Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Kannada script. In addition to basic normalization by the super class, Replace the reserved character for poorna virama (if used) with the recommended generic Indic scripts poorna virama canonicalize two-part dependent vowel signs replace colon : by visarga if the colon follows a charcter in this script normalize(text) class indicnlp.normalize.indic_normalize.malayalamnormalizer Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Malayalam script. In addition to basic normalization by the super class, Replace the reserved character for poorna virama (if used) with the recommended generic Indic scripts poorna virama canonicalize two-part dependent vowel signs Change from old encoding of chillus (till Unicode 5.0) to new encoding replace colon : by visarga if the colon follows a charcter in this script normalize(text) class indicnlp.normalize.indic_normalize.normalizeri Bases: object The normalizer classes do the following: 6 Chapter 1. indicnlp Package

11 Some characters have multiple Unicode codepoints. The normalizer chooses a single standard representation Some control characters are deleted While typing using the Latin keyboard, certain typical mistakes occur which are corrected by the module Base class for normalizer. Performs some common normalization, which includes: Byte order mark, word joiner, etc. removal ZERO_WIDTH_NON_JOINER and ZERO_WIDTH_JOINER removal ZERO_WIDTH_SPACE and NO_BREAK_SPACE replaced by spaces Script specific normalizers should derive from this class and override the normalize() method. They can call the super class normalize() method to avail of the common normalization BYTE_ORDER_MARK = u'\ufeff' BYTE_ORDER_MARK_2 = u'\ufffe' NO_BREAK_SPACE = u'\xa0' SOFT_HYPHEN = u'\xad' WORD_JOINER = u'\u2060' ZERO_WIDTH_JOINER = u'\u200d' ZERO_WIDTH_NON_JOINER = u'\u200c' ZERO_WIDTH_SPACE = u'\u200b' correct_visarga(text, visarga_char, char_range) get_char_stats(text) normalize(text) Method to be implemented for normalization for each script class indicnlp.normalize.indic_normalize.oriyanormalizer(remove_nuktas=false) Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Oriya script. In addition to basic normalization by the super class, Replaces the composite characters containing nuktas by their decomposed form Replace the reserved character for poorna virama (if used) with the recommended generic Indic scripts poorna virama Canonicalize two part dependent vowels Replace va with ba replace pipe character by poorna virama character replace colon : by visarga if the colon follows a charcter in this script NUKTA = u'\u0b3c' normalize(text) class indicnlp.normalize.indic_normalize.tamilnormalizer Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Tamil script. In addition to basic normalization by the super class, 1.4. Subpackages 7

12 Replace the reserved character for poorna virama (if used) with the recommended generic Indic scripts poorna virama canonicalize two-part dependent vowel signs replace colon : by visarga if the colon follows a charcter in this script normalize(text) class indicnlp.normalize.indic_normalize.telugunormalizer(remove_nuktas=false) Bases: indicnlp.normalize.indic_normalize.normalizeri Normalizer for the Teluguscript. In addition to basic normalization by the super class, Replace the reserved character for poorna virama (if used) with the recommended generic Indic scripts poorna virama canonicalize two-part dependent vowel signs replace colon : by visarga if the colon follows a charcter in this script get_char_stats(text) normalize(text) script Package indic_scripts Module tokenize Package indic_tokenize Module indicnlp.tokenize.indic_tokenize.trivial_tokenize(s, lang= hi ) Trivial tokenizer for languages in the Indian sub-continent indicnlp.tokenize.indic_tokenize.trivial_tokenize_indic(s) A trivial tokenizer which just tokenizes on the punctuation boundaries. This also includes punctuations for the Indian lang the purna virama and the deergha virama returns a list of tokens indicnlp.tokenize.indic_tokenize.trivial_tokenize_urdu(s) A trivial tokenizer which just tokenizes on the punctuation boundaries. This also includes punctuations for the Urdu script. These punctuations characters were identified from the Unicode database for Arabic script by looking for punctuation symbols. returns a list of tokens transliterate Package itrans_transliterator Module Transliterate texts between unicode and standard transliteration schemes. Transliterate texts between non-latin scripts and commonly-used latin transliteration schemes. Uses standard Unicode character blocks e.g. DEVANAGARI U U+097F and transliteration schemes e.g. the IAST convention for transliteration of Sanskrit to latin-with-dots. 8 Chapter 1. indicnlp Package

13 The following character blocks and transliteration schemes are included: DEVANAGARI IAST ITRANS (Sanskrit only) Harvard Kyoto CYRILLIC ISO 9:1995 (Russian only) New character blocks and transliteration schemes can be added by creating new CharacterBlock and TransliterationScheme objects. COMMAND LINE USAGE python transliterator.py text inputformat outputformat... writes the transliterated text to stdout text the text to be transliterated OR the name of a file containing the text inputformat the name of the character block or transliteration scheme that the text is to be transliterated FROM, e.g. CYRILLIC, IAST. Not case-sensitive outputformat the name of the character block or transliteration scheme that the text is to be transliterated TO, e.g. CYRILLIC, IAST. Not case-sensitive USAGE Transliterate a text: >>> import transliterator >>> transliterator.transliterate('yogazcittavrttinirodhah', 'harvardkyoto',... 'devanagari', {'outputasciiencoded' : True}) 'य गश च त तव & #x924; त न र ध ' Create a new CharacterBlock and TransliterationScheme: >>> import transliterator >>> cb = transliterator.characterblock('newblock', range(0x901, 0x9FF)) >>> scheme = transliterator.transliterationscheme(cb.name, 'NEWSCHEME',... {'ab': 0x901, 'cd': 0x902}) >>> transliterator.transliterate('abcd', scheme, cb, {'outputasciiencoded' : True}) ' ' COPYRIGHT AND DISCLAIMER Transliterator is: version 0.1 software - use at your own risk. The IAST, ITRANS and Harvard-Kyoto transliteration schemes have been tested for classical Sanskrit, not for any other language. The Cyrillic alphabet and ISO 9:1995 transliteration (for Russian only) are included but have been even more lightly tested than Devanagari. Copyright (c) 2005 by Alan Little By obtaining, using, and/or copying this software and/or its associated documentation, you agree that you have read, understood, and will comply with the following terms and conditions: 1.4. Subpackages 9

14 Permission to use, copy, modify, and distribute this software and its associated documentation for any purpose and without fee is hereby granted, provided that the above copyright notice appears in all copies, and that both that copyright notice and this permission notice appear in supporting documentation, and that the name of the author not be used in advertising or publicity pertaining to distribution of the software without specific, written prior permission. THE AUTHOR DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSO- EVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE. class indicnlp.transliterate.itrans_transliterator.characterblock(name, charrange, char- Class=<class indicnlp.transliterate.itrans_transliterator Bases: dict Dictionary-like representation of a set of unicode characters. For our purposes, a character block corresponds to an alphabet/script that we want to be able to transliterate to or from, e.g. Cyrillic, Devanagari. Keys are unicode characters. Values are TLCharacter instances. class indicnlp.transliterate.itrans_transliterator.devanagaricharacter(unicodehexvalue, block) Bases: indicnlp.transliterate.itrans_transliterator.tlcharacter Special processing for Devanagari characters. class indicnlp.transliterate.itrans_transliterator.devanagaricharacterblock(name, char- Range) Bases: indicnlp.transliterate.itrans_transliterator.characterblock, indicnlp. transliterate.itrans_transliterator._devanagari Class representing the Devanagari Unicode character block. class indicnlp.transliterate.itrans_transliterator.devanagaritransliterationscheme(blockname scheme- Name, data, swaptable=none Bases: indicnlp.transliterate.itrans_transliterator.transliterationscheme, indicnlp.transliterate.itrans_transliterator._devanagari Class representing a Devanagari transliteration scheme. indicnlp.transliterate.itrans_transliterator.itrans = {'R^i': 2315, 'gh': 2328, 'ld': 23 ITrans uses some characters only in common ligatures. The easiest way to deal with these is to replace them with their normal consonant equivalents before we try to transliterate. (This assumes we are mainly transliterating itrans inbound, and that the normal consonants are acceptable outbound. ITrans is not a good choice for outbound anyway because it has so many ambiguities) 10 Chapter 1. indicnlp Package

15 class indicnlp.transliterate.itrans_transliterator.tlcharacter(unicodehexvalue, block) Bases: object Class representing a Unicode character with its equivalents. Public attributes: unicodehexvalue the numeric value of the Unicode code point. unichr the character value of the Unicode code point. name the name of the Unicode code point. equivalents a dict containing the character s equivalents in various transliteration schemes, in the format: { Scheme A : A, Scheme B : aah, } where keys are TransliterationScheme names, values are transliterated equivalents of the character. addequivalent(equivname, equivalent) Add an equivalent for the character. Arguments: equivname the name of a TransliterationScheme equivalent string/unicode equivalent in the named TransliterationScheme for this code point. unicodehexvalue = None Use name check to filter out unused characters. unicodedata.name() raises ValueError for these class indicnlp.transliterate.itrans_transliterator.transliterationscheme(blockname, scheme- Name, data, swaptable=none) Bases: dict Dictionary-like representation of a transliteration scheme. e.g. the Harvard-Kyoto, IAST or ITRANS schemes for transliterating Devanagari to or from the latin alphabet. Keys are unicode strings representing the letter-equivalents used in the transliteration scheme. TLCharacter instances. indicnlp.transliterate.itrans_transliterator.main(argv=none) Call transliterator from a command line. python transliterator.py text inputformat outputformat... writes the transliterated text to stdout Values are text the text to be transliterated OR the name of a file containing the text inputformat the name of the character block or transliteration scheme that the text is to be transliterated FROM, e.g. CYRILLIC, IAST. Not case-sensitive outputformat the name of the character block or transliteration scheme that the text is to be transliterated TO, e.g. CYRILLIC, IAST. Not case-sensitive indicnlp.transliterate.itrans_transliterator.resetoptions() Reset options to their default values. indicnlp.transliterate.itrans_transliterator.transliterate(text, informat, out- Format, requestoptions={}) Transliterate a text Subpackages 11

16 Keyword arguments: text a unicode string containing the text to be transliterated informat the from CharacterBlock or TransliterationScheme, or its name outformat the target CharacterBlock or TransliterationScheme, or its name requestoptions optional dict containing option settings that override the defaults for this request. Returns a unicode object containing the text transliterated into the target character set. Raises: ValueError unrecognised input or output format. KeyError a character in text is not a member of informat, or has no corresponding character defined in outformat. sinhala_transliterator Module class indicnlp.transliterate.sinhala_transliterator.sinhaladevanagaritransliterator Bases: object A Devanagari to Sinhala transliterator based on explicit Unicode Mapping static devanagari_to_sinhala(text) devnag_sinhala_map = {u'\u0901': u'\u0d82', u'\u0900': u'\u0d82', u'\u0903': u'\u0d8 sinhala_devnag_map = {u'\u0d83': u'\u0903', u'\u0d82': u'\u0902', u'\u0d85': u'\u090 static sinhala_to_devanagari(text) unicode_transliterate Module class indicnlp.transliterate.unicode_transliterate.itranstransliterator Bases: object Transliterator between Indian scripts and ITRANS static from_itrans(text, lang_code) static to_itrans(text, lang_code) class indicnlp.transliterate.unicode_transliterate.unicodeindictransliterator Bases: object Base class for rule-based transliteration among Indian languages. Script pair specific transliterators should derive from this class and override the transliterate() method. They can call the super class transliterate() method to avail of the common transliteration static transliterate(text, lang1_code, lang2_code) convert the source language script (lang1) to target language script (lang2) text: text to transliterate lang1_code: language 1 code lang1_code: language 2 code 12 Chapter 1. indicnlp Package

17 CHAPTER 2 Indices and tables genindex modindex search 13

18 14 Chapter 2. Indices and tables

19 Python Module Index i indicnlp.common, 3 indicnlp.langinfo, 3 indicnlp.normalize.indic_normalize, 5 indicnlp.tokenize.indic_tokenize, 8 indicnlp.transliterate.itrans_transliterator, 8 indicnlp.transliterate.sinhala_transliterator, 12 indicnlp.transliterate.unicode_transliterate, 12 15

20 16 Python Module Index

21 Index A addequivalent() (indicnlp.transliterate.itrans_transliterator.tlcharacter F from_itrans() (indicnlp.transliterate.unicode_transliterate.itranstransliterato static method), 12 method), 11 BYTE_ORDER_MARK_2 (indicnlp.normalize.indic_normalize.normalizeri attribute), 7 B BengaliNormalizer (class in indicnlp.normalize.indic_normalize), 5 BYTE_ORDER_MARK (indicnlp.normalize.indic_normalize.normalizeri attribute), 7 C CharacterBlock (class in indicnlp.transliterate.itrans_transliterator), 10 correct_visarga() D (indicnlp.normalize.indic_normalize.normalizeri method), 7 G devanagari_to_sinhala() (indicnlp.transliterate.sinhala_transliterator.sinhaladevanagaritransliterator I static method), 12 DevanagariCharacter (class in indicnlp.transliterate.itrans_transliterator), in_coordinated_range() (in module indicnlp.langinfo), 3 indicnlp.common (module), 3 10 indicnlp.langinfo (module), 3 DevanagariCharacterBlock (class in indicnlp.transliterate.itrans_transliterator), indicnlp.normalize.indic_normalize (module), 5 10 indicnlp.tokenize.indic_tokenize (module), 8 DevanagariNormalizer (class in indicnlp.normalize.indic_normalize), indicnlp.transliterate.itrans_transliterator (module), 8 5 indicnlp.transliterate.sinhala_transliterator (module), 12 DevanagariTransliterationScheme (class in indicnlp.transliterate.itrans_transliterator), indicnlp.transliterate.unicode_transliterate (module), devnag_sinhala_map (indicnlp.transliterate.sinhala_transliterator.sinhaladevanagaritransliteratonlp.normalize.indic_normalize), IndicNlpException, 3 IndicNormalizerFactory (class in indic- 6 attribute), 12 init() (in module indicnlp.common), 3 is_approximant() (in module indicnlp.langinfo), 3 is_aspirated() (in module indicnlp.langinfo), 3 get_char_stats() (indicnlp.normalize.indic_normalize.devanagarinormalizer method), 5 get_char_stats() (indicnlp.normalize.indic_normalize.normalizeri method), 7 get_char_stats() (indicnlp.normalize.indic_normalize.telugunormalizer method), 8 get_normalizer() (indicnlp.normalize.indic_normalize.indicnormalizerfactory method), 6 get_offset() (in module indicnlp.langinfo), 3 get_resources_path() (in module indicnlp.common), 3 GujaratiNormalizer (class in indicnlp.normalize.indic_normalize), 5 GurmukhiNormalizer (class in indicnlp.normalize.indic_normalize), 5 17

22 is_aum() (in module indicnlp.langinfo), 3 is_consonant() (in module indicnlp.langinfo), 4 is_dental() (in module indicnlp.langinfo), 4 is_fricative() (in module indicnlp.langinfo), 4 is_halanta() (in module indicnlp.langinfo), 4 is_indiclang_char() (in module indicnlp.langinfo), 4 is_labial() (in module indicnlp.langinfo), 4 is_language_supported() (indic- nlp.normalize.indic_normalize), 6 nlp.normalize.indic_normalize.indicnormalizerfactory NUKTA (indicnlp.normalize.indic_normalize.bengalinormalizer method), 6 attribute), 5 is_nasal() (in module indicnlp.langinfo), 4 NUKTA (indicnlp.normalize.indic_normalize.devanagarinormalizer is_nukta() (in module indicnlp.langinfo), 4 attribute), 5 is_number() (in module indicnlp.langinfo), 4 NUKTA (indicnlp.normalize.indic_normalize.gujaratinormalizer is_palatal() (in module indicnlp.langinfo), 4 attribute), 5 is_retroflex() (in module indicnlp.langinfo), 4 NUKTA (indicnlp.normalize.indic_normalize.gurmukhinormalizer is_unaspirated() (in module indicnlp.langinfo), 4 attribute), 6 is_unvoiced() (in module indicnlp.langinfo), 4 NUKTA (indicnlp.normalize.indic_normalize.oriyanormalizer is_velar() (in module indicnlp.langinfo), 4 attribute), 7 is_voiced() (in module indicnlp.langinfo), 4 is_vowel() (in module indicnlp.langinfo), 4 O is_vowel_sign() (in module indicnlp.langinfo), 4 offset_to_char() (in module indicnlp.langinfo), 4 ITRANS (in module indicnlp.transliterate.itrans_transliterator), 10 nlp.normalize.indic_normalize), 7 OriyaNormalizer (class in indic- ItransTransliterator (class in indicnlp.transliterate.unicode_transliterate), 12 K KannadaNormalizer (class in indicnlp.normalize.indic_normalize), 6 M main() (in module indicnlp.transliterate.itrans_transliterator), 11 MalayalamNormalizer (class in indicnlp.normalize.indic_normalize), 6 N NO_BREAK_SPACE (indicnlp.normalize.indic_normalize.normalizeri attribute), 7 normalize() (indicnlp.normalize.indic_normalize.bengalinormalizer method), 5 normalize() (indicnlp.normalize.indic_normalize.devanagarinormalizer normalize() (indicnlp.normalize.indic_normalize.oriyanormalizer method), 7 normalize() (indicnlp.normalize.indic_normalize.tamilnormalizer method), 8 normalize() (indicnlp.normalize.indic_normalize.telugunormalizer method), 8 NormalizerI (class in indic- R resetoptions() (in module indicnlp.transliterate.itrans_transliterator), 11 S nlp.transliterate.sinhala_transliterator.sinhaladevanagaritransliter nlp.transliterate.sinhala_transliterator.sinhaladevanagaritransliter set_resources_path() (in module indicnlp.common), 3 sinhala_devnag_map (indic- attribute), 12 sinhala_to_devanagari() (indic- static method), 12 SinhalaDevanagariTransliterator (class in indicnlp.transliterate.sinhala_transliterator), 12 SOFT_HYPHEN T (indicnlp.normalize.indic_normalize.normalizeri attribute), 7 method), 5 TamilNormalizer (class in indicnlp.normalize.indic_normalize), 7 normalize() (indicnlp.normalize.indic_normalize.gujaratinormalizer method), 5 TeluguNormalizer (class in indicnlp.normalize.indic_normalize), 8 normalize() (indicnlp.normalize.indic_normalize.gurmukhinormalizer method), 6 TLCharacter (class in indicnlp.transliterate.itrans_transliterator), 10 normalize() (indicnlp.normalize.indic_normalize.kannadanormalizer method), 6 to_itrans() (indicnlp.transliterate.unicode_transliterate.itranstransliterator normalize() (indicnlp.normalize.indic_normalize.malayalamnormalizer static method), 12 method), 6 transliterate() (in module indicnlp.transliterate.itrans_transliterator), 11 normalize() (indicnlp.normalize.indic_normalize.normalizeri method), 7 18 Index

23 transliterate() (indicnlp.transliterate.unicode_transliterate.unicodeindictransliterator static method), 12 TransliterationScheme (class in indicnlp.transliterate.itrans_transliterator), 11 trivial_tokenize() (in module indicnlp.tokenize.indic_tokenize), 8 trivial_tokenize_indic() (in module indicnlp.tokenize.indic_tokenize), 8 trivial_tokenize_urdu() (in module indicnlp.tokenize.indic_tokenize), 8 U unicodehexvalue (indicnlp.transliterate.itrans_transliterator.tlcharacter attribute), 11 UnicodeIndicTransliterator (class in indicnlp.transliterate.unicode_transliterate), 12 W WORD_JOINER (indicnlp.normalize.indic_normalize.normalizeri attribute), 7 Z ZERO_WIDTH_JOINER (indicnlp.normalize.indic_normalize.normalizeri attribute), 7 ZERO_WIDTH_NON_JOINER (indicnlp.normalize.indic_normalize.normalizeri attribute), 7 ZERO_WIDTH_SPACE (indicnlp.normalize.indic_normalize.normalizeri attribute), 7 Index 19

Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet

Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet A. Sean Pue South Asia Language Resource Center Pre-SASLI Workshop 6/7/09 1 Objectives To understand how

More information

Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent

Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent Anoop Kunchukuttan IIT Bombay anoopk@cse.iitb.ac.in Ratish Puduppully IIIT Hyderabad ratish.surendran

More information

Transliteration of Tamil and Other Indic Scripts. Ram Viswanadha Unicode Software Engineer IBM Globalization Center of Competency, California, USA

Transliteration of Tamil and Other Indic Scripts. Ram Viswanadha Unicode Software Engineer IBM Globalization Center of Competency, California, USA Transliteration of Tamil and Other Indic Scripts Ram Viswanadha Unicode Software Engineer IBM Globalization Center of Competency, California, USA Main points of Powerpoint presentation This talk gives

More information

PROPOSALS FOR MALAYALAM AND TAMIL SCRIPTS ROOT ZONE LABEL GENERATION RULES

PROPOSALS FOR MALAYALAM AND TAMIL SCRIPTS ROOT ZONE LABEL GENERATION RULES PROPOSALS FOR MALAYALAM AND TAMIL SCRIPTS ROOT ZONE LABEL GENERATION RULES Publication Date: 23 November 2018 Prepared By: IDN Program, ICANN Org Public Comment Proceeding Open Date: 25 September 2018

More information

Proposals For Devanagari, Gurmukhi, And Gujarati Scripts Root Zone Label Generation Rules

Proposals For Devanagari, Gurmukhi, And Gujarati Scripts Root Zone Label Generation Rules Proposals For Devanagari, Gurmukhi, And Gujarati Scripts Root Zone Label Generation Rules Publication Date: 20 October 2018 Prepared By: IDN Program, ICANN Org Public Comment Proceeding Open Date: 27 July

More information

Feedback on Draft Devanagari Script Behaviour for Hindi Ver 1.4.9

Feedback on Draft Devanagari Script Behaviour for Hindi Ver 1.4.9 Feedback on Draft Devanagari Script Behaviour for Hindi Ver 1.4.9 S. Page Remarks Concern Status No. Version No. 1 1.4.9 Test Report of Akshara, is missing. Pending Pl. check Annexure 5: Definition of

More information

ISO INTERNATIONAL STANDARD. Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters

ISO INTERNATIONAL STANDARD. Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters INTERNATIONAL STANDARD ISO 15919 First edition 2001-10-01 Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters Information et documentation Translittération

More information

REMARKS ON THE USE OF ZWJ & ZWNJ IN THE BRAHMI AND PERSO- ARABIC FAMILIES

REMARKS ON THE USE OF ZWJ & ZWNJ IN THE BRAHMI AND PERSO- ARABIC FAMILIES REMARKS ON THE USE OF ZWJ & ZWNJ IN THE BRAHMI AND PERSO- ARABIC FAMILIES The use of ZWJ/ZWNJ is at two levels: 1. EMBEDDED WITHIN THE OPEN TYPE FONT In the creation of Open Type Fonts where the ZWJ /

More information

transliteration Documentation

transliteration Documentation transliteration Documentation Release 0.4.1 Santhosh Thottingal January 09, 2017 Contents 1 transliteration API 3 2 Indices and tables 5 Python Module Index 7 i ii transliteration Documentation, Release

More information

L2/ Unicode A genda Agenda for for Bangla Bangla Bidyut Baran Chaudhuri

L2/ Unicode A genda Agenda for for Bangla Bangla Bidyut Baran Chaudhuri Unicode Agenda for Bangla Bidyut Baran Chaudhuri Society for Natural Language Technology Research & Indian Statistical Institute, Kolkata, India Indian Script and Bangla Most Indian Scripts are derived

More information

The Unicode Standard Version 6.1 Core Specification

The Unicode Standard Version 6.1 Core Specification The Unicode Standard Version 6.1 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Request for encoding GRANTHA LENGTH MARK

Request for encoding GRANTHA LENGTH MARK Request for encoding 11355 GRANTHA LENGTH MARK Shriramana Sharma jamadagni-at-gmail-dot-com 2009-Oct-25 This is a request for encoding a character in the Grantha block. While I have only recently submitted

More information

Proposal on Handling Reph in Gurmukhi and Telugu Scripts

Proposal on Handling Reph in Gurmukhi and Telugu Scripts Proposal on Handling Reph in Gurmukhi and Telugu Scripts Nagarjuna Venna August 1, 2006 1 Introduction Chapter 9 of the Unicode standard [1] describes the representational model for encoding Indic scripts.

More information

The Unicode Standard Version 10.0 Core Specification

The Unicode Standard Version 10.0 Core Specification The Unicode Standard Version 10.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

E- MĀDHAVANIDĀNAM - User Manual

E- MĀDHAVANIDĀNAM - User Manual E- MĀDHAVANIDĀNAM - User Manual Index HOME...2 Home page...2 Select Language Script...3 READ Madhavanidana...3 Select Chapter (Adhyaya)...5 Select Adhikarana (Sloka or Sloka samuha)...6 To read entire

More information

As per given sort order at Pg 58, kindly mention position of standalone क in tabular format. BY Others

As per given sort order at Pg 58, kindly mention position of standalone क in tabular format. BY Others Page S. No. Version No. Concern Status 1 1.4.8 The draft should be vetted by two other independent wellknown Pending linguists, other than author(s) and their details to be shared. By TDIL Programme 2

More information

uniseg-python Documentation

uniseg-python Documentation uniseg-python Documentation Release 0.7.1 Masaaki Shibata Apr 15, 2017 Contents 1 Modules 1 1.1 uniseg.codepoint Unicode code point............................. 1 1.2 uniseg.graphemecluster Grapheme cluster.........................

More information

DS-IASTConvert: An Automatic Script Converter between Devanagari and International Alphabet of Sanskrit Transliteration

DS-IASTConvert: An Automatic Script Converter between Devanagari and International Alphabet of Sanskrit Transliteration DS-IASTConvert: An Automatic Script Converter between Devanagari and International Alphabet of Sanskrit Transliteration Subhash Chandra, 2 Vivek Kumar, 3 Anju Assistant Professor, 2 Doctoral Research Scholar,

More information

Proposal to encode the DOGRA VOWEL SIGN VOCALIC RR

Proposal to encode the DOGRA VOWEL SIGN VOCALIC RR Proposal to encode the DOGRA VOWEL SIGN VOCALIC RR Srinidhi A and Sridatta A Tumakuru, India srinidhi.pinkpetals24@gmail.com, sridatta.jamadagni@gmail.com June 25, 2017 1 Introduction This is a proposal

More information

ESCAPE SEQUENCE G0: ESC 02/08 04/13 C0: C1: NAME Extended African Latin alphabet coded character set for bibliographic information interchange

ESCAPE SEQUENCE G0: ESC 02/08 04/13 C0: C1: NAME Extended African Latin alphabet coded character set for bibliographic information interchange TYPE: 9-character graphic character set REGISTRATION NUMBER: 9 DATE OF REGISTRATION: ESCAPE SEQUENCE G: ESC /8 / G: ESC /9 / G: ESC / / G: ESC / / C: C: NAME Extended African Latin alphabet coded character

More information

Title: Graphic representation of the Roadmap to the BMP of the UCS

Title: Graphic representation of the Roadmap to the BMP of the UCS ISO/IEC JTC1/SC2/WG2 N2045 Title: Graphic representation of the Roadmap to the BMP of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 1999-08-15 Action: For confirmation by ISO/IEC

More information

Cibu Johny, 2014-Dec-26

Cibu Johny, 2014-Dec-26 Proposal to encode MALAYALAM LETTER CHILLU Y Cibu Johny, cibu@google.com 2014-Dec-26 Discussion In the Malayalam script, a Chillu or Chillaksharam is a special vowel-less form of a consonant. In Unicode,

More information

Joiners (ZWJ/ZWNJ) with Semantic content for words in Indian subcontinent languages

Joiners (ZWJ/ZWNJ) with Semantic content for words in Indian subcontinent languages Joiners (ZWJ/ZWNJ) with Semantic content for words in Indian subcontinent languages N. Ganesan This document gives examples of Unicode joiners, ZWJ and ZWNJ where the meanings of words differ substantially

More information

Issues in Indic Language Collation

Issues in Indic Language Collation Issues in Indic Language Collation Cathy Wissink Program Manager, Windows Globalization Microsoft Corporation I. Introduction As the software market for India 1 grows, so does the interest in developing

More information

COSC 243 (Computer Architecture)

COSC 243 (Computer Architecture) COSC 243 Computer Architecture And Operating Systems 1 Dr. Andrew Trotman Instructors Office: 123A, Owheo Phone: 479-7842 Email: andrew@cs.otago.ac.nz Dr. Zhiyi Huang (course coordinator) Office: 126,

More information

WireXfers Documentation

WireXfers Documentation WireXfers Documentation Release 2014.06-dev Priit Laes June 21, 2016 Contents 1 User Guide 3 1.1 Introduction............................................... 3 1.2 Using WireXfers.............................................

More information

GONDI and GUNJALA GONDI CHARACTER NAMES Vowels EE and OO. Comment on GONDI (L2/15-005) and GUNJALA GONDI (L2/ ) proposals

GONDI and GUNJALA GONDI CHARACTER NAMES Vowels EE and OO. Comment on GONDI (L2/15-005) and GUNJALA GONDI (L2/ ) proposals GONDI and GUNJALA GONDI CHARACTER NAMES Vowels EE and OO Comment on GONDI (L2/15-005) and GUNJALA GONDI (L2/15-086 ) proposals Naga Ganesan (naa.ganesan@gmail.com) Abstract: This document requests naming

More information

Proposal to encode Kannada Sign Spacing Candrabindu

Proposal to encode Kannada Sign Spacing Candrabindu Proposal to encode Kannada Sign Spacing Candrabindu Vinodh Rajan vrs3@st-andrews.ac.uk Introduction The Badaga language is a minority language in the Indian state of Tamil Nadu. It is spoken by approximately

More information

e - HAND BOOK OF DOMESTIC MEDICINE AND COMMON AYURVEDIC REMEDIES User Manual

e - HAND BOOK OF DOMESTIC MEDICINE AND COMMON AYURVEDIC REMEDIES User Manual e - HAND BOOK OF DOMESTIC MEDICINE AND COMMON AYURVEDIC REMEDIES User Manual Index Home page...2 About us...3 Read Book...4 Select Chapters...4 Browsing through a Chapter:...5 Using Table of Contents...5

More information

Issues in Indic Language Collation

Issues in Indic Language Collation Issues in Indic Language Collation Cathy Wissink Program Manager, Windows Globalization Microsoft Corporation I. Introduction As the software market for India i grows, so does the interest in developing

More information

ஒர ங க ற ததத ற றம ம தகத ட பத ட ம ம னவர ரமணஶர மத இந த யவ யல/ததத ழ லந ட ப ஆய வத ளர தம ழ நத ட

ஒர ங க ற ததத ற றம ம தகத ட பத ட ம ம னவர ரமணஶர மத இந த யவ யல/ததத ழ லந ட ப ஆய வத ளர தம ழ நத ட ஒர ங க ற ததத ற றம ம தகத ட பத ட ம ம னவர ரமணஶர மத இந த யவ யல/ததத ழ லந ட ப ஆய வத ளர தம ழ நத ட Genesis and Philosophy of Unicode Shriramana Sharma, Ph D Indology/Technology Research Scholar Tamil Nadu jamadagni

More information

Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS

Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS ISO/IEC JTC1/SC2/WG2 N2316 Title: Graphic representation of the Roadmap to the BMP, Plane 0 of the UCS Source: Ad hoc group on Roadmap Status: Expert contribution Date: 2001-01-09 Action: For confirmation

More information

Domain Names in Pakistani Languages. IDNs for Pakistani Languages

Domain Names in Pakistani Languages. IDNs for Pakistani Languages ا ہ 6 5 a ز @ ں ب Domain Names in Pakistani Languages س a ی س a ب او اور را < ہ ر @ س a آف ا ر ا 6 ب 1 Domain name Domain name is the address of the web page pg on which the content is located 2 Internationalized

More information

Proposal to encode MALAYALAM SIGN PARA

Proposal to encode MALAYALAM SIGN PARA Proposal to encode MALAYALAM SIGN PARA Introduction Cibu Johny, cibu@google.com 2014-Jan-16 Historically Paṟa has been an important measurement unit in Kerala, for measuring rice grain. The word also described

More information

COMTool User Guide Version /02/2013

COMTool User Guide Version /02/2013 1 of 20 Contents Terms and Conditions of License for use of gratuitous software... 3 Purpose:... 3 Installation Instructions:... 4 Using COMTool:... 8 Select a COM Port... 8 Select a product... 8 Selecting

More information

VTT INFORMATION TECHNOLOGY. Nipper

VTT INFORMATION TECHNOLOGY. Nipper VTT INFORMATION TECHNOLOGY Nipper ADVANCED USER GUIDE Copyright Copyright VTT Information Technology 2003. All rights reserved. The information in this document is subject to change without notice and

More information

Kannada 2. L2/ Representation of Jihvamuliya and Upadhmaniya in Kannada Srinidhi

Kannada 2. L2/ Representation of Jihvamuliya and Upadhmaniya in Kannada Srinidhi TO: UTC L2/14 XXX FROM: Deborah Anderson, Ken Whistler, Rick McGowan, Roozbeh Pournader, and Laurentiu Iancu SUBJECT: Recommendations to UTC #138 February 2014 on Script Proposals DATE: 26 January 2014

More information

Getting Started (No installation necessary) Windows On Windows systems, simply double click the AntGram icon to launch the program.

Getting Started (No installation necessary) Windows On Windows systems, simply double click the AntGram icon to launch the program. AntGram (Windows) Build 1.0 (Released September 22, 2018) Laurence Anthony, Ph.D. Center for English Language Education in Science and Engineering, School of Science and Engineering, Waseda University,

More information

OpenType Font by Harsha Wijayawardhana UCSC

OpenType Font by Harsha Wijayawardhana UCSC OpenType Font by Harsha Wijayawardhana UCSC Introduction The OpenType font format is an extension of the TrueType font format, adding support for PostScript font data. The OpenType font format was developed

More information

Atrio Map. Installation and User Guide

Atrio Map. Installation and User Guide Atrio Map Version 8 Installation and User Guide Atrio 6001 N. Adams Road, Suite 285 Bloomfield Hills, Michigan 48304-1547 (248) 594-1500 - www.atriocrm.com Table of Contents Introduction... 3 Requirements/Compatibility...

More information

ISO/IEC JTC 1/SC 2/WG 2 Proposal summary form N2652-F accompanies this document.

ISO/IEC JTC 1/SC 2/WG 2 Proposal summary form N2652-F accompanies this document. Dated: April 28, 2006 Title: Proposal to add TAMIL OM Source: International Forum for Information Technology in Tamil (INFITT) Action: For consideration by UTC and ISO/IEC JTC 1/SC 2/WG 2 Distribution:

More information

Cassette Documentation

Cassette Documentation Cassette Documentation Release 0.3.8 Charles-Axel Dein October 01, 2015 Contents 1 User s Guide 3 1.1 Foreword................................................. 3 1.2 Quickstart................................................

More information

097B Ä DEVANAGARI LETTER GGA 097C Å DEVANAGARI LETTER JJA 097E Ç DEVANAGARI LETTER DDDA 097F É DEVANAGARI LETTER BBA

097B Ä DEVANAGARI LETTER GGA 097C Å DEVANAGARI LETTER JJA 097E Ç DEVANAGARI LETTER DDDA 097F É DEVANAGARI LETTER BBA ISO/IEC JTC1/SC2/WG2 N2934 L2/05-082 2005-03-30 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation еждународная организация

More information

Getting Started (No installation necessary)

Getting Started (No installation necessary) ProtAnt (Windows) Build 1.2.1 (Released March 21, 2017) Laurence Anthony, Ph.D. Center for English Language Education in Science and Engineering, School of Science and Engineering, Waseda University, 3-4-1

More information

PyQ Documentation. Release 3.8. Enlightenment Research, LLC.

PyQ Documentation. Release 3.8. Enlightenment Research, LLC. PyQ Documentation Release 3.8 Enlightenment Research, LLC. November 21, 2016 Contents 1 Quickstart 3 2 Table of Contents 5 2.1 Installation................................................ 5 2.1.1 OS Support...........................................

More information

ISO INTERNATIONAL STANDARD. Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters

ISO INTERNATIONAL STANDARD. Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters INTERNATIONAL STANDARD ISO 15919 First edition 2001-10-01 Information and documentation Transliteration of Devanagari and related Indic scripts into Latin characters Information et documentation Translittération

More information

Nafees Pakistani Naskh v2.01 beta

Nafees Pakistani Naskh v2.01 beta Nafees Pakistani Naskh v2.01 beta Release Notes November 07, 2007 CENTER FOR RESEARCH IN URDU LANGUAGE PROCESSING NATIONAL UNIVERSITY OF COMPUTER AND EMERGING SCIENCES, LAHORE PAKISTAN Table of Contents

More information

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC A.

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC A. JTC1/SC2/WG2 N3710 ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 A. Administrative 1 Title: Proposal to add Six characters in the

More information

PyWin32ctypes Documentation

PyWin32ctypes Documentation PyWin32ctypes Documentation Release 0.1.3.dev1 David Cournapeau, Ioannis Tziakos Sep 01, 2017 Contents 1 Usage 3 2 Development setup 5 3 Reference 7 3.1 PyWin32 Compatibility Layer......................................

More information

Release notes for CPLEX

Release notes for CPLEX IBM ILOG CPLEX V12.1 Release notes for CPLEX Copyright International Business Machines Corporation 1987, 2009 US Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP

More information

L2/ Proposal to encode archaic vowel signs O OO for Kannada. 1. Thanks. 2. Introduction

L2/ Proposal to encode archaic vowel signs O OO for Kannada. 1. Thanks. 2. Introduction L2/14-004 Proposal to encode archaic vowel signs O OO for Kannada Shriramana Sharma, jamadagni-at-gmail-dot-com, India 2013-Dec-31 1. Thanks I thank Srinidhi of Tumkur, Karnataka, for alerting me to these

More information

Unicode and Standardized Notation. Anthony Aristar

Unicode and Standardized Notation. Anthony Aristar Data Management and Archiving University of California at Santa Barbara, June 24-27, 2008 Unicode and Standardized Notation Anthony Aristar Once upon a time There were people who decided to invent computers.

More information

Andrew Glass and Shriramana Sharma. anglass-at-microsoft-dot-com jamadagni-at-gmail-dot-com November-2

Andrew Glass and Shriramana Sharma. anglass-at-microsoft-dot-com jamadagni-at-gmail-dot-com November-2 Proposal to encode 1107F BRAHMI NUMBER JOINER (REVISED) Andrew Glass and Shriramana Sharma anglass-at-microsoft-dot-com jamadagni-at-gmail-dot-com 1. Background 2011-vember-2 In their Brahmi proposal L2/07-342

More information

Rendering in Dzongkha

Rendering in Dzongkha Rendering in Dzongkha Pema Geyleg Department of Information Technology pema.geyleg@gmail.com Abstract The basic layout engine for Dzongkha script was created with the help of Mr. Karunakar. Here the layout

More information

SOUTH ASIA Indic 1. Tamil Documents: L2/ Naming Tamil Symbols in SMP Vallinam Characters Ganesan

SOUTH ASIA Indic 1. Tamil Documents: L2/ Naming Tamil Symbols in SMP Vallinam Characters Ganesan TO: UTC L2/15 045 FROM: Deborah Anderson, Ken Whistler, Rick McGowan, Roozbeh Pournader, and Andrew Glass SUBJECT: Recommendations to UTC #142 February 2015 on Script Proposals DATE: 30 January 2015 The

More information

Preface. Audience. Cisco IOS Software Documentation. Organization

Preface. Audience. Cisco IOS Software Documentation. Organization This preface describes the audience, organization, and conventions of this publication, and provides information on how to obtain related documentation. Cisco documentation and additional literature are

More information

Request for encoding 1CF3 ROTATED ARDHAVISARGA

Request for encoding 1CF3 ROTATED ARDHAVISARGA Request for encoding 1CF3 ROTATED ARDHAVISARGA Shriramana Sharma jamadagni-at-gmail-dot-com 2009-Oct-09 This is a request for encoding a character in the Vedic Extensions block. This character resembles

More information

X Locale Database Specification. Yoshio Horiuchi, IBM Japan

X Locale Database Specification. Yoshio Horiuchi, IBM Japan X Locale Database Specification Yoshio Horiuchi, IBM Japan X Locale Database Specification by Yoshio Horiuchi X Version 11, Release 7.7 Copyright 1994 IBM Corporation License to use, copy, modify, and

More information

Naela Sarras :19

Naela Sarras :19 Naela Sarras 2011-09-13 10:19 To Naela Sarras , "devanagari-vip@icann.org" , "drgovind@nic.in" , "dr.omvikas@gmail.com"

More information

Local deployment consists of locally storing the IT Policy file on each user s workstation.

Local deployment consists of locally storing the IT Policy file on each user s workstation. Implementing the IT Policy Manager for Originally posted: March 2002 Summary The IT policy manager enables you to define and restrict the settings for users desktop software. It enables you to provide

More information

monolith Documentation

monolith Documentation monolith Documentation Release 0.3.3 Łukasz Balcerzak December 16, 2013 Contents 1 Usage 3 1.1 Execution manager............................................ 3 1.2 Creating commands...........................................

More information

Letters.org. CONSTRUCTION COMPANY INTRODUCTION LETTER. Included: Construction Company Introduction Letter

Letters.org.   CONSTRUCTION COMPANY INTRODUCTION LETTER. Included: Construction Company Introduction Letter Letters.org CONSTRUCTION COMPANY INTRODUCTION LETTER Included: Introduction Tips Sample Template 1 Introduction An introduction letter is basically used as a tool of marketing. An introduction letter mainly

More information

Small Logger File System

Small Logger File System Small Logger File System (http://www.tnkernel.com/) Copyright 2011 Yuri Tiomkin Document Disclaimer The information in this document is subject to change without notice. While the information herein is

More information

Thu Jun :48:11 Canada/Eastern

Thu Jun :48:11 Canada/Eastern Roadmaps to Unicode Thu Jun 24 2004 17:48:11 Canada/Eastern Home Site Map Search Tables Roadmap Introduction Roadmap to the BMP (Plane 0) Roadmap to the SMP (Plane 1) Roadmap to the SIP (Plane 2) Roadmap

More information

Migrating BlackBerry-enabled mailboxes from Microsoft Exchange 5.5 to Microsoft Exchange 2000

Migrating BlackBerry-enabled mailboxes from Microsoft Exchange 5.5 to Microsoft Exchange 2000 Migrating BlackBerry-enabled mailboxes from Originally posted: August 2002 Affected software versions BlackBerry Enterprise for Exchange Summary This document provides an overview of how to migrate a BlackBerry-enabled

More information

1. Introduction 2. TAMIL LETTER SHA Character proposed in this document About INFITT and INFITT WG

1. Introduction 2. TAMIL LETTER SHA Character proposed in this document About INFITT and INFITT WG Dated: September 14, 2003 Title: Proposal to add TAMIL LETTER SHA Source: International Forum for Information Technology in Tamil (INFITT) Action: For consideration by UTC and ISO/IEC JTC 1/SC 2/WG 2 Distribution:

More information

WordEmbeddingLoader Documentation

WordEmbeddingLoader Documentation WordEmbeddingLoader Documentation Release 0.2.0 Yuta Koreeda Aug 14, 2017 Modules 1 Issues with encoding 3 2 Development 5 3 CHANGELOG 7 3.1 v0.2.................................................... 7

More information

Proposal to encode Devanagari Sign High Spacing Dot

Proposal to encode Devanagari Sign High Spacing Dot Proposal to encode Devanagari Sign High Spacing Dot Jonathan Kew, Steve Smith SIL International April 20, 2006 1. Introduction In several language communities of Nepal, the Devanagari script has been adapted

More information

Proposal to encode characters for Extended Tamil. 1. Introduction

Proposal to encode characters for Extended Tamil. 1. Introduction Proposal to encode characters for Extended Tamil Shriramana Sharma, jamadagni-at-gmail-dot-com, India 2010-Jul-10 This document requests the encoding of characters which are required to enable the Tamil

More information

python Roll: Users Guide 5.5 Edition

python Roll: Users Guide 5.5 Edition python Roll: Users Guide 5.5 Edition python Roll: Users Guide : 5.5 Edition Published May 08 2012 Copyright 2012 The copyright holder, and UC Regents Table of Contents Preface...iv 1. Installing the python

More information

DoJSON Documentation. Release Invenio collaboration

DoJSON Documentation. Release Invenio collaboration DoJSON Documentation Release 1.2.0 Invenio collaboration March 21, 2016 Contents 1 About 1 2 Installation 3 3 Documentation 5 4 Testing 7 5 Example 9 5.1 User s Guide...............................................

More information

ndeftool documentation

ndeftool documentation ndeftool documentation Release 0.1.0 Stephen Tiedemann May 19, 2018 Contents 1 NDEFTOOL 3 1.1 Synopsis................................................. 3 1.2 Description................................................

More information

AW-UE70 Firmware Update Procedure

AW-UE70 Firmware Update Procedure Firmware Update Procedure Please be sure to read this first. Update Procedure March 2016. The update of camera is performed via a network from Web setting screen on PC. There might be accidents in the

More information

Text Record Type Definition. Technical Specification NFC Forum TM RTD-Text 1.0 NFCForum-TS-RTD_Text_

Text Record Type Definition. Technical Specification NFC Forum TM RTD-Text 1.0 NFCForum-TS-RTD_Text_ Text Record Type Definition Technical Specification NFC Forum TM RTD-Text 1.0 NFCForum-TS-RTD_Text_1.0 2006-07-24 RESTRICTIONS ON USE This specification is copyright 2005-2006 by the NFC Forum, and was

More information

Using non-latin alphabets in Blaise

Using non-latin alphabets in Blaise Using non-latin alphabets in Blaise Rob Groeneveld, Statistics Netherlands 1. Basic techniques with fonts In the Data Entry Program in Blaise, it is possible to use different fonts. Here, we show an example

More information

pyserial-asyncio Documentation

pyserial-asyncio Documentation pyserial-asyncio Documentation Release 0.4 pyserial-team Feb 12, 2018 Contents 1 Short introduction 3 2 pyserial-asyncio API 5 2.1 asyncio.................................................. 5 3 Appendix

More information

Information technology Keyboard layouts for text and office systems. Part 9: Multi-lingual, multiscript keyboard layouts

Information technology Keyboard layouts for text and office systems. Part 9: Multi-lingual, multiscript keyboard layouts INTERNATIONAL STANDARD ISO/IEC 9995-9 First edition 2016-10-01 Information technology Keyboard layouts for text and office systems Part 9: Multi-lingual, multiscript keyboard layouts Technologies de l

More information

RapidIO Interconnect Specification Part 3: Common Transport Specification

RapidIO Interconnect Specification Part 3: Common Transport Specification RapidIO Interconnect Specification Part 3: Common Transport Specification Rev. 1.3, 06/2005 Copyright RapidIO Trade Association RapidIO Trade Association Revision History Revision Description Date 1.1

More information

MultiCom/MV 1.0. Comtrol RocketPort Guide. Copyright , Viewpoint Systems, Inc. All Rights Reserved

MultiCom/MV 1.0. Comtrol RocketPort Guide. Copyright , Viewpoint Systems, Inc. All Rights Reserved MultiCom/MV 1.0 Comtrol RocketPort Guide Copyright 1994-2000, Viewpoint Systems, Inc. All Rights Reserved Viewpoint Systems, Inc. does not warrant that the Program will meet Customer s requirements or

More information

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley.

This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. This PDF file is an excerpt from The Unicode Standard, Version 4.0, issued by the Unicode Consortium and published by Addison-Wesley. The material has been modified slightly for this online edition, however

More information

AW-UE70 Firmware Update Procedure

AW-UE70 Firmware Update Procedure Firmware Update Procedure Please be sure to read this first. Update Procedure January 2018. The update of camera is performed via a network from Web setting screen on PC. There might be accidents in the

More information

NOVEL IMPLEMENTATION OF SEARCH ENGINE FOR TELUGU DOCUMENTS WITH SYLLABLE N- GRAM MODEL

NOVEL IMPLEMENTATION OF SEARCH ENGINE FOR TELUGU DOCUMENTS WITH SYLLABLE N- GRAM MODEL NOVEL IMPLEMENTATION OF SEARCH ENGINE FOR TELUGU DOCUMENTS WITH SYLLABLE N- GRAM MODEL DR.B.PADMAJA RANI* AND DR.A.VINAY BABU 1 *Associate Professor Department of CSE JNTUCEH Hyderabad A.P. India http://jntuceh.ac.in/csstaff.htm

More information

Structure Vowel signs are used in a manner similar to that employed by other Brahmi-derived scripts. Consonants have an inherent /a/ vowel sound.

Structure Vowel signs are used in a manner similar to that employed by other Brahmi-derived scripts. Consonants have an inherent /a/ vowel sound. ISO/IEC JTC1/SC2/WG2 N3023 L2/06-003 2006-01-11 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация

More information

Python Roll Users Guide. 6.2 Edition

Python Roll Users Guide. 6.2 Edition Python Roll Users Guide 6.2 Edition Python Roll Users Guide: 6.2 Edition Published May 07 2015 Copyright 2015 The copyright holder, and UC Regents Table of Contents Preface...iv 1. Installing the Python

More information

Introduction to Normalization and Modern Collation

Introduction to Normalization and Modern Collation Introduction to Normalization and Modern Collation Roozbeh Pournader Sharif FarsiWeb, Inc. roozbeh@farsiweb.info The gap that needed filling For compatibility reasons, Unicode has more than one way to

More information

End User License Agreement

End User License Agreement End User License Agreement Kyocera International, Inc. ( Kyocera ) End User License Agreement. CAREFULLY READ THE FOLLOWING TERMS AND CONDITIONS ( AGREEMENT ) BEFORE USING OR OTHERWISE ACCESSING THE SOFTWARE

More information

SDN Adaptive Load Balancing. Feature Description

SDN Adaptive Load Balancing. Feature Description SDN Adaptive Load Balancing Feature Description VERSION: 5.0 UPDATED: JULY 2016 Copyright Notices Copyright 2002-2016 KEMP Technologies, Inc.. All rights reserved.. KEMP Technologies and the KEMP Technologies

More information

Ecma International Policy on Submission, Inclusion and Licensing of Software

Ecma International Policy on Submission, Inclusion and Licensing of Software Ecma International Policy on Submission, Inclusion and Licensing of Software Experimental TC39 Policy This Ecma International Policy on Submission, Inclusion and Licensing of Software ( Policy ) is being

More information

HYCU SCOM Management Pack for F5 BIG-IP

HYCU SCOM Management Pack for F5 BIG-IP USER GUIDE HYCU SCOM Management Pack for F5 BIG-IP Product version: 5.5 Product release date: August 2018 Document edition: First Legal notices Copyright notice 2015-2018 HYCU. All rights reserved. This

More information

StorageGRID Webscale NAS Bridge Management API Guide

StorageGRID Webscale NAS Bridge Management API Guide StorageGRID Webscale NAS Bridge 2.0.3 Management API Guide January 2018 215-12414_B0 doccomments@netapp.com Table of Contents 3 Contents Understanding the NAS Bridge management API... 4 RESTful web services

More information

GUI 1.5 Release Notes

GUI 1.5 Release Notes GUI 1.5 Release Notes Released: June 2003 The information contained within this document is subject to change without notice. Copyright 2003 All rights reserved. No part of this documentation may be reproduced,

More information

also represented by combnining vowel matras with ē, and ō: ayɯ, eyi, ayi;

also represented by combnining vowel matras with ē, and ō: ayɯ, eyi, ayi; JTC1/SC2/WG2 N4025 L2/11-120 2011-04-22 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация

More information

Packet Trace Guide. Packet Trace Guide. Technical Note

Packet Trace Guide. Packet Trace Guide. Technical Note Packet Trace Guide Technical Note VERSION: 2.0 UPDATED: JANUARY 2016 Copyright Notices Copyright 2002-2016 KEMP Technologies, Inc.. All rights reserved.. KEMP Technologies and the KEMP Technologies logo

More information

redis-lua Documentation

redis-lua Documentation redis-lua Documentation Release 2.0.8 Julien Kauffmann October 12, 2016 Contents 1 Quick start 3 1.1 Step-by-step analysis........................................... 3 2 What s the magic at play here?

More information

josync Documentation Release 1.0 Joel Goop and Jonas Einarsson

josync Documentation Release 1.0 Joel Goop and Jonas Einarsson josync Documentation Release 1.0 Joel Goop and Jonas Einarsson May 10, 2014 Contents 1 Contents 3 1.1 Getting started.............................................. 3 1.2 Jobs....................................................

More information

RAVENNA-2-SAP Converter Installation + Operation Guide

RAVENNA-2-SAP Converter Installation + Operation Guide RAVENNA-2-SAP Converter Installation + Operation Guide Version 1.0 September 2016 RAVENNA-2-SAP Converter Guide 1 Table of Contents: 1 LEGAL 3 1.1 LICENSE 3 1.2 DISCLAIMER 3 2 INTRODUCTION 4 2.1 BACKGROUND

More information

Tamil font for powerpoint. Tamil font for powerpoint.zip

Tamil font for powerpoint. Tamil font for powerpoint.zip Tamil font for powerpoint Tamil font for powerpoint.zip 30/11/2008 How shall i present my poems which is in tamil in powerpoint slide? You can do this in power point 1.Download any tamil font that u wanttamil

More information

0WebMaker 3.0 Release Notes

0WebMaker 3.0 Release Notes 0WebMaker 3.0 Release Notes Version 3.0 WebMaker 3.0 provides a number of new features and improvements, including a simplified user interface, HTML 3.2 compatibility, and support for Casccading Style

More information

pdfcalligraph an itext 7 add-on pdfcalligraph

pdfcalligraph an itext 7 add-on   pdfcalligraph an itext 7 add-on www.itextpdf.com 1 Introduction Your business is global, shouldn t your documents be, too? The PDF format does not make it easy to support certain alphabets, but now, with the help of

More information

ISO International Organization for Standardization Organisation Internationale de Normalisation

ISO International Organization for Standardization Organisation Internationale de Normalisation ISO International Organization for Standardization Organisation Internationale de Normalisation ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS) ISO/IEC JTC 1/SC 2/WG 2 N2381R

More information