Arabic document composition with T E X

Size: px
Start display at page:

Download "Arabic document composition with T E X"

Transcription

1 Arabic document composition with T E X Azzeddine LAZREK University Cadi Ayyad, Faculty of Sciences Department of Computer Science Marrakesh - Morocco lazrek@ucam.ac.ma

2 Outline 1 Arabic text composition Engines Others packages ArabTEX Ω Arabi 2 Mathematical font RyDArab CurExt 3

3 Engines Others packages ArabTEX Ω Arabi Although the typesetting system TEX was originally designed especially for composition of mathematical documents, it has become a very ne system for typesetting documents in many other elds. The Arabic alphabetbased scripts are some of the linguistic contexts where the use of TEX has been progressively adapted. TEX-XET : D. Knuth and P. MacKay, Department of Near Eastern Languages and Civilization in University of Washington, presented the basic principles of a system of mixing LTR and RTL text based on TEX. These principles of bi-directionality and inversion character glyphs are appointed TEX-XET and viewed throw dvi-ivd. Primitives: \beginr, \endr, \beginl, \endl Azzeddine LAZREK 3/25 Arabic document composition with TEX

4 Engines Others packages ArabTEX Ω Arabi TEX--XET : developed by Peter Breitenlohner under the aegis of DANTE. It is an extended of TEX-XET integrated in ε-tex bidi : developed by François Charette as an extended of TEX--XET ε-tex ε-latex 1997: developed as part of the N T S project, supported by DANTE, and compatible with TEX MlTEX : developed by Michael Ferguson XETEX : developed by Jonathan Kew, extended ε-tex Azzeddine LAZREK 4/25 Arabic document composition with TEX

5 RTL capabilities Engines Others packages ArabTEX Ω Arabi TEX programs with Right-to-Left capabilities, knowing the primitives: \beginr, \endr, \beginl, \endl tetex for UNIX/Linux: ε-tex NTEX for UNIX: TEX--XET Web2C for Windows, the default TEX program in MikTEX DirectTEX for Older Mac OS: TEX--XET XETEX for Mac OS X, Linux and Windows TEX Live CD-ROMs distributed by TUG Azzeddine LAZREK 5/25 Arabic document composition with TEX

6 Engines Others packages ArabTEX Ω Arabi Abandante packages Fanton : Michel Fanton, INALCO in France, experimente a system based on TEX-XET ArabicTEX : developed by Y. Halalambous, integrated in DOS-GUT then in Win-GUT GUTenberg interface. ftp://ftp.gutenberg.eu.org/pub/gutenberg/pc/windows/documentation_supp/manuel.ps Al-Amal : developed by Y. Halalambous allowing for bi-directionality. It uses fonts (Naskhi) designed with METAFONT and covering all Arabic characters codied by Unicode. It is a part of ScholarTEX. It is a system comprising: an extension of macros, a series of fonts, a preprocessor, utilities (scholars) and compilers for multiple languages (Arabic, Armenian, Hebrew, Greek,... ). Azzeddine LAZREK 6/25 Arabic document composition with TEX

7 Engines Others packages ArabTEX Ω Arabi Maintened packages Aleph : new engine combines the features of ε-tex and Ω. lamed is the LATEX format using Aleph. bilotta.pdf AlQalam 2006: developed by Hossam A. H. Fahmy ARABXETEX : package developped by François Charette for bidictional text. It provides a convenient ArabTEX-like user-interface for typesetting languages using the Arabic script, with exible access to font features. It can be used in XELATEX and also with any ε-tex-based engine, notably pdflatex. Azzeddine LAZREK 7/25 Arabic document composition with TEX

8 Engines Others packages ArabTEX Ω Arabi RTL script package developed by Klaus Lagally, University of Stuttgart, 1990: Roman transliteration or Arabic encoding: ISO8859-6, UTF-8 Important algorithm for Hamza position Contextual analyze by some TEX macros Confection of Naskh Arabic font Example: Azzeddine LAZREK 8/25 Arabic document composition with TEX

9 Engines Others packages ArabTEX Ω Arabi Unicode enhanced version of TEX developed by John Plaice, University of Sydney, and Yannis Haralambous, Fluxus Virus then ENSTB, 1995: Unicode input Unicode processing Multi-lingual composition Example: Process: Azzeddine LAZREK 9/25 Arabic document composition with TEX

10 Engines Others packages ArabTEX Ω Arabi Arabic and Farsi scripts package developed by Youssef Jabri, Oujda University, 2005: used in any TEX engine supporting RTL directionality accepts input in several 8-bit encodings: UTF-8, ISO8859-6, CP1256 is a pure L A TEX/Babel approach to Arabic typesetting, so contextual analysis is done in the fonts Example: Azzeddine LAZREK 10/25 Arabic document composition with TEX

11 Mathematical font RyDArab CurExt Features of mathematical expressions w.r.t. regular text: Specic symbols:,,,... variable sizes and shapes: a a, a + b, b Bi-dimensional: a c + d, x i, x i, Azzeddine LAZREK 11/25 Arabic document composition with TEX

12 Mathematical font RyDArab CurExt Features of mathematical expressions w.r.t. regular text: Specic symbols:,,,... variable sizes and shapes: a a, a + b, b Bi-dimensional: a c + d, x i, x i, Azzeddine LAZREK 11/25 Arabic document composition with TEX

13 Mathematical font RyDArab CurExt Features of mathematical expressions w.r.t. regular text: Specic symbols:,,,... variable sizes and shapes: a a, a + b, b Bi-dimensional: a c + d, x i, x i, Azzeddine LAZREK 11/25 Arabic document composition with TEX

14 Mathematical font RyDArab CurExt Creating a mathematical font is a complex artistic and technical task RamzArab Design and development of the OpenType font RamzArab that tries to meet, as far as possible, the requirements of: Homogeneity: designing symbols with the same feather for homogeneous shapes, sizes, boldness,... Completeness: regrouping most of the usual specic Arabic symbols in use Originality: observing the most of practical calligraphic rules Azzeddine LAZREK 12/25 Arabic document composition with TEX

15 Mathematical font RyDArab CurExt Unicode The Unicode Standard provides a quite complete set of standard mathematical characters. There is a good deal of symbols found in Arabic mathematical handbooks that are not yet part of the Unicode Standard and can't be obtained through a simple mirroring nor through a simple implementation process. Some of such special characters, designed in RamzArab, are proposed for inclusion into the Unicode Standard and now are on discussion. Until their adoption by Unicode, the symbols used in these tools will be located in the Private Use Area E000-F8FF in the Basic Multilingual Plane. Azzeddine LAZREK 13/25 Arabic document composition with TEX

16 Mathematical font RyDArab CurExt RyDArab system preserves all the distinguished qualities: a full numerical composition a high typographical quality oering several options to adapt it to areas and levels generating documents in several formats (DVI, PS, PDF) producing documents in format HTML with image based mathematical expressions towards transformation from/to OpenMath MathML same commands structure in order to automatist translation (Ar, En, Fr) Azzeddine LAZREK 14/25 Arabic document composition with TEX

17 Mathematical font RyDArab CurExt Conception of CurExt system designing some dynamic symbols. The parentheses and Kashida symbols are representative of the possible cases of horizontal and vertical curvilinear extensibility. So, that can be easily generalized to other variable-sized symbols. A lengthening of the straight line is not in conformity with the Arabic calligraphy rules. Azzeddine LAZREK 15/25 Arabic document composition with TEX

18 Facilitate the use and understanding commands of TEX Composer Arabic documents using only Arabic, and control, characters (ex. \, $) Avoiding problems due to bidirectional lines Azzeddine LAZREK 16/25 Arabic document composition with TEX

19 Facilitate the use and understanding commands of TEX Composer Arabic documents using only Arabic, and control, characters (ex. \, $) Avoiding problems due to bidirectional lines Azzeddine LAZREK 16/25 Arabic document composition with TEX

20 Facilitate the use and understanding commands of TEX Composer Arabic documents using only Arabic, and control, characters (ex. \, $) Avoiding problems due to bidirectional lines Azzeddine LAZREK 16/25 Arabic document composition with TEX

21 The complex and indirect methods are still used. Causes RyDArab et CurExt are not suciently known TEX dicult programming language Language barrier Solution Arabization of TEX DadT E X Azzeddine LAZREK 17/25 Arabic document composition with TEX

22 The complex and indirect methods are still used. Causes RyDArab et CurExt are not suciently known TEX dicult programming language Language barrier Solution Arabization of TEX DadT E X Azzeddine LAZREK 17/25 Arabic document composition with TEX

23 Problems related to the bidi Unicode algorithm: Text selection Character position Semantic Azzeddine LAZREK 18/25 Arabic document composition with TEX

24 Problems related to the bidi Unicode algorithm: Text selection Character position Semantic Azzeddine LAZREK 18/25 Arabic document composition with TEX

25 Problems related to the bidi Unicode algorithm: Text selection Character position Semantic \catcode \Àº =11 \catcode \11=À º Azzeddine LAZREK 18/25 Arabic document composition with TEX

26 Advantages Limits Direct: based to TEX Portable: works on all platforms Extensible: adding new commands Flexible: retranslation commands Supercial: without any change in the kernel and packages Adapted DadTEX to other systems (Arabi) Using RyDArab with Omega into DadTEX Advanced use of Arabic-Indic numbers Localization of lenames Example: Azzeddine LAZREK 19/25 Arabic document composition with TEX

27 Advantages Limits Direct: based to TEX Portable: works on all platforms Extensible: adding new commands Flexible: retranslation commands Supercial: without any change in the kernel and packages Adapted DadTEX to other systems (Arabi) Using RyDArab with Omega into DadTEX Advanced use of Arabic-Indic numbers Localization of lenames Example: Azzeddine LAZREK 19/25 Arabic document composition with TEX

28

29 The End Thank you!

30 ArabT E X example Source Result test1 a + b a b test1 a + b a b test2 test2 Azzeddine LAZREK 22/25 Arabic document composition with TEX 1 1

31 Ω example Arabic text composition Source Result test1 a + b a b test1 a + b a b test2 test2 Azzeddine LAZREK 23/25 Arabic document composition with TEX 1 1

32 Ω process Arabic text composition omega test.tex lambda test.tex oxdvi test.dvi odvips -o test.ps test.dvi gv test.ps TEX L A TEX Azzeddine LAZREK 24/25 Arabic document composition with TEX

33 Arabi example Source Result Azzeddine LAZREK 25/25 Arabic document composition with TEX

34 RyDArab example Source Result Azzeddine LAZREK 26/25 Arabic document composition with TEX

35 CurExt example Source Result Azzeddine LAZREK 27/25 Arabic document composition with TEX

36 DadT E X example Azzeddine LAZREK 28/25 Arabic document composition with TEX

37 RamzArab Arabic mathematical symbols Azzeddine LAZREK 29/25 Arabic document composition with TEX

Multilingual mathematical e-document processing

Multilingual mathematical e-document processing Multilingual mathematical e-document processing Azzeddine LAZREK University Cadi Ayyad, Faculty of Sciences Department of Computer Science Marrakech - Morocco lazrek@ucam.ac.ma http://www.ucam.ac.ma/fssm/rydarab

More information

ISO/IEC JTC 1/SC 2/WG 2 N3086 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1

ISO/IEC JTC 1/SC 2/WG 2 N3086 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1 TP PT Form for PT ISO/IEC JTC 1/SC 2/WG 2 N3086 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1 FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646TP Please fill all the sections A, B and C below. Please

More information

Infrastructure for High-Quality Arabic

Infrastructure for High-Quality Arabic TUG 06 Marrakech Infrastructure for High-Quality Arabic Yannis Haralambous École Nationale Supérieure des Télécommunications de Bretagne Technopôle Brest Iroise, CS 83818, 29238 Brest Cedex TUG 06 Marrakech

More information

Tex with Unicode Characters

Tex with Unicode Characters Tex with Unicode Characters 7/10/18 Presented by: Yuefei Xiang Agenda ASCII Code Unicode Unicode in Tex Old Style Encoding -Inputenc, -ucs Morden Encoding -XeTeX -LuaTeX Unicode bi-direction in Tex -Emacs-AucTeX

More information

How a Font Can Respect Basic Rules of Arabic Calligraphy

How a Font Can Respect Basic Rules of Arabic Calligraphy International Arab Journal of e-technology, Vol. 1, No. 1, January 2009 1 How a Font Can Respect Basic Rules of Arabic Calligraphy Abdelouahad BAYAR Ecole Supérieure de Technologie - Sa Cadi Ayyad University

More information

Nastaleeq: A challenge accepted by Omega

Nastaleeq: A challenge accepted by Omega Nastaleeq: A challenge accepted by Omega Atif Gulzar, Shafiq ur Rahman Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences, Lahore, Pakistan atif dot

More information

Others Symbols, Additional characters proposed to Unicode. Azzeddine Lazrek

Others Symbols, Additional characters proposed to Unicode. Azzeddine Lazrek JTC1/SC2/WG2 N 3088 Others Symbols, Additional characters proposed to Unicode Azzeddine Lazrek lazrek@ucam.ac.ma Cadi Ayyad University, Faculty of Sciences P.O. Box 2390, Marrakech, Morocco Phone: +212

More information

Towards An Operational (La)TEX Package Supporting Optical Scaling of Dynamic Mathematical Symbols

Towards An Operational (La)TEX Package Supporting Optical Scaling of Dynamic Mathematical Symbols Towards An Operational (La)TEX Package Supporting Optical Scaling of Dynamic Mathematical Symbols Abdelouahad BAYAR a.bayar@uca.ma Cadi Ayyad University École Supérieure de Technologie de Safi (High College

More information

ISO/IEC JTC1/SC2/WG2 N2641

ISO/IEC JTC1/SC2/WG2 N2641 ISO/IEC JTC1/SC2/WG2 N2641 2003-10-05 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation internationale de normalisation еждународная организация по

More information

Using Sweave and patchdvi with Japanese text

Using Sweave and patchdvi with Japanese text Using Sweave and patchdvi with Japanese text Duncan Murdoch 27 6 8 The patchdvi package works with Sweave [? ] and document previewers to facilitate editing: it modifies the links that LATEX puts into

More information

AUTOMATING THE GENERATION AND TYPESETTING OF ARABIC SCRIPT

AUTOMATING THE GENERATION AND TYPESETTING OF ARABIC SCRIPT AUTOMATING THE GENERATION AND TYPESETTING OF ARABIC SCRIPT by Sherif Samir Hassan Mansour A Thesis Submitted to the Faculty of Engineering at Cairo University in Partial Fulfillment of the Requirements

More information

Title: Computer support for Signwriting written form of sign language

Title: Computer support for Signwriting written form of sign language Title: Computer support for Signwriting written form of sign language Author: Guylhem Aznar , Patrice Dalle TCI team, IRIT lab Keywords: Sign language, Computer,

More information

AFP Support for TrueType/Open Type Fonts and Unicode

AFP Support for TrueType/Open Type Fonts and Unicode AFP Support for TrueType/Open Type Fonts and Unicode Reinhard Hohensee Distinguished Engineer October 24, 2003 Ricoh Topics What is Unicode? What are TrueType and OpenType fonts? Why have we extended the

More information

A Multidimensional Approach to Typesetting

A Multidimensional Approach to Typesetting A Multidimensional Approach to Typesetting John Plaice Paul Swoboda School of Computer Science and Engineering The University of New South Wales unsw sydney nsw 2052, Australia http://www.cse.unsw.edu.au/

More information

TUGboat, Volume 25 (2004), No This is the default.

TUGboat, Volume 25 (2004), No This is the default. TUGboat, Volume 25 (2004), No. 2 4 RyDArab Typesetting Arabic mathematical expressions Azzeddine Lazrek Abstract TEX was adapted to handle passages of Arabic script some years ago, via packages such as

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

In another markup scheme it will be like as given hereunder:

In another markup scheme it will be like as given hereunder: 1 Introduction 1.1 The Concept of Generic Markup Originally markup was the annotation of manuscripts of a copy editor telling the typesetter how to format the manuscript. It consisted of handwritten notes

More information

Experiences typesetting OpenType math

Experiences typesetting OpenType math Experiences typesetting OpenType math with LuaLaTEX and XeLaTEX Dr. Ulrik Vieth Stuttgart, Germany 4th International ConTEXt Meeting, Brejlow, 2010 Overview of this talk Review of OpenType math support

More information

Infrastructure for High-Quality Arabic Typesetting

Infrastructure for High-Quality Arabic Typesetting Département Informatique, ENST Bretagne, CS 83 818, 29 238 BREST Cedex 3, France yannis.haralambous@enst-bretagne.fr Introduction This paper presents what we consider to be the ideal 1 (or at least a first

More information

uptex Unicode version of ptex with CJK extensions

uptex Unicode version of ptex with CJK extensions uptex Unicode version of ptex with CJK extensions Takuji Tanaka uptex project Oct 26, 2013 Takuji Tanaka (uptex project) uptex Unicode version of ptex with CJK extensions Oct 26, 2013 1 / 42 Outline /

More information

What s new since TEX?

What s new since TEX? Based on Frank Mittelbach Guidelines for Future TEX Extensions Revisited TUGboat 34:1, 2013 Raphael Finkel CS Department, UK November 20, 2013 All versions of TEX Raphael Finkel (CS Department, UK) What

More information

Keyman, LANGIDs & Codepages

Keyman, LANGIDs & Codepages Keyman, LANGIDs & Codepages Interactions you may not expect Peter Constable SIL Non-Roman Script Initiative Copyright 2001 Peter Constable & SIL International In certain situations, Keyman 5 may appear

More information

L A TEX: Eh? What is it, what isn t it, who cares? Andy Caird. LATEX: Eh? p.1/13

L A TEX: Eh? What is it, what isn t it, who cares? Andy Caird. LATEX: Eh? p.1/13 LATEX: Eh? p.1/13 L A TEX: Eh? What is it, what isn t it, who cares? Andy Caird acaird@umich.edu LATEX: Eh? p.2/13 L A TEX It is not a word-processor. LATEX: Eh? p.2/13 L A TEX It is not a word-processor.

More information

Unicode in Education. Adil Allawi Technical Director

Unicode in Education. Adil Allawi Technical Director Unicode in Education Adil Allawi Technical Director adil@diwan.com Why Education and Unicode: Cost of printing and distribution is high - paper is becoming a scarce resource Many classes only need part

More information

Typesetting Thai With LaTeX

Typesetting Thai With LaTeX Typesetting Thai With LaTeX Hin-Tak Leung January 9, 2012 There are three ways of using TX (or more honestly, L A TX 2ε) to typeset Thai. They are X TX (or X L A TX), ThaiL A TX, and cjk/l A TX s Thai

More information

Aesthetical Attributes for Segmenting Arabic Word

Aesthetical Attributes for Segmenting Arabic Word Aesthetical Attributes for Segmenting Arabic Word Mohamed Hssini 1, Azzeddine Lazrek 2 Department of Computer Science, Faculty of Sciences, Cadi Ayyad University Marrakech, Morocco 1 m.hssini@ucam.ac.ma;

More information

PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02

PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02 PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02 TABLE OF CONTENTS Introduction... 2 Input Requirements... 2 Input file... 2 Input File Processing... 2 Commands... 3 Categories of Commands... 4 Formatting Commands...

More information

OpenType Math Illuminated

OpenType Math Illuminated OpenType Math Illuminated Dr. Ulrik Vieth Stuttgart, Germany BachoTEX 2009 Developments in text typesetting Major trends in publishing support for Unicode character sets support for OpenType font technology

More information

OpenType Font by Harsha Wijayawardhana UCSC

OpenType Font by Harsha Wijayawardhana UCSC OpenType Font by Harsha Wijayawardhana UCSC Introduction The OpenType font format is an extension of the TrueType font format, adding support for PostScript font data. The OpenType font format was developed

More information

Edinburgh Research Explorer

Edinburgh Research Explorer dinburgh Research xplorer Open-belly surgery in 2 Citation for published version: Haralambous, Y, Bella, G & Gulzar, A 2006, 'Open-belly surgery in 2' TUGboat, vol. 27, no. 1, pp. 91-97. Link: Link to

More information

SAPGUI for Windows - I18N User s Guide

SAPGUI for Windows - I18N User s Guide Page 1 of 30 SAPGUI for Windows - I18N User s Guide Introduction This guide is intended for the users of SAPGUI who logon to Unicode systems and those who logon to non-unicode systems whose code-page is

More information

Compressed Images in PostScript and PDF with L A TEX HOWTO

Compressed Images in PostScript and PDF with L A TEX HOWTO Compressed Images in PostScript and PDF with L A TEX HOWTO Daniel Käps last revised: 2002-11-24 Abstract It is a common problem that PostScript files (as generated by TEX/L A TEX users) become really large

More information

Digital Typography in the New Millennium: Flexible Documents by a Flexible Engine

Digital Typography in the New Millennium: Flexible Documents by a Flexible Engine Digital Typography in the New Millennium: Flexible Documents by a Flexible Engine Christos K.K. Loverdos 1 and Apostolos Syropoulos 2 1 Department of Informatics and Telecommunications, University of Athens

More information

A multidimensional approach to typesetting

A multidimensional approach to typesetting A multidimensional approach to typesetting John Plaice, Paul Swoboda School of Computer Science and Engineering The University of New South Wales UNSW Sydney NSW 2052, Australia plaice@cse.unsw.edu.au,

More information

New Font Offerings: Cochineal, Nimbus15, LibertinusT1Math

New Font Offerings: Cochineal, Nimbus15, LibertinusT1Math New Font Offerings: Cochineal, Nimbus15, LibertinusT1Math Michael Sharpe, UCSD TUG Toronto, July 2016 Cochineal an oldstyle text font family with Roman, Greek and Cyrillic alphabets derived from Sebastian

More information

mylatexformat Use mylatexformat to make a format based on the preamble of any LaTeX file 2011/02/12 version 3.4

mylatexformat Use mylatexformat to make a format based on the preamble of any LaTeX file 2011/02/12 version 3.4 mylatexformat Use mylatexformat to make a format based on the preamble of any LaTeX file FC 2011/02/12 version 3.4 Abstract mylatexformat can be used to make a format from the preamble of any L A TEX file.

More information

Activity Report at SYSTRAN S.A.

Activity Report at SYSTRAN S.A. Activity Report at SYSTRAN S.A. Pierre Senellart September 2003 September 2004 1 Introduction I present here work I have done as a software engineer with SYSTRAN. SYSTRAN is a leading company in machine

More information

Two distinct code points: DECIMAL SEPARATOR and FULL STOP

Two distinct code points: DECIMAL SEPARATOR and FULL STOP Two distinct code points: DECIMAL SEPARATOR and FULL STOP Dario Schiavon, 207-09-08 Introduction Unicode, being an extension of ASCII, inherited a great historical mistake, namely the use of the same code

More information

An Introduction to. Rado Ivanov CIS400 Senior Design Tutorial September 18, 2014

An Introduction to. Rado Ivanov CIS400 Senior Design Tutorial September 18, 2014 An Introduction to Rado Ivanov CIS400 Senior Design Tutorial September 18, 2014 Today's Outline Introducing TeX/LaTeX Benefits and potential difficulties Installation and use on Unix/Mac/Windows Compiling

More information

2011 Martin v. Löwis. Data-centric XML. Character Sets

2011 Martin v. Löwis. Data-centric XML. Character Sets Data-centric XML Character Sets Character Sets: Rationale Computer stores data in sequences of bytes each byte represents a value in range 0..255 Text data are intended to denote characters, not numbers

More information

preliminary draft, June 15, :57 preliminary draft, June 15, :57

preliminary draft, June 15, :57 preliminary draft, June 15, :57 TUGboat, Volume 0 (9999), No. 0 preliminary draft, June 15, 2018 17:57? 1 FreeType MF Module: A module for using METAFONT directly inside the FreeType rasterizer Jaeyoung Choi, Ammar Ul Hassan and Geunho

More information

2007 Martin v. Löwis. Data-centric XML. Character Sets

2007 Martin v. Löwis. Data-centric XML. Character Sets Data-centric XML Character Sets Character Sets: Rationale Computer stores data in sequences of bytes each byte represents a value in range 0..255 Text data are intended to denote characters, not numbers

More information

Arabic text justification

Arabic text justification Arabic text justification Mohamed Jamal Eddine Benatia, Mohamed Elyaakoubi and Azzeddine Lazrek Department of Computer Science, Faculty of Science, University Cadi Ayyad P.O. Box 2390, Marrakesh, Morocco

More information

Liberate TEX: Progress on Building a New TEX-Language Interpreter

Liberate TEX: Progress on Building a New TEX-Language Interpreter Liberate TEX: Progress on Building a New TEX-Language Interpreter Doug McKenna Mathemaesthetics, Inc. Boulder, Colorado TUG 2014 The TEX Ecosystem Seems Fractured and Forked There s TEX... or ε-tex...

More information

Information technology Keyboard layouts for text and office systems. Part 9: Multi-lingual, multiscript keyboard layouts

Information technology Keyboard layouts for text and office systems. Part 9: Multi-lingual, multiscript keyboard layouts INTERNATIONAL STANDARD ISO/IEC 9995-9 First edition 2016-10-01 Information technology Keyboard layouts for text and office systems Part 9: Multi-lingual, multiscript keyboard layouts Technologies de l

More information

ä + ñ ISO/IEC JTC1/SC2/WG2 N

ä + ñ ISO/IEC JTC1/SC2/WG2 N ISO/IEC JTC1/SC2/WG2 N3727 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation internationale de normalisation Международная организация по стандартизации

More information

Thomas Wolff

Thomas Wolff Mined: An Editor with Extensive Unicode and CJK Support for the Text-based Terminal Environment Thomas Wolff http://towo.net/mined/ towo@computer.org Introduction Many Unicode editors are GUI applications

More information

The univie-ling-paper class

The univie-ling-paper class The univie-ling-paper class Jürgen Spitzmüller Version 1.9, 2019/01/21 Abstract The univie-ling-paper class provides a L A TEX 2ε class suitable for papers (i. e., [Pro]Seminararbeiten) in (Applied) Linguistics

More information

Proposal to Encode Oriya Fraction Signs in ISO/IEC 10646

Proposal to Encode Oriya Fraction Signs in ISO/IEC 10646 Proposal to Encode Oriya Fraction Signs in ISO/IEC 0646 University of Michigan Ann Arbor, Michigan, U.S.A. pandey@umich.edu December 4, 2007 Contents Proposal Summary Form i Introduction 2 Characters Proposed

More information

The Use of Unicode in MARC 21 Records. What is MARC?

The Use of Unicode in MARC 21 Records. What is MARC? # The Use of Unicode in MARC 21 Records Joan M. Aliprand Senior Analyst, RLG What is MARC? MAchine-Readable Cataloging MARC is an exchange format Focus on MARC 21 exchange format An implementation may

More information

The basics of LaTeX. Cédric Buron. April 25, 2016

The basics of LaTeX. Cédric Buron. April 25, 2016 The basics of LaTeX Cédric Buron April 25, 2016 Hello fellows! Today, I introduce a very important language in my PhD, L A TEX(what a logo!). In this article, I m going to introduce very quickly the purpose,

More information

An introduction to TeX. And other options for producing documents containing mathematics

An introduction to TeX. And other options for producing documents containing mathematics An introduction to TeX And other options for producing documents containing mathematics Robbie Robinson Department of Mathematics GWU Washington DC 20052 robinson@gwu.edu 1. What is TeX? Tex is: A typesetting

More information

Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet

Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet Blending Content for South Asian Language Pedagogy Part 2: South Asian Languages on the Internet A. Sean Pue South Asia Language Resource Center Pre-SASLI Workshop 6/7/09 1 Objectives To understand how

More information

CSCE 222 Discrete Structures for Computing. LaTeX. Dr. Hyunyoung Lee. !!!!! Based on slides by Andreas Klappenecker

CSCE 222 Discrete Structures for Computing. LaTeX. Dr. Hyunyoung Lee. !!!!! Based on slides by Andreas Klappenecker CSCE 222 Discrete Structures for Computing LaTeX Dr. Hyunyoung Lee!!!!! Based on slides by Andreas Klappenecker 1 Tripitaka Koreana Palman Daejanggyeong ( Eighty-Thousand Tripitaka ) South Korean collection

More information

The solid pentagram, as found in the top row of the figure above, is also used in mysticism and occult contexts (see figures in section 6).

The solid pentagram, as found in the top row of the figure above, is also used in mysticism and occult contexts (see figures in section 6). L2/09 185R2 TO: UTC and ISO/IEC JTC1/SC2 WG2 FROM: Azzeddine Lazrek and Deborah Anderson (SEI, UC Berkeley) RE: Proposal for PENTAGRAM characters DATE: 14 May 2009 1. Background It has come to our attention

More information

Unicode definition list

Unicode definition list abstract character D3 3.3 2 abstract character sequence D4 3.3 2 accent mark alphabet alphabetic property 4.10 2 alphabetic sorting annotation ANSI Arabic digit 1 Arabic-Indic digit 3.12 1 ASCII assigned

More information

****This proposal has not been submitted**** ***This document is displayed for initial feedback only*** ***This proposal is currently incomplete***

****This proposal has not been submitted**** ***This document is displayed for initial feedback only*** ***This proposal is currently incomplete*** 1 of 5 3/3/2003 1:25 PM ****This proposal has not been submitted**** ***This document is displayed for initial feedback only*** ***This proposal is currently incomplete*** ISO INTERNATIONAL ORGANIZATION

More information

Statistics Statistical Computing Software

Statistics Statistical Computing Software Statistics 135 - Statistical Computing Software Mark E. Irwin Department of Statistics Harvard University Autumn Term Monday, September 19, 2005 - January 2006 Copyright c 2005 by Mark E. Irwin Personnel

More information

You 2 Software

You 2 Software PrismaCards Enter text for languages with exotic fonts You 2 Software http://www.you2.de info@you2.de Introduction To work in PrismaCards and other programs with complex fonts for different languages you

More information

Introduction 1. Chapter 1

Introduction 1. Chapter 1 This PDF file is an excerpt from The Unicode Standard, Version 5.2, issued and published by the Unicode Consortium. The PDF files have not been modified to reflect the corrections found on the Updates

More information

Typesetting CJK languages with Omega

Typesetting CJK languages with Omega Typesetting CJK languages with Omega Jin-Hwan Cho Korean TEX Users Group chofchof@ktug.or.kr Haruhiko Okumura Matsusaka University, 515-8511 Japan okumura@acm.org Abstract This paper describes how to typeset

More information

LaTeX installation guide

LaTeX installation guide How LaTeX works? LaTeX installation guide First, we use a text editor (eg. Texmaker) to create a LaTeX document. Then, we run a LaTeX compiler (we'll be using MiKTeX). Finally, we need to use a viewer/previewer

More information

THIS IS AN INTRODUCTION TO. LaTeX. Introduction to Latex. University of Minnesota, November 7, 2016

THIS IS AN INTRODUCTION TO. LaTeX. Introduction to Latex. University of Minnesota, November 7, 2016 THIS IS AN INTRODUCTION TO LaTeX Introduction to Latex Jimmy Broomfield University of Minnesota, November 7, 2016 1 / Jimmy Broomfield Introduction to LaTeX 1/22 22 Outline Introduction Installation Getting

More information

My LaTeX Document. July 7, Introduction 2

My LaTeX Document. July 7, Introduction 2 My LaTeX Document Me July 7, 2017 Contents 1 Introduction 2 2 Exercises 2 2.1 Piecewise defined function...................... 2 2.2 Greek letters............................. 2 2.3 Limits.................................

More information

Some NT S thoughts. Hans Hagen. The next stage. Some NT S thoughts 1

Some NT S thoughts. Hans Hagen. The next stage. Some NT S thoughts 1 Some NT S thoughts 1 Some NT S thoughts Hans Hagen The next stage When we take a look at Peter Breitenlohner s ε-tex, we see extensions in the spirit of TEX. Based on experiences with macro writing, some

More information

Written & Oral Presentation: Computer Tools

Written & Oral Presentation: Computer Tools Written & Oral Presentation: Computer Tools Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 Course MATH-GA.2840-004, Spring 2018 February 7th, 2018 A. Donev (Courant Institute) Tools

More information

T E X Gyre Pagella Math

T E X Gyre Pagella Math T E X Gyre Pagella Math or Misfortunes of Math Typographer Bachotek 29IV 3V2012 B. Jackowski, P. Strzelczyk and J. M. Nowacki What is a math font? What is a math font? Naïve answer: it is a font that contains

More information

Form number: N2352-F (Original ; Revised , , , , , , ) N2352-F Page 1 of 7

Form number: N2352-F (Original ; Revised , , , , , , ) N2352-F Page 1 of 7 ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 1 Please fill all the sections A, B and C below. (Please read Principles and Procedures

More information

What is T E X? T E X and L A T E X Document preparation tools. Setting and casting type. What Knuth was setting. Setting and casting type

What is T E X? T E X and L A T E X Document preparation tools. Setting and casting type. What Knuth was setting. Setting and casting type T E X and L A T E X Document preparation tools This lecture will introduce software necessary to produce documents using L A T E X in the School of Computer Science. It will also show the basics of producing

More information

PROPOSAL SUBMISSION FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

PROPOSAL SUBMISSION FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 PROPOSAL SUBMISSION FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 A. Administrative 1. Title: Proposal to add Archaic Mediterranean Script block to ISO 10646 2. Requester's

More information

Proposal to Encode the Ganda Currency Mark for Bengali in the BMP of the UCS

Proposal to Encode the Ganda Currency Mark for Bengali in the BMP of the UCS Proposal to Encode the Ganda Currency Mark for Bengali in the BMP of the UCS University of Michigan Ann Arbor, Michigan, U.S.A. pandey@umich.edu May 21, 2007 1 Introduction This is a proposal to encode

More information

On the use of symbolic markup in the Production of a Multi-Lingual Dictionary 1

On the use of symbolic markup in the Production of a Multi-Lingual Dictionary 1 On the use of symbolic markup in the Production of a Multi-Lingual Dictionary 1 Klaus Lagally 2 Institut für Informatik, Universität Stuttgart Breitwiesenstraße 20-22, 70565 Stuttgart, GERMANY EMail: lagally@informatik.uni-stuttgart.de

More information

10 years of OpenType math font development

10 years of OpenType math font development Overview Choices of math fonts Completeness Design consistency Testing Summary 10 years of OpenType math font development Ulrik Vieth Stuttgart, Germany TUG@BachoT E X 2017 Brodnica, Poland Overview Choices

More information

Proposal to Add Four SENĆOŦEN Latin Charaters

Proposal to Add Four SENĆOŦEN Latin Charaters L2/04-170 Proposal to Add Four SENĆOŦEN Latin Charaters by: John Elliot, Peter Brand, and Chris Harvey of: Saanich Native Heritage Society and First Peoples' Cultural Foundation Date: May 5, 2004 The SENĆOŦEN

More information

25 Years of TEX and METAFONT

25 Years of TEX and METAFONT TUG 2003 keynote address... p. 1/?? Years of TEX and METAFONT Looking Back and Looking Forward Nelson H. F. Beebe Department of Mathematics University of Utah Salt Lake City, UT 84112-0090 USA TUG 2003

More information

Title: Application to include Arabic alphabet shapes to Arabic 0600 Unicode character set

Title: Application to include Arabic alphabet shapes to Arabic 0600 Unicode character set Title: Application to include Arabic alphabet shapes to Arabic 0600 Unicode character set Action: For consideration by UTC and ISO/IEC JTC1/SC2/WG2 Author: Mohammad Mohammad Khair Date: 17-Dec-2018 Introduction:

More information

Rendering in Dzongkha

Rendering in Dzongkha Rendering in Dzongkha Pema Geyleg Department of Information Technology pema.geyleg@gmail.com Abstract The basic layout engine for Dzongkha script was created with the help of Mr. Karunakar. Here the layout

More information

About BasicTeX Richard Koch. June 5, 2017

About BasicTeX Richard Koch. June 5, 2017 About BasicTeX-2017 Richard Koch June 5, 2017 1 Introduction Most TeX distributions for Mac OS X are based on TeX Live, the reference edition of TeX produced by TeX User Groups across the world. Among

More information

Help with PDF Files Is there a way someone else can do this for me? What Software Do I Need to Create PDF Files?

Help with PDF Files Is there a way someone else can do this for me? What Software Do I Need to Create PDF Files? Help with PDF Files Portable Document Format (PDF) is commonly used to distribute documents online because it allows readers on different platforms (e.g. Windows, Mac and UNIX) to view files as originally

More information

Detailed Contents for TEX Unbound: Strategies for Font, Graphics, and More

Detailed Contents for TEX Unbound: Strategies for Font, Graphics, and More Bijlage L Contents TEX Unbound 41 Detailed Contents for TEX Unbound: Strategies for Font, Graphics, and More Alan Hoenig Abstract This book will be published by Oxford University Press in early 1997. Contact

More information

JTC1/SC2/WG

JTC1/SC2/WG JTC1/SC2/WG2 2016-07-11 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации

More information

A. Administrative. B. Technical General

A. Administrative. B. Technical General ISO/IEC JTC1/SC2/WG2 N2241 2000-08-27 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation internationale de normalisation еждународная организация по

More information

TECkit version 2.0 A Text Encoding Conversion toolkit

TECkit version 2.0 A Text Encoding Conversion toolkit TECkit version 2.0 A Text Encoding Conversion toolkit Jonathan Kew SIL Non-Roman Script Initiative (NRSI) Abstract TECkit is a toolkit for encoding conversions. It offers a simple format for describing

More information

Indian T E X Users Group

Indian T E X Users Group Indian T E X Users Group URL: http://www.river-valley.com/tug On-line Tutorial on L A T E X The Tutorial Team Indian T E X Users Group, sjp Buildings, Cotton Hills Trivandrum 695014, india 2000 Prof. (Dr.)

More information

Refinement of digitized documents through recognition of mathematical formulae

Refinement of digitized documents through recognition of mathematical formulae Refinement of digitized documents through recognition of mathematical formulae Toshihiro KANAHORI Research and Support Center on Higher Education for the Hearing and Visually Impaired, Tsukuba University

More information

Groundtruth Image Generation from Electronic Text (Demonstration)

Groundtruth Image Generation from Electronic Text (Demonstration) Groundtruth Image Generation from Electronic Text (Demonstration) David Doermann and Gang Zi Laboratory for Language and Media Processing, University of Maryland, College Park MD 21043, USA {doermann,

More information

VERY VERY SHORT GUIDE TO LATEX

VERY VERY SHORT GUIDE TO LATEX - a System VERY VERY SHORT GUIDE TO LATEX School of Mathematics September 2017 - a System Advantages... Disadvantages... The Basics THE BASIC IDEA A word processor allows the user to design the document

More information

Volume 27, Number 2 / 2006 TUG 2006 Conference Proceedings

Volume 27, Number 2 / 2006 TUG 2006 Conference Proceedings TUGBOAT Volume 27, Number 2 / 2006 TUG 2006 Conference Proceedings General Delivery 110 Karl Berry / From the president 111 Barbara Beeton / Editorial comments TUG2006; Chuck Bigelow goes to RIT; DEK in

More information

TUGboat, Volume 32 (2011), No. 1 17

TUGboat, Volume 32 (2011), No. 1 17 TUGboat, Volume 32 (2011), No. 1 17 TUGboat s 100 issues Basic statistics and random gleanings Abstract TUG was founded to provide an organization for people who are interested in typography and font design,

More information

Electronic Publishing and Document Processing Advanced Topics in Computer Science(CS ) Home Page Spring 1997 Course Outline/Syllabus

Electronic Publishing and Document Processing Advanced Topics in Computer Science(CS ) Home Page Spring 1997 Course Outline/Syllabus Electronic Publishing and Document Processing Advanced Topics in Computer Science(CS 236605) Home Page Spring 1997 Course Outline/Syllabus LECTURES: Wednesdays 14:30-16:20 Fishbach 507 (Main Multimedia

More information

The hanging package. Contents. 1 Introduction

The hanging package. Contents. 1 Introduction The hanging package Author: Peter Wilson, Herries Press Maintainer: Will Robertson will dot robertson at latex-project dot org 2009/09/02 Abstract The hanging package provides facilities for defining hanging

More information

Java Multilingual Elementary Tool

Java Multilingual Elementary Tool November 28, 2004 Outline Designing Outline Multilingual system: refer to computer programs which permit user interaction with the computer in one or more languages A Java multilingual elementary tool

More information

Conversion of TEX fonts into Type 1 format

Conversion of TEX fonts into Type 1 format Conversion of TEX fonts into Type 1 format Szabó Péter September 2001 presented at EuroTEX 2001 Page 1 of 12 The problem: Acrobat Reader renders most fonts slowly and unreadably ugly on

More information

UC Irvine Unicode Project

UC Irvine Unicode Project UC Irvine Unicode Project Title A proposal to encode New Testament editorial characters in the UCS Permalink https://escholarship.org/uc/item/6r10d7w1 Authors Pantelia, Maria C. Peevers, Richard Publication

More information

136 TUGboat, Volume 39 (2018), No. 2

136 TUGboat, Volume 39 (2018), No. 2 136 TUGboat, Volume 39 (2018), No. 2 FreeType MF Module: A module for using METAFONT directly inside the FreeType rasterizer Jaeyoung Choi, Ammar Ul Hassan, Geunho Jeong Abstract METAFONT is a font description

More information

John H. Jenkins If available now, identify source(s) for the font (include address, , ftp-site, etc.) and indicate the tools used:

John H. Jenkins If available now, identify source(s) for the font (include address,  , ftp-site, etc.) and indicate the tools used: ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 1 Please fill all the sections A, B and C below. Please read Principles and Procedures

More information

Google Search Appliance

Google Search Appliance Google Search Appliance Search Appliance Internationalization Google Search Appliance software version 7.2 and later Google, Inc. 1600 Amphitheatre Parkway Mountain View, CA 94043 www.google.com GSA-INTL_200.01

More information

Extensible Rendering for Complex Writing Systems

Extensible Rendering for Complex Writing Systems Extensible Rendering for Complex Writing Systems Sharon Correll SIL International 1 Introduction Those needing to work with multilingual text, particularly using any kind of complex script, commonly run

More information

Additional Mathematical and Letterlike Characters

Additional Mathematical and Letterlike Characters Additional Mathematical and Letterlike Characters ISO/IEC JTC1/SC2/WG2 N2590 Date: August 23, 2003 Source: The Unicode Consortium Status: Liaison contribution Action: For consideration by WG2 in preparing

More information

SYSTEM PROGRAMMING & SYSTEM ADMINISTRATION SECTION -A

SYSTEM PROGRAMMING & SYSTEM ADMINISTRATION SECTION -A LECTURE 1 SYSTEM PROGRAMMING & SYSTEM ADMINISTRATION SECTION -A REFERENCES: SYSTEM PROGRAMMING BY JOHN J. DONOVAN (TMH EDITION) & GOOGLE SEARCH ENGINE GROUP DISCUSSION 1. About the Syllabus and how are

More information