Tex with Unicode Characters

Size: px
Start display at page:

Download "Tex with Unicode Characters"

Transcription

1 Tex with Unicode Characters 7/10/18 Presented by: Yuefei Xiang

2 Agenda ASCII Code Unicode Unicode in Tex Old Style Encoding -Inputenc, -ucs Morden Encoding -XeTeX -LuaTeX Unicode bi-direction in Tex -Emacs-AucTeX -XSL-FO!2

3 ASCII Code Basic ASCII(0-127) Input just by keyboard!3

4 Cork Encoding Cork encoding(0-256) \usepackage[t1]{fontenc} \charnumber Eg. -> \char34 Č -> \char268!4

5 Unicode 0-32, control codes backward compatible ASCII codes Latin-1 Latin extended Greek extended, Cyrillic!5

6 Unicode in TeX

7 Unicode in TeX None-math mode symbols:!7

8 Unicode in TeX He said Hello to Dan -> He said Hello to Dan.!8

9 Unicode in TeX He said Hello to Dan -> He said Hello to Dan. He said ''Hello'' to Dan -> He said Hello to Dan!9

10 Unicode in TeX He said Hello to Dan -> He said Hello to Dan. He said ''Hello'' to Dan -> He said Hello to Dan He said `` Hello ' ' to Dan -> He said Hello to Dan.!10

11 Unicode in TeX math mode symbols:!11

12 Unicode in TeX math mode symbols: $abc$ Means abc is in math mode eg. -> $\sum$!12

13 Unicode Latin, Greek Various encoding schema GBK(Chinese) Kanji(Japanese) KOI8-RU(Russian) Only small part of characters are supported by original Donald Knuth's TeX How to deal with other characters?!13

14 Unicode in TeX old-style old-style Modern Tex (DVI) pdftex (PDF) LuaTex (PDF) Modern XeTex (XDV)!14

15 Old Style Inputenc(utf8) This package is of vital importance if you use non-ascii characters in your document. For example, if you type the word Ángela, the Á character will not appear correctly in the output. To solve this problem, use: Universal Code Character Set(utf8x) More mathematical symbols are supported, for example,,,,, and several kinds of arrows. Double-struck letters (A, B, etc.) are now typeset using \mathbb instead of \mathbbm. The handling of small greek letters in math mode has been improved.!15

16 Old Style Universal Code Character Set(utf8x)!16

17 XeTex Adaptation for Mac OS X Allowed to use UTF-8 and UTF-16(with UTF-16 Byte Order Mark found) Convert automatically Read UTF-8 input files Byte sequences Unicode Without addition TeX macros eg. Inputenc package!17

18 XeTex Mac OS Roman!18

19 XeTex \XeTeXinputencoding "charset-name" "mac" (Mac OS Roman), "cp1252" (Windows codepage 1252, Western European), "shift-jis" (e.g., for Japanese)!19

20 XeTex- Hyphenation support The hyphenation tables in XeTeX have been extended to support 16-bit Unicode characters. Hyphenation patterns using Devanagari letters!20

21 LuaTex- callbacks Common used callbacks!21

22 LuaTex- callbacks mapping a byte to its numerical representation in Unicode u is 253 in Latin-3 turning that representation into unicode byte 0x016D (365) in Unicode!22

23 Unicode bi-direction in Tex Emacs- AUCTex Texmaker/TexStudio(support has been removed, not enough stable) XSL-FO!23

24 Emacs Emacs(Editing macros) editing environment/text work environment. GNU Emacs created by Richard Stallman AucTEX: collection of Emacs functions supporting modes for editing Tex documents!24

25 AucTEX Set buffer-local variables bidi-display-reordering (controls whether text in the buffer is reordered for display) Value = non-nil(right-ti-left) Default Value = t(left-to-right) bidi-paragraph-start-re & bidi-paragraph-separate-reto (truncate paragraphs) Value = ^ bidi-paragraph-direction (dynamic determination of the base direction) Value = right-to-left/left-to-right Default Value = non-nil(disable the dynamic determination of the base direction)!25

26 AucTEX Set formatting characters RIGHT-TO-LEFT MARK / rlm forces the right-to-left direction on the following paragraph LEFT-TO-RIGHT MARK / lrm forces the left-to-right direction on the following paragraph!26

27 XSL-FO!27

28 XSL-FO xml-like syntax writing-mode attribute: lr-tb: for left-to-right, top-to-bottom (the default value), suitable for languages such as English, Polish rl-tb: for right-to-left, top-to-bottom, suitable for languages such as Hebrew and Arabic tb-rl: for top-to-bottom, right-to-left, suitable for traditional Japanese lr-alternating-rl-bt, lr-inverting-rl-bt!28

29 THANK YOU!29

UTF and Turkish. İstinye University. Representing Text

UTF and Turkish. İstinye University. Representing Text Representing Text Representation of text predates the use of computers for text Text representation was needed for communication equipment One particular commonly used communication equipment was teleprinter

More information

Representing Characters and Text

Representing Characters and Text Representing Characters and Text cs4: Computer Science Bootcamp Çetin Kaya Koç cetinkoc@ucsb.edu Çetin Kaya Koç http://koclab.org Winter 2018 1 / 28 Representing Text Representation of text predates the

More information

Can R Speak Your Language?

Can R Speak Your Language? Languages Can R Speak Your Language? Brian D. Ripley Professor of Applied Statistics University of Oxford ripley@stats.ox.ac.uk http://www.stats.ox.ac.uk/ ripley The lingua franca of computing is (American)

More information

uptex Unicode version of ptex with CJK extensions

uptex Unicode version of ptex with CJK extensions uptex Unicode version of ptex with CJK extensions Takuji Tanaka uptex project Oct 26, 2013 Takuji Tanaka (uptex project) uptex Unicode version of ptex with CJK extensions Oct 26, 2013 1 / 42 Outline /

More information

Representing Characters, Strings and Text

Representing Characters, Strings and Text Çetin Kaya Koç http://koclab.cs.ucsb.edu/teaching/cs192 koc@cs.ucsb.edu Çetin Kaya Koç http://koclab.cs.ucsb.edu Fall 2016 1 / 19 Representing and Processing Text Representation of text predates the use

More information

2011 Martin v. Löwis. Data-centric XML. Character Sets

2011 Martin v. Löwis. Data-centric XML. Character Sets Data-centric XML Character Sets Character Sets: Rationale Computer stores data in sequences of bytes each byte represents a value in range 0..255 Text data are intended to denote characters, not numbers

More information

2007 Martin v. Löwis. Data-centric XML. Character Sets

2007 Martin v. Löwis. Data-centric XML. Character Sets Data-centric XML Character Sets Character Sets: Rationale Computer stores data in sequences of bytes each byte represents a value in range 0..255 Text data are intended to denote characters, not numbers

More information

Character Encodings. Fabian M. Suchanek

Character Encodings. Fabian M. Suchanek Character Encodings Fabian M. Suchanek 22 Semantic IE Reasoning Fact Extraction You are here Instance Extraction singer Entity Disambiguation singer Elvis Entity Recognition Source Selection and Preparation

More information

Unicode and Non Unicode Printing with the Swiss 721 Font

Unicode and Non Unicode Printing with the Swiss 721 Font Unicode and Non Unicode Printing with the Swiss 721 Font There are many methods of printing international characters with Unicode fonts on a Zebra printer. We offer a free Swiss 721 font with 983 characters

More information

SAPGUI for Windows - I18N User s Guide

SAPGUI for Windows - I18N User s Guide Page 1 of 30 SAPGUI for Windows - I18N User s Guide Introduction This guide is intended for the users of SAPGUI who logon to Unicode systems and those who logon to non-unicode systems whose code-page is

More information

Practical character sets

Practical character sets Practical character sets In MySQL, on the web, and everywhere Domas Mituzas MySQL @ Sun Microsystems Wikimedia Foundation It seems simple a b c d e f a ą b c č d e ę ė f а б ц д е ф פ ע ד צ ב א... ---...

More information

What s new since TEX?

What s new since TEX? Based on Frank Mittelbach Guidelines for Future TEX Extensions Revisited TUGboat 34:1, 2013 Raphael Finkel CS Department, UK November 20, 2013 All versions of TEX Raphael Finkel (CS Department, UK) What

More information

COM Text User Manual

COM Text User Manual COM Text User Manual Version: COM_Text_Manual_EN_V2.0 1 COM Text introduction COM Text software is a Serial Keys emulator for Windows Operating System. COM Text can transform the Hexadecimal data (received

More information

An Introductive Presentation of XSL-FO. Jean-Michel HUFFLEN LIFC University of Franche-Comté GUIT meeting, 15th October 2011

An Introductive Presentation of XSL-FO. Jean-Michel HUFFLEN LIFC University of Franche-Comté GUIT meeting, 15th October 2011 An Introductive Presentation of XSL-FO Jean-Michel HUFFLEN LIFC University of Franche-Comté GUIT meeting, 15th October 2011 1 Contents Using xsl-fo Basic elements Multilingual capabilities Page model Orientation

More information

Routine Routine/ Minor/ Moderate/ Serious / Major/ Critical

Routine Routine/ Minor/ Moderate/ Serious / Major/ Critical Product S84xxse Other Products affected Version affected 17.00.01.00 Bulletin Category Firmware. Host software/ Printer /Firmware/ Communication/ Part issue/ Other Severity rating Bulletin Name Issued

More information

Friendly Fonts for your Design

Friendly Fonts for your Design Friendly Fonts for your Design Choosing the right typeface for your website copy is important, since it will affect the way your readers perceive your page (serious and formal, or friendly and casual).

More information

Bookmarks for PDF Output(Outline-Group)

Bookmarks for PDF Output(Outline-Group) Bookmarks for PDF Output(Outline-Group) The axf:outline-group groups bookmark items of PDF, and outputs them collectively. Value: Initial: empty string Applies to: block-level formatting objects

More information

TECkit version 2.0 A Text Encoding Conversion toolkit

TECkit version 2.0 A Text Encoding Conversion toolkit TECkit version 2.0 A Text Encoding Conversion toolkit Jonathan Kew SIL Non-Roman Script Initiative (NRSI) Abstract TECkit is a toolkit for encoding conversions. It offers a simple format for describing

More information

L A TEX: Eh? What is it, what isn t it, who cares? Andy Caird. LATEX: Eh? p.1/13

L A TEX: Eh? What is it, what isn t it, who cares? Andy Caird. LATEX: Eh? p.1/13 LATEX: Eh? p.1/13 L A TEX: Eh? What is it, what isn t it, who cares? Andy Caird acaird@umich.edu LATEX: Eh? p.2/13 L A TEX It is not a word-processor. LATEX: Eh? p.2/13 L A TEX It is not a word-processor.

More information

Introduction to Normalization and Modern Collation

Introduction to Normalization and Modern Collation Introduction to Normalization and Modern Collation Roozbeh Pournader Sharif FarsiWeb, Inc. roozbeh@farsiweb.info The gap that needed filling For compatibility reasons, Unicode has more than one way to

More information

Using Sweave and patchdvi with Japanese text

Using Sweave and patchdvi with Japanese text Using Sweave and patchdvi with Japanese text Duncan Murdoch 27 6 8 The patchdvi package works with Sweave [? ] and document previewers to facilitate editing: it modifies the links that LATEX puts into

More information

CP-147 Date 1999/01/30. Name of Standard: PS 3.3,

CP-147 Date 1999/01/30. Name of Standard: PS 3.3, Correction Number: CP-147 Log Summary: Type of Modification: Clarification Name of Standard: PS 3.3, 3.5-1998 Rationale for Correction: 1. Correct typos in Section 2: Normative references. 2. Some references

More information

PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02

PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02 PLATYPUS FUNCTIONAL REQUIREMENTS V. 2.02 TABLE OF CONTENTS Introduction... 2 Input Requirements... 2 Input file... 2 Input File Processing... 2 Commands... 3 Categories of Commands... 4 Formatting Commands...

More information

Thomas Wolff

Thomas Wolff Mined: An Editor with Extensive Unicode and CJK Support for the Text-based Terminal Environment Thomas Wolff http://towo.net/mined/ towo@computer.org Introduction Many Unicode editors are GUI applications

More information

CSS3 Text Extensions. 1 Summary. 2 Contents. Michel Suignard. Microsoft Corporation

CSS3 Text Extensions. 1 Summary. 2 Contents. Michel Suignard. Microsoft Corporation Michel Suignard Microsoft Corporation 1 Summary This document presents new text extensions considered for CSS3 (Cascading Style Sheet). The main topics presented are layout flow, text justification, baseline

More information

PrecisionID ITF Barcode Fonts User Manual

PrecisionID ITF Barcode Fonts User Manual PrecisionID ITF Barcode Fonts User Manual Updated 2018 Copyright 2018 PrecisionID.com All Rights Reserved Legal Notices Page 1 PrecisionID ITF (Interleaved 2 of 5) Barcode Font User Manual Notice: When

More information

1 The Cyrillic font encodings: T2A, T2B, T2C, and X2

1 The Cyrillic font encodings: T2A, T2B, T2C, and X2 1 The Cyrillic font encodings: T2A, T2B, T2C, and X2 Since the number of Cyrillic glyphs exceeds the limit for a T encoding, it is necessary to create multiple glyph containers. The output encodings T2A,

More information

Multilingual mathematical e-document processing

Multilingual mathematical e-document processing Multilingual mathematical e-document processing Azzeddine LAZREK University Cadi Ayyad, Faculty of Sciences Department of Computer Science Marrakech - Morocco lazrek@ucam.ac.ma http://www.ucam.ac.ma/fssm/rydarab

More information

Arabic document composition with T E X

Arabic document composition with T E X Arabic document composition with T E X Azzeddine LAZREK University Cadi Ayyad, Faculty of Sciences Department of Computer Science Marrakesh - Morocco lazrek@ucam.ac.ma http://www.ucam.ac.ma/fssm/rydarab

More information

Math Typesetting and Authoring

Math Typesetting and Authoring Math Typesetting and Authoring Michael Leslie Brooks July 3, 2018 University of Waterloo CS 846 Advanced Topics in Electronic Publishing 1 Agenda First half: Understand how a formatter organizes 2D arrangements

More information

Introduction to CS Dealing with tables in Word Jacek Wiślicki, Laurent Babout,

Introduction to CS Dealing with tables in Word Jacek Wiślicki, Laurent Babout, Most word processors offer possibility to draw and format even very sophisticated tables. A table consists of rows and columns, forming cells. Cells can be split and merged together. Content of each cell

More information

XML is a popular multi-language system, and XHTML depends on it. XML details languages

XML is a popular multi-language system, and XHTML depends on it. XML details languages 1 XML XML is a popular multi-language system, and XHTML depends on it XML details languages XML 2 Many of the newer standards, including XHTML, are based on XML = Extensible Markup Language, so we will

More information

LBSC 690: Information Technology Lecture 05 Structured data and databases

LBSC 690: Information Technology Lecture 05 Structured data and databases LBSC 690: Information Technology Lecture 05 Structured data and databases William Webber CIS, University of Maryland Spring semester, 2012 Interpreting bits "my" 13.5801 268 010011010110 3rd Feb, 2014

More information

Location of Talk/Slides/Software/Demos

Location of Talk/Slides/Software/Demos Implementing Better Source Editing for Bidirectional HTML and XML in the Text Editor 35 th Internationalization and Unicode Conference October 18, 2011 Shunsuke Oshima Martin J. Dürst Aoyama Gakuin University,

More information

Blaise Team IBUC, April 24, 2012

Blaise Team IBUC, April 24, 2012 Blaise Team IBUC, April 24, 2012 Additions to Blaise since IBUC Baltimore Blaise 4.8.3 Support for non-western languages CATI extensions CAWI extensions CARI extensions... Blaise 4.8.4 Accessibility (Section

More information

Conversion of Cyrillic script to Score with SipXML2Score Author: Jan de Kloe Version: 2.00 Date: June 28 th, 2003, last updated January 24, 2007

Conversion of Cyrillic script to Score with SipXML2Score Author: Jan de Kloe Version: 2.00 Date: June 28 th, 2003, last updated January 24, 2007 Title: Conversion of Cyrillic script to Score with SipXML2Score Author: Jan de Kloe Version: 2.00 Date: June 28 th, 2003, last updated January 24, 2007 Scope There is no limitation in MusicXML to the encoding

More information

Japanese utf 8 font. Japanese utf 8 font.zip

Japanese utf 8 font. Japanese utf 8 font.zip Japanese utf 8 font Japanese utf 8 font.zip 22/11/2010 Japanese: 私はガラスを (Literal UTF-8) Representing Middle English on the Web with UTF-8; The Kermit Bibliography (in UTF-8)What I'd like to do is save

More information

Multilingual vi Clones: Past, Now and the Future

Multilingual vi Clones: Past, Now and the Future THE ADVANCED COMPUTING SYSTEMS ASSOCIATION The following paper was originally published in the Proceedings of the FREENIX Track: 1999 USENIX Annual Technical Conference Monterey, California, USA, June

More information

Network Working Group. Category: Informational ETL December ISO-2022-JP-2: Multilingual Extension of ISO-2022-JP

Network Working Group. Category: Informational ETL December ISO-2022-JP-2: Multilingual Extension of ISO-2022-JP Network Working Group Request for Comments: 1554 Category: Informational M. Ohta Tokyo Institute of Technology K. Handa ETL December 1993 Status of this Memo ISO-2022-JP-2: Multilingual Extension of ISO-2022-JP

More information

IDN - what s up? Patrik Fältström

IDN - what s up? Patrik Fältström IDN - what s up? Patrik Fältström paf@cisco.com 1 Old stuff (what is IDNA) What is it? What implications do we get? IDNA uses Unicode 3.2 2 Protocol issues Old protocols can only handle a subset of US-

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CMPS Introduction to Computer Science Lecture Notes Binary Numbers Until now we have considered the Computing Agent that executes algorithms to be an abstract entity. Now we will be concerned with techniques

More information

DESCRIPTION wvabw converts Word documents into Abiword s native format. See for more information.

DESCRIPTION wvabw converts Word documents into Abiword s native format. See  for more information. wvabw(1) wvabw(1) wvhtml convert msword documents to Abiword s format wvtext in_word_doc out_text_doc wvabw converts Word documents into Abiword s native format. See http://www.abisource.com/ for more

More information

Development of. TeXShop. - The Past and the Future Yusuke Terada. Tetsuryokukai (鉄緑会)

Development of. TeXShop. - The Past and the Future Yusuke Terada. Tetsuryokukai (鉄緑会) Development of TeXShop - The Past and the Future Yusuke Terada Tetsuryokukai (鉄緑会) Summary 1. The history of TeXShop! 2. TeXShop s features equipped for editing Japanese documents! 3. The future of TeXShop

More information

The Unicode Standard Version 11.0 Core Specification

The Unicode Standard Version 11.0 Core Specification The Unicode Standard Version 11.0 Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers

More information

Casabac Unicode Support

Casabac Unicode Support Unicode Support Unicode Support Full Unicode support was added into the GUI Server with build 25_20040105. Before ISO 8859-1 was used for encoding and decoding HTML pages and your system's default encoding

More information

III-16Text Encodings. Chapter III-16

III-16Text Encodings. Chapter III-16 Chapter III-16 III-16Text Encodings Overview... 410 Text Encoding Overview... 410 Text Encodings Commonly Used in Igor... 411 Western Text Encodings... 412 Asian Text Encodings... 412 Unicode... 412 Unicode

More information

1 sur 26 13/12/2011 15:18 2 sur 26 13/12/2011 15:18 Introduction The File API provides an interface with the server's OS File system. It allows you to handle files and folders as JavaScript objects using

More information

Picsel epage. PowerPoint file format support

Picsel epage. PowerPoint file format support Picsel epage PowerPoint file format support Picsel PowerPoint File Format Support Page 2 Copyright Copyright Picsel 2002 Neither the whole nor any part of the information contained in, or the product described

More information

Basics of the Unicode BiDirectional Algorithm (UBDA)

Basics of the Unicode BiDirectional Algorithm (UBDA) Basics of the Unicode BiDirectional Algorithm (UBDA) The formatting system implied in (and used to typeset) the slides for the brief bidirectional text reading lesson assumes: Every character has a direction,

More information

Chapter 4: Computer Codes. In this chapter you will learn about:

Chapter 4: Computer Codes. In this chapter you will learn about: Ref. Page Slide 1/30 Learning Objectives In this chapter you will learn about: Computer data Computer codes: representation of data in binary Most commonly used computer codes Collating sequence Ref. Page

More information

CS144: Content Encoding

CS144: Content Encoding CS144: Content Encoding MIME (Multi-purpose Internet Mail Extensions) Q: Only bits are transmitted over the Internet. How does a browser/application interpret the bits and display them correctly? MIME

More information

D16 Code sets, NLS and character conversion vs. DB2

D16 Code sets, NLS and character conversion vs. DB2 D16 Code sets, NLS and character conversion vs. DB2 Roland Schock ARS Computer und Consulting GmbH 05.10.2006 11:45 a.m. 12:45 p.m. Platform: DB2 for Linux, Unix, Windows Code sets and character conversion

More information

Google Search Appliance

Google Search Appliance Google Search Appliance Search Appliance Internationalization Google Search Appliance software version 7.2 and later Google, Inc. 1600 Amphitheatre Parkway Mountain View, CA 94043 www.google.com GSA-INTL_200.01

More information

Using non-latin alphabets in Blaise

Using non-latin alphabets in Blaise Using non-latin alphabets in Blaise Rob Groeneveld, Statistics Netherlands 1. Basic techniques with fonts In the Data Entry Program in Blaise, it is possible to use different fonts. Here, we show an example

More information

Infrastructure for High-Quality Arabic

Infrastructure for High-Quality Arabic TUG 06 Marrakech Infrastructure for High-Quality Arabic Yannis Haralambous École Nationale Supérieure des Télécommunications de Bretagne Technopôle Brest Iroise, CS 83818, 29238 Brest Cedex TUG 06 Marrakech

More information

Part III: Survey of Internet technologies

Part III: Survey of Internet technologies Part III: Survey of Internet technologies Content (e.g., HTML) kinds of objects we re moving around? References (e.g, URLs) how to talk about something not in hand? Protocols (e.g., HTTP) how do things

More information

SAP NetWeaver BI. Unicode Compliance. Product Management SAP NetWeaver BI. Version 7.0 December, 2008

SAP NetWeaver BI. Unicode Compliance. Product Management SAP NetWeaver BI. Version 7.0 December, 2008 SAP NetWeaver BI Unicode Compliance Product Management SAP NetWeaver BI Version 7.0 December, 2008 Agenda 1. Unicode in General 2. Excursus: MDMP 3. Unicode support of SAP NetWeaver BI 4. Interfaces to

More information

Lecture 25: Internationalization. UI Hall of Fame or Shame? Today s Topics. Internationalization Design challenges Implementation techniques

Lecture 25: Internationalization. UI Hall of Fame or Shame? Today s Topics. Internationalization Design challenges Implementation techniques Lecture 25: Internationalization Spring 2008 6.831 User Interface Design and Implementation 1 UI Hall of Fame or Shame? Our Hall of Fame or Shame candidate for the day is this interface for choosing how

More information

Wordman s Production Corner

Wordman s Production Corner Wordman s Production Corner By Dick Eassom, AF.APMP Three Word Tricks...Fractions, Diacritics, and Gibberish The Problems The first trick was inspired by the Office Challenge in TechRepublic (http://www.techrepublic.com/):

More information

Functional Programming in Haskell Prof. Madhavan Mukund and S. P. Suresh Chennai Mathematical Institute

Functional Programming in Haskell Prof. Madhavan Mukund and S. P. Suresh Chennai Mathematical Institute Functional Programming in Haskell Prof. Madhavan Mukund and S. P. Suresh Chennai Mathematical Institute Module # 02 Lecture - 03 Characters and Strings So, let us turn our attention to a data type we have

More information

Cindex 3.0 for Windows. Release Notes

Cindex 3.0 for Windows. Release Notes Cindex 3.0 for Windows Release Notes The information contained in this document is subject to change without notice, and does not represent a commitment on the part of Indexing Research. The program described

More information

Typesetting Thai With LaTeX

Typesetting Thai With LaTeX Typesetting Thai With LaTeX Hin-Tak Leung January 9, 2012 There are three ways of using TX (or more honestly, L A TX 2ε) to typeset Thai. They are X TX (or X L A TX), ThaiL A TX, and cjk/l A TX s Thai

More information

Handling very large XML documents in an editing application

Handling very large XML documents in an editing application Handling very large XML documents in an editing application Presenter: Radu Coravu radu_coravu@oxygenxml.com @radu_coravu Bytes and characters The byte is a unit of digital information that most commonly

More information

IDN and applications. Michel Suignard Senior Program Manager Microsoft

IDN and applications. Michel Suignard Senior Program Manager Microsoft IDN and applications Michel Suignard Senior Program Manager Microsoft IDN is the first step IDN solves a DNS limitation by carrying extended domain entities within the existing framework But most users

More information

Improving Unicode support in TEX

Improving Unicode support in TEX 12 MAPS 37 Mojca Miklavec and Arthur Reutenauer Putting the Cork back in the bottle Improving Unicode support in TEX Abstract Until recently, all of the hyphenation patterns available for different languages

More information

International Cataloging: Use Non-Latin Scripts

International Cataloging: Use Non-Latin Scripts OCLC Connexion Client Guides International Cataloging: Use Non-Latin Scripts Revised: September 2011 6565 Kilgour Place, Dublin, OH 43017-3395 www.oclc.org Revision History Date Section title Description

More information

The Use of Unicode in MARC 21 Records. What is MARC?

The Use of Unicode in MARC 21 Records. What is MARC? # The Use of Unicode in MARC 21 Records Joan M. Aliprand Senior Analyst, RLG What is MARC? MAchine-Readable Cataloging MARC is an exchange format Focus on MARC 21 exchange format An implementation may

More information

Navigating the pitfalls of cross platform copies

Navigating the pitfalls of cross platform copies Navigating the pitfalls of cross platform copies Kai Stroh, UBS Hainer GmbH Overview Motivation Some people are looking for a way to copy data from Db2 for z/ OS to other platforms Reasons include: Number

More information

Unicode Encoding. The TITUS Project

Unicode Encoding. The TITUS Project Unicode Encoding and Online Data Access Ralf Gehrke / Jost Gippert The TITUS Project ( Thesaurus indogermanischer Text- und Sprachmaterialien ) (since 1987/1993) www.ala.org/alcts 1 Scope of the TITUS

More information

Computer Science 1001.py. Lecture 19, part B: Characters and Text Representation: Ascii and Unicode

Computer Science 1001.py. Lecture 19, part B: Characters and Text Representation: Ascii and Unicode Computer Science 1001.py Lecture 19, part B: Characters and Text Representation: Ascii and Unicode Instructors: Benny Chor, Amir Rubinstein Teaching Assistants: Amir Gilad, Michal Kleinbort Founding Teaching

More information

Developping of Character Object Technology with Character Databases

Developping of Character Object Technology with Character Databases Developping of Character Object Technology with Character Databases 1) 2) MORIOKA Tomohiko Christian Wittern 1) 606-8265 47 E-mail: tomo@kanji.zinbun.kyoto-u.ac.jp 2) 606-8265 47 E-mail: wittern@kanji.zinbun.kyoto-u.ac.jp

More information

Computer Science 1001.py. Lecture 19a: Generators continued; Characters and Text Representation: Ascii and Unicode

Computer Science 1001.py. Lecture 19a: Generators continued; Characters and Text Representation: Ascii and Unicode Computer Science 1001.py Lecture 19a: Generators continued; Characters and Text Representation: Ascii and Unicode Instructors: Daniel Deutch, Amir Rubinstein Teaching Assistants: Ben Bogin, Michal Kleinbort,

More information

[MS-UCODEREF]: Windows Protocols Unicode Reference. Intellectual Property Rights Notice for Open Specifications Documentation

[MS-UCODEREF]: Windows Protocols Unicode Reference. Intellectual Property Rights Notice for Open Specifications Documentation [MS-UCODEREF]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation ( this documentation ) for protocols,

More information

About BasicTeX Richard Koch. June 5, 2017

About BasicTeX Richard Koch. June 5, 2017 About BasicTeX-2017 Richard Koch June 5, 2017 1 Introduction Most TeX distributions for Mac OS X are based on TeX Live, the reference edition of TeX produced by TeX User Groups across the world. Among

More information

Label Flow Release Notes

Label Flow Release Notes Label Flow Release Notes Release 6.3.25 Release Date: 4/2/2013 Note 1: Read 6.0 Release Notes if upgrading from version 5 or earlier for important information. - Fixed an issue with PDF417 barcodes not

More information

Attacking Internationalized Software

Attacking Internationalized Software Scott Stender scott@isecpartners.com Black Hat August 2, 2006 Information Security Partners, LLC isecpartners.com Introduction Background Internationalization Basics Platform Support The Internationalization

More information

My LaTeX Document. July 7, Introduction 2

My LaTeX Document. July 7, Introduction 2 My LaTeX Document Me July 7, 2017 Contents 1 Introduction 2 2 Exercises 2 2.1 Piecewise defined function...................... 2 2.2 Greek letters............................. 2 2.3 Limits.................................

More information

The basics of LaTeX. Cédric Buron. April 25, 2016

The basics of LaTeX. Cédric Buron. April 25, 2016 The basics of LaTeX Cédric Buron April 25, 2016 Hello fellows! Today, I introduce a very important language in my PhD, L A TEX(what a logo!). In this article, I m going to introduce very quickly the purpose,

More information

Multimedia Data. Multimedia Data. Text Vector Graphics 3-D Vector Graphics. Raster Graphics Digital Image Voxel. Audio Digital Video

Multimedia Data. Multimedia Data. Text Vector Graphics 3-D Vector Graphics. Raster Graphics Digital Image Voxel. Audio Digital Video Multimedia Data Multimedia Data Text Vector Graphics 3-D Vector Graphics Raster Graphics Digital Image Voxel Audio Digital Video 1 Text There are three types of text that are used to produce pages of documents

More information

Handling very large XML documents in an editing application

Handling very large XML documents in an editing application Handling very large XML documents in an editing application Presenter: Radu Coravu radu_coravu@oxygenxml.com @radu_coravu Bytes and characters The byte is a unit of digital information that most commonly

More information

IPDS Emulation. Selects the Data Stream parameter to the following values: - IPDS (default) - ASCII LAN INTERFACE PARAMETERS

IPDS Emulation. Selects the Data Stream parameter to the following values: - IPDS (default) - ASCII LAN INTERFACE PARAMETERS IPDS Emulation The following is the description of the printer IPDS specific menu items. These items appear only, when the Ethernet interface board option is installed in the printer. The Printers uses

More information

Unicode Support. Chapter 2:

Unicode Support. Chapter 2: Unicode Support Chapter 2: SYS-ED/Computer Education Techniques, Inc. Ch 2: 1 SYS-ED/Computer Education Techniques, Inc. Ch 2: 1 Objectives You will learn: Unicode features. How to use literals and data

More information

Coordination! As complex as Format Integration!

Coordination! As complex as Format Integration! True Scripts in Library Catalogs The Way Forward Joan M. Aliprand Senior Analyst, RLG 2004 RLG Why the current limitation? Coordination! As complex as Format Integration! www.ala.org/alcts 1 Script Capability

More information

WYSIWYG Systems. CSCM10 Research Methodology Lecture 4 (11/2/16): A Taster of L A TEX WYSIWYM. Advantages/Disadvantages of WYSIWYG Systems

WYSIWYG Systems. CSCM10 Research Methodology Lecture 4 (11/2/16): A Taster of L A TEX WYSIWYM. Advantages/Disadvantages of WYSIWYG Systems WYSIWYG Systems CSCM10 Research Methodology Lecture 4 (11/2/16): A Taster of L A TEX Anton Setzer http://www.cs.swan.ac.uk/ csetzer/lectures/ computerscienceprojectresearchmethods/current/index.html WYSIWYG

More information

Computer Organization and Assembly Language. Lab Session 4

Computer Organization and Assembly Language. Lab Session 4 Lab Session 4 Objective: Learn how Data is represented in Assembly Language Introduction to Data Types and using different Data Types in Assembly language programs Theory: The basic machine data types

More information

Introduction to Latex. A workshop by Dr. Ala Eshmawi

Introduction to Latex. A workshop by Dr. Ala Eshmawi Introduction to Latex A workshop by Dr. Ala Eshmawi Introduction TeX is essentially a Markup Language (like HTML, XML and RTF) TeX written by Donald Knuth in 70 s A revolution in typesetting Latex is an

More information

COSC 243 (Computer Architecture)

COSC 243 (Computer Architecture) COSC 243 Computer Architecture And Operating Systems 1 Dr. Andrew Trotman Instructors Office: 123A, Owheo Phone: 479-7842 Email: andrew@cs.otago.ac.nz Dr. Zhiyi Huang (course coordinator) Office: 126,

More information

Attacking Internationalized Software

Attacking Internationalized Software Scott Stender scott@isecpartners.com Black Hat August 2, 2006 Information Security Partners, LLC isecpartners.com Introduction Who are you? Founding Partner of Information Security Partners, LLC (isec

More information

Introduction to QuickMath

Introduction to QuickMath Introduction to QuickMath https://www.quickmath.io QuickMath Authors April 2, 2018 Contents 1 Basic Editing 1 1.1 System Requirements.......................... 1 1.2 Editor Layout...............................

More information

Getting ready for L A TEX. Alexis Dimitriadis. Version: March 28, 2013

Getting ready for L A TEX. Alexis Dimitriadis. Version: March 28, 2013 Getting ready for L A TEX Alexis Dimitriadis Version: March 28, 2013 LaTeX is a great system, but it takes some work to learn. Unfortunately, it also takes some work to set up the necessary software. This

More information

CSCM10 Research Methodology A Taster of L A TEX

CSCM10 Research Methodology A Taster of L A TEX CSCM10 Research Methodology A Taster of L A TEX Anton Setzer http://www.cs.swan.ac.uk/ csetzer/lectures/ computerscienceprojectresearchmethods/current/index.html November 10, 2016 CSCM10 Lecture 4, 11/2/16:

More information

Extension of VHDL to support multiple-byte characters

Extension of VHDL to support multiple-byte characters Abstract Extension of VHDL to support multiple-byte characters Written Japanese is comprised of many kinds of characters. Whereas one-byte is sufficient for the Roman alphabet, two-byte are required to

More information

Administrative Notes February 9, 2017

Administrative Notes February 9, 2017 Administrative Notes February 9, 2017 Feb 10: Project proposal resubmission (optional) Feb 13: Art and Images reading quiz Feb 17: In the News call #2 Data Representation: Part 2 Text representation Colour

More information

Princeton University. Computer Science 217: Introduction to Programming Systems. Data Types in C

Princeton University. Computer Science 217: Introduction to Programming Systems. Data Types in C Princeton University Computer Science 217: Introduction to Programming Systems Data Types in C 1 Goals of C Designers wanted C to: Support system programming Be low-level Be easy for people to handle But

More information

LING 388: Computers and Language. Lecture 5

LING 388: Computers and Language. Lecture 5 LING 388: Computers and Language Lecture 5 Administrivia Homework 3 graded Quick Homework 4 out today I'll be away next two weeks (my apologies) Colton Flowers, a HLT student, will take you through Python

More information

LATEX, BibTEX, and Friends. Robert Buels. Outline. What is LATEX. Using BibTEX LYX the graphical LATEX Managing your BibTEX references Further reading

LATEX, BibTEX, and Friends. Robert Buels. Outline. What is LATEX. Using BibTEX LYX the graphical LATEX Managing your BibTEX references Further reading LATEX, BibTEX, and Friends Outline Robert Buels What is LATEX Using LATEX Using BibTEX LYX the graphical LATEX Managing your BibTEX references Further reading Pronunciation TEX : tech Properly: like loch

More information

Global Deployment Guide. Version 7.8, Rev. A November 2005

Global Deployment Guide. Version 7.8, Rev. A November 2005 Global Deployment Guide Version 7.8, Rev. A November 2005 Siebel Systems, Inc., 2207 Bridgepointe Parkway, San Mateo, CA 94404 Copyright 2005 Siebel Systems, Inc. All rights reserved. Printed in the United

More information

Computer Science Applications to Cultural Heritage. Introduction to computer systems

Computer Science Applications to Cultural Heritage. Introduction to computer systems Computer Science Applications to Cultural Heritage Introduction to computer systems Filippo Bergamasco (filippo.bergamasco@unive.it) http://www.dais.unive.it/~bergamasco DAIS, Ca Foscari University of

More information

The newunicodechar package

The newunicodechar package The newunicodechar package nrico Gregorio nrico dot Gregorio at univr dot it April 8, 2018 1 Introduction When using Unicode input with L A TX it s not so uncommon to get an incomprehensible error message

More information

Technical Updates. HALCON Progress

Technical Updates. HALCON Progress Technical Updates HALCON 18.11 Progress This technical note describes how to meet technological changes in HALCON, Version 18.11.0.0. All rights reserved. No part of this publication may be reproduced,

More information