Integration of an on-line handwriting recognition system in a smart phone device

Similar documents
Synthetic On-line Handwriting Generation by Distortions and Analogy

Prototype Selection Methods for On-line HWR

Handwritten text segmentation using blurred image

BoxPlot++ Zeina Azmeh, Fady Hamoui, Marianne Huchard. To cite this version: HAL Id: lirmm

Reverse-engineering of UML 2.0 Sequence Diagrams from Execution Traces

Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases

Fuzzy sensor for the perception of colour

Haar-like-features for query-by-string word spotting

A case-based reasoning approach for unknown class invoice processing

Scalewelis: a Scalable Query-based Faceted Search System on Top of SPARQL Endpoints

Tacked Link List - An Improved Linked List for Advance Resource Reservation

Multi-font Numerals Recognition for Urdu Script based Languages

FIT IoT-LAB: The Largest IoT Open Experimental Testbed

A case-based reasoning approach for invoice structure extraction

KeyGlasses : Semi-transparent keys to optimize text input on virtual keyboard

Setup of epiphytic assistance systems with SEPIA

Multimedia CTI Services for Telecommunication Systems

Motion-based obstacle detection and tracking for car driving assistance

Blind Browsing on Hand-Held Devices: Touching the Web... to Understand it Better

Change Detection System for the Maintenance of Automated Testing

A Resource Discovery Algorithm in Mobile Grid Computing based on IP-paging Scheme

HySCaS: Hybrid Stereoscopic Calibration Software

An algorithm based on visual perception for handwriting comparison

Fault-Tolerant Storage Servers for the Databases of Redundant Web Servers in a Computing Grid

Natural Language Based User Interface for On-Demand Service Composition

Explicit fuzzy modeling of shapes and positioning for handwritten Chinese character recognition

BugMaps-Granger: A Tool for Causality Analysis between Source Code Metrics and Bugs

How to simulate a volume-controlled flooding with mathematical morphology operators?

Relabeling nodes according to the structure of the graph

An Efficient Numerical Inverse Scattering Algorithm for Generalized Zakharov-Shabat Equations with Two Potential Functions

Every 3-connected, essentially 11-connected line graph is hamiltonian

ASAP.V2 and ASAP.V3: Sequential optimization of an Algorithm Selector and a Scheduler

DSM GENERATION FROM STEREOSCOPIC IMAGERY FOR DAMAGE MAPPING, APPLICATION ON THE TOHOKU TSUNAMI

Taking Benefit from the User Density in Large Cities for Delivering SMS

Deformetrica: a software for statistical analysis of anatomical shapes

Mokka, main guidelines and future

The New Territory of Lightweight Security in a Cloud Computing Environment

An FCA Framework for Knowledge Discovery in SPARQL Query Answers

Moveability and Collision Analysis for Fully-Parallel Manipulators

Linked data from your pocket: The Android RDFContentProvider

X-Kaapi C programming interface

Sewelis: Exploring and Editing an RDF Base in an Expressive and Interactive Way

Comparator: A Tool for Quantifying Behavioural Compatibility

Continuous marking menus for learning cursive pen-based gestures

A Generic Architecture of CCSDS Low Density Parity Check Decoder for Near-Earth Applications

A Practical Evaluation Method of Network Traffic Load for Capacity Planning

Branch-and-price algorithms for the Bi-Objective Vehicle Routing Problem with Time Windows

DANCer: Dynamic Attributed Network with Community Structure Generator

Light field video dataset captured by a R8 Raytrix camera (with disparity maps)

Robust IP and UDP-lite header recovery for packetized multimedia transmission

Lossless and Lossy Minimal Redundancy Pyramidal Decomposition for Scalable Image Compression Technique

NP versus PSPACE. Frank Vega. To cite this version: HAL Id: hal

Syrtis: New Perspectives for Semantic Web Adoption

Linux: Understanding Process-Level Power Consumption

An Experimental Assessment of the 2D Visibility Complex

Multi-atlas labeling with population-specific template and non-local patch-based label fusion

Computing and maximizing the exact reliability of wireless backhaul networks

Study on Feebly Open Set with Respect to an Ideal Topological Spaces

Service Reconfiguration in the DANAH Assistive System

Real-Time Collision Detection for Dynamic Virtual Environments

Teaching Encapsulation and Modularity in Object-Oriented Languages with Access Graphs

Primitive roots of bi-periodic infinite pictures

Real-Time and Resilient Intrusion Detection: A Flow-Based Approach

QAKiS: an Open Domain QA System based on Relational Patterns

A New Method for Mining High Average Utility Itemsets

Experimental Evaluation of an IEC Station Bus Communication Reliability

YAM++ : A multi-strategy based approach for Ontology matching task

The Connectivity Order of Links

Open Digital Forms. Hiep Le, Thomas Rebele, Fabian Suchanek. HAL Id: hal

Structuring the First Steps of Requirements Elicitation

A Voronoi-Based Hybrid Meshing Method

FStream: a decentralized and social music streamer

Regularization parameter estimation for non-negative hyperspectral image deconvolution:supplementary material

Detecting Anomalies in Netflow Record Time Series by Using a Kernel Function

Assisted Policy Management for SPARQL Endpoints Access Control

The Proportional Colouring Problem: Optimizing Buffers in Radio Mesh Networks

Real-time FEM based control of soft surgical robots

Quality of Service Enhancement by Using an Integer Bloom Filter Based Data Deduplication Mechanism in the Cloud Storage Environment

A Fuzzy Approach for Background Subtraction

Fuzzy interpolation and level 2 gradual rules

MUTE: A Peer-to-Peer Web-based Real-time Collaborative Editor

Comparison of spatial indexes

Stream Ciphers: A Practical Solution for Efficient Homomorphic-Ciphertext Compression

From Microsoft Word 2003 to Microsoft Word 2007: Design Heuristics, Design Flaws and Lessons Learnt

Machine Print Filter for Handwriting Analysis

Catalogue of architectural patterns characterized by constraint components, Version 1.0

Automatic indexing of comic page images for query by example based focused content retrieval

Hardware Acceleration for Measurements in 100 Gb/s Networks

Kernel perfect and critical kernel imperfect digraphs structure

Self-optimisation using runtime code generation for Wireless Sensor Networks Internet-of-Things

Application-Aware Protection in DWDM Optical Networks

Generative Programming from a Domain-Specific Language Viewpoint

Technical Overview of F-Interop

XML Document Classification using SVM

Mapping classifications and linking related classes through SciGator, a DDC-based browsing library interface

A Methodology for Improving Software Design Lifecycle in Embedded Control Systems

COM2REACT: V2V COMMUNICATION FOR COOPERATIVE LOCAL TRAFFIC MANAGEMENT

RETIN AL: An Active Learning Strategy for Image Category Retrieval

A N-dimensional Stochastic Control Algorithm for Electricity Asset Management on PC cluster and Blue Gene Supercomputer

Managing Risks at Runtime in VoIP Networks and Services

Transcription:

Integration of an on-line handwriting recognition system in a smart phone device E Anquetil, H Bouchereau To cite this version: E Anquetil, H Bouchereau. Integration of an on-line handwriting recognition system in a smart phone device. International Conference on Pattern Recognition, Aug 2002, Quebec, Canada. 3, pp.192-195, 2002, <10.1109/ICPR.2002.1047827>. <hal-01576053> HAL Id: hal-01576053 https://hal.inria.fr/hal-01576053 Submitted on 22 Aug 2017 HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Integration of an On-line Handwriting Recognition System in a Smart Phone Device E. Anquetil INSA-IRISA (1) Campus Universitaire de Beaulieu F-35042 Rennes Cedex, France Eric.Anquetil@irisa.fr H. Bouchereau Purple Labs SAS (2) Savoie Technolac F-73375 Le Bourget-du-Lac Cédex, France Helene.Bouchereau@purplelabs.com Abstract This paper presents the evolving of our academic development to a technology driven application: the integration of an unconstraint cursive on-line handwritten character recognition system into a smart phone device. The ultimate goal of this work is to implement an accuracy handwriting recognizer into mobile devices with limited computing and memory resources. A hierarchical fuzzy modeling is used to obtain a compact and robust knowledge representation and the decision process is based on an adapted fuzzy inference system to reduce computing without decreasing the performances. We describe in this paper the basic architecture of the recognition system called ResifCar, its practical adaptation to the mobile device constraints and the recognition rates both on cursive isolated letters (91.9%) and on isolated digits (92.3%) in a writer independent context based on 100 different writers. 1. Introduction On-line systems for handwriting recognition take an important place in the newer technologies around handheld computers. These new mobile devices are personal tools small enough to be carried everywhere. Most popular are personal digital assistants (PDAs) and soon the next generation of mobile phones, called smart phone, which combine the functionality of a mobile phone with that of a PDA. Several handwriting recognition systems have been yet integrated in mobile devices. They can be characterized by the constraint environment the handwriting recognition is processed, the character set that can be recognized, the writing styles they are able to deal with, the CPU and memory requirements, and the recognition rates they can achieved in a writer independent context. It is still today a real research challenge to design a handwriting recognition system able to overcome all these interdependent problems [2][3]. In this way, we have collaborated during one year with an industrial partner (2) specialized in the development of mobile handsets. The aim of this collaboration is to optimize and integrate our handwritten character recognition system called ResifCar into a smart phone device. ResifCar is a writer independent on-line handwriting recognition system able to deal with cursive handwritten characters. It is the result of our (1) last 10 years researches [1][6][7]. This paper describes the adaptation and the optimization of ResifCar for its integration into a smart phone device based on ARM 7 TDMI microprocessor. In the context of this practical implementation, the recognizer must compute into a device with limited resources : 13 Mhz CPU, RAM size of 50 Kbytes and ROM size (program and data) around 200 Kbytes available for this feature. Moreover the input method which combines LCD display with a digitizer is based on a low tactile screen definition (0.20 mm pixel pitch) with a sampling frequency of 20 points per second. In Parallel with these practical constraints, the key goals for the on-line recognizer are: unconstrained handwriting recognition: be able to deal with the shape variations of a single character coming from the different writing styles (allographs, see Table 1); reliable recognition of cursive isolated handwritten characters: Latin alphabet, digits and other special symbols;

writer independent recognition: no need to have user training before the recognition can start; multi-strokes character recognition; easy extension of the recognizer with new character models. The following section presents the principles of the recognition system ResifCar. Then, section 3 describes the adaptation and optimization of the recognizer for its integration into the mobile device with limited resources. Section 4 presents results both on the recognition rates and on the practical integration aspects such as computation time and memory size. 2. Recognition system: ResifCar ResifCar is based on a hierarchical fuzzy modeling of each character class [6]. This intrinsic modeling provides a robust description of the most pertinent primitive shapes that define a character class. The shape primitives are automatically extracted from a character drawing both from perceptually important points called singularities or perceptually anchorage points and from dynamic information such as ductus [2] [7]. The hierarchical architecture (cf. Figure 1) of each model is represented by three general levels (L1, L2, L3) structured according to their robustness and pertinence [2] [6]. The first level models the most pertinent downstrokes primitives which are known to be one of the most invariant and robust part of the handwriting drawing. The second level refines the description of downstrokes by modeling their morphological context. And finally, the last level represents beginnings and ends of the character shape often called ligatures. The interdependence of the knowledge represented in the three levels is formalized by a fuzzy prototype tree. level 1 (L1) level 2 (L2) level 3 (L3) Figure 1. Partial example of 3 hierarchical representation levels illustrated for a letter a. The learning process is operated for each character class using a training data base. This automatic process generates successively the three representation levels. The key paradigm of the learning process is to extract and model automatically the most pertinent shape primitives by using possibilistic clustering [4] [5] in a feature space according to each of the three representation levels. Possibilistic clustering generates fuzzy ellipsoidal prototypes that model the typicality of the most pertinent primitives sets. After the learning phase, the automatic fuzzy modeling process [6] formalizes the knowledge by explicit fuzzy rules as follows: r k : if ( ((x L1 1 is mk, L1 11 andand x L1 r is m k, L1 1r ) and/or (x L1 1 is mk, L1 21 andand xl1 r is mk, L1 2r ) and/or) and((x L2 1 is m k, L2 11 andand x L2 s is m k, L2 1s ) and/or (x L2 1 is m k, L2 21 andand x L2 s is m k, L2 2s ) and/or) and((x L3 1 is m k, L3 11 andand x L3 t is m k, L3 1t ) and/or (x L3 is m k, L3 andand x L3 is m k, L3 ) and/or) 1 21 t 2t ) then the unknown pattern belongs to class c k and not to the others, where r k is the rule which characterizes the class c k, (x Lj 1,, x Lj r) is the input vector of feature space of level Lj, (m k, Lj α1,, m k, Lj αr) are the corresponding fuzzy membership functions relative to each feature, where m k, Lj α is defined by a bell-shaped function which models the prototype P k, Lj α according to the feature x Lj. The number of prototype P k, Lj α extracted for each level Lj is automatically estimated by the possibilistic clustering. That means several robust types of primitive shapes can be modeled for each representation level to cope with different drawings of a same character (often called allographs) according to different handwriting styles (see Table 1). It is important to notice the interdependence of each three parts of the premises according to the level description (L1, L2, L3). In fact, each rule premise is dynamically builds from level one to level three according to the adequacy between the unknown character shape and the hierarchical fuzzy prototype tree. All the n fuzzy rules r k generated for each class c k are aggregated in a fuzzy inference system to compose the decision process. The fuzzy inference process is computed with the sum-product inference method defined by: n ρ δ i ik i= 1 y' k = n ρi i= 1 k, Lj Lj with ρ k= m ( x' ), α Lj, α, where y k is the adequacy measure of model k with the unknown character, δ ik the kronecker symbol that defines output i of the fuzzy rule r k, and ρ k the activation value of

the rule r k. ρ k is calculated by the product conjunction of all membership degree of input feature x Lj to the corresponding fuzzy membership function m k, Lj α. In case of disjunction in the premise part max operator is used. This intrinsic modeling process allows to add new character classes without rebuilding all existing models. Moreover, the fuzzy modeling is very robust with respect to intra class variability. Then, the recognizer is able to deal with allographic variation as illustrated in Section 4. 3. Adaptation and optimization To integrate ResifCar into the mobile device we need to overcome the limitation of computing and memory resources without imposing new handwriting constraints and without decreasing recognition rates. The recognition time depends on the size of the model set. Therefore, to speed up the recognition, a pre-selection is performed thanks to the transparent (explicit) design of the hierarchical fuzzy modeling. The decision process is optimized by a pre-selection of models according to the first representation level which is based on the most stable shape primitives. The selection of the best-n models is operated as follows: max(ρ L1, k) k= 1, n model i is selected if ρ L1, i, sl1 where ρ L1, k is the level one (L1) activation value of the rule r k (model k ) and S L1 the threshold of selection. This pre-selection is very reliable and experimentally around 70% of models are pruned in average by the pre-selection process. Then, the continuation of the decision process is operated on a reduced number of models both for second and third level representations. Other practical adaptations have been done to limit computing and memory size. For instance fuzzy inference process is traduced in logarithms to avoid multiplication operations which are very computing expensive. Moreover fuzzy membership functions have been tabulated in their logarithmic form and all numerical features were coded by integer numbers. Simplified fuzzy inference is now computed as follows: k, Lj Lj y& k = k with ρ k m α ( x ) Lj, α, ρ& & = & &, where y&k is the new adequacy measure of model k with the unknown character, and ρ& k the new activation value of the rule r k calculated by the summation of all logarithmic membership degree of the input feature x& using the corresponding tabulated fuzzy logarithmic membership function m&. This new adequacy measure y&k is no more normalized and directly used by the decision process to classify the N- best models. 4. Experiments and results The experiments have been conducted on the optimized ResifCar recognizer which has been implemented into the Arm7 TDMI microprocessor of the smart phone device. Two experimental results are reported: one on isolated cursive lower-case letters with special symbols and the other on isolated cursive digits. The leaning process was conducted on 63 different classes for the 26 letters and special symbols: @, &,, $,,. For the 10 digits, 26 models have been generated. The aim is to offer the most unconstrained handwriting recognition able to cope with a great set of writing styles. Table 1 illustrates the capability of the recognizer to identify different allographic shapes. Table 1. Examples of different allographic shapes recognized by ResifCar. classes a b d f k y z 5 8 allograph examples stroke beginning stroke end The experiments were carried out on a data base composed of 15000 isolated characters written down by around 300 different writers. The models were automatically learned from samples coming from 200 different writers, and the samples from the 100 others writers were used for test experiments. Our experimental recognition results are summarized in Table 2.

Table 2. Writer independent recognition rates on isolated handwritten characters letters and special symbols Top one Top two Top three 91.9% 96.4% 97.5% digits 92.3% 94.8% 96.3% It is interesting to notice that the modeling process initially developed for handwritten letters recognition was then directly applied to handwritten digits recognition. This shows the good genericity and adaptation capability of the proposed modeling process for the recognition of different kinds of handwritten shapes. The performance of the recognition system concerning computing and memory resources are reported in Table 3 and Table 4. These figures can appear very high for a single feature in such a device but the algorithm performances are quite good compared to the same feature performances in other products. What it is important to say, it is that the algorithm permits to recognized multistrokes characters. Table 3. Memory resources sizes Recognition system the 63+26 character models memory size (Rom) 48 KB 60.9 KB Table 4. Computing performance real time computing without optimizations a letter t 9 s 0.62 s a letter a 11 s 0.49 s in average considering all the models 11 s 0.5 s memory size (Ram) 31.6 KB real time computing with optimizations Recognition rates on isolated characters are significant due to the facts that the recognition is done without any context (many letters and digits can be confused: h, k and e, l and r, v and 1, 7 etc.) and that the computing and memory resources of the device are very limited. Moreover, they are already sufficient for using the recognition system as a friendly and practical handwriting input device. 5. Conclusions and future work This paper has presented a real integration into a mobile phone of an unconstrained handwritten cursive character recognizer called ResifCar. Experiments showed the robustness and the compactness of the intrinsic hierarchical fuzzy modeling. Furthermore the decision process has been optimized to achieve reliable recognition both on isolated cursive characters and on digits with limited computing and memory resources. Work is in progress to extend the capability of the recognition system to unconstrained cursive handwritten words. In fact, this system is almost operational [7] but needs to be adapted and optimized in the same way as shown in this paper to overcome the computing and memory limitations of hand-held devices. 6. Acknowledgments Thanks to G. Lorette, L. Pasquer and J. Camillerapp from IMADOC team at IRISA for their contribution to this work. References [1] E. Anquetil, G. Lorette, "On-line Handwriting Character Recognition System Based on Hierarchical Qualitative Fuzzy Modeling", Progress in Handwriting Recognition, World Scientific edition, pp.109-116, 1997. [2] G. Lorette, "Handwriting recognition or reading? what is the situation at the dawn of the 3rd millenium", International Journal on Document Analysis and Recogniton, pp.2-12, July 1999. [3] R. Plamondon, S N. Srihari, "On-Line and Off-Line Hanwriting Recognition : A Comprehensive Survey", IEEE Transactions on Pattern Analysis and Machine Intelligence,Vol. 22, No 1, pp.63-84, January 2000. [4] R. Krishnapuram and J. M. Keller, "A possibilistic approach to clustering", IEEE Trans. on fuzzy Systems, Vol. 1, No. 2, pp. 98-110, 1993. [5] R. Krishnapuram "Generation of Membership Functions via Possibilistic Clustering", IEEE World congress on computational intelligence, pp. 902-908, 1994. [6] E. Anquetil, G. Lorette, "Automatic Generation of Hierarchical Fuzzy Classification Systems Based on Explicit Fuzzy Rules Deduced from Possibilistic Clustering: Application to On-line Handwritten Character Recognition", Information Processing and Management of Uncertainty in Knowledge-Based Systems, Grenade, Spain, pp. 259-264, Juillet 1996. [7] E. Anquetil, G. Lorette, "Perceptual Model of Handwriting Drawing, application to Handwriting Segmentation Problem", International Conf. on Document Analysis and Recognition, Ulm, Germany, pp. 112-117, Août 1997.