FineReader Engine Overview & New Features in V10
Semyon Sergunin, ABBYY Headquarters
Michael Fuchs, ABBYY Europe GmbH
September 2010
FineReader Engine Processing Steps
- Step 1: Image/Document Input
- Step 2: Image Pre-processing Algorithms
- Step 3: Document & Layout Analysis
- Step 4: Recognition
- Step 5: Verification of the Recognition Results
- Step 6: Synthesis & Export
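The six steps above form a linear pipeline: each step consumes the result of the previous one. A minimal sketch of that ordering follows. The step keys and the `run_pipeline` helper are hypothetical names chosen for illustration and are not part of the FineReader Engine API.

```python
# Step keys mirror the six steps on the slide; the names are illustrative only.
STEPS = [
    "input",
    "pre_processing",
    "layout_analysis",
    "recognition",
    "verification",
    "synthesis_export",
]


def run_pipeline(document, handlers):
    """Pass `document` through one handler per step, in slide order."""
    for step in STEPS:
        document = handlers[step](document)
    return document


def make_tracer(step):
    """Placeholder handler that only records which step ran."""
    return lambda doc: doc + [step]


trace = run_pipeline([], {step: make_tracer(step) for step in STEPS})
print(trace)
```

Replacing a tracer with a real handler changes one step without touching the order of the others.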
Step 1 Image Input
Step 1. Input
Opening existing images
- Load images from disc or memory
  - BMP, PCX, DCX, GIF, PNG, DjVu
  - JPEG and JPEG2000 (part 1)
  - TIFF
    - B&W (uncompressed, CCITT3, CCITT3FAX, CCITT4, PackBits, ZIP, LZW)
    - Grayscale (uncompressed, PackBits, JPEG, ZIP, LZW)
    - Colour (uncompressed, JPEG, ZIP, LZW)
  - PDF
    - Adobe PDF Library 9.0
    - Access to internal data (Metadata, Annotations, Text Objects, etc.)
  - Memory image formats: Raw, Bitmap [HBITMAP], DIB
- Load images from digital cameras
  - Advanced image pre-processing algorithms in FRE available!
- Screenshot Reader
  - Capture any area from the screen
  - Any formats (including Flash)
Step 1. Input
Scanning documents (TWAIN)
- Scanning via TWAIN Interface
  - ADF (Automatic Document Feeder)
  - Manual paper feeder
- Scanner settings
  - Brightness
  - Colour
  - Resolution
  - Image compression
  - Define scanning area (zone)
  - Simplex / Duplex
  - Orientation / automatic rotation / manual rotation
  - Paper format
  - Paper Top/Bottom/Left/Right
  - Etc.
- Visual Component: alternatively, the original dialogue from the scanner driver can be used
Step 2 Image Pre-Processing
Step 2. Image pre-processing
Available Options
- Automatic rotation
- Automatic deskewing
- Cropping
- Automatic image splitting
- Straighten lines of text
- Noise removal
- Despeckling
- Scale images (i.e. interpolate images with low resolution)
- Rotation (90°, 180° and 270°)
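To make one of the options above concrete, here is a minimal sketch of despeckling on a binary image. It assumes the simplest possible rule (delete ink pixels that have no ink neighbour); the slide does not describe the algorithm FineReader Engine actually uses, and `despeckle` is a hypothetical helper.

```python
def despeckle(image):
    """Remove isolated ink pixels from a binary image (1 = ink, 0 = background)."""
    height, width = len(image), len(image[0])
    cleaned = [row[:] for row in image]
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1:
                continue
            # Count ink pixels among the up-to-eight surrounding neighbours.
            neighbours = sum(
                image[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
                if (ny, nx) != (y, x)
            )
            if neighbours == 0:
                cleaned[y][x] = 0  # a lone speck: treat it as noise
    return cleaned


page = [
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
]
print(despeckle(page))
```

The lone pixel in the top-left corner is removed, while the two-pixel stroke in the third row is kept because its pixels are each other's neighbours.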
Step 2. Image pre-processing Binarisation Overview Intelligent background filtering Adaptive Binarisation
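As an illustration of what "adaptive" means here, the sketch below thresholds every pixel against the mean of its local neighbourhood rather than against one global value. This is a generic local-mean method written under that assumption, not ABBYY's algorithm; `adaptive_binarise`, `radius` and `offset` are hypothetical names.

```python
def adaptive_binarise(gray, radius=1, offset=0):
    """Return a binary image: 1 where a pixel is darker than its local mean."""
    height, width = len(gray), len(gray[0])
    result = []
    for y in range(height):
        row = []
        for x in range(width):
            # Collect the neighbourhood, clipped at the image borders.
            window = [
                gray[ny][nx]
                for ny in range(max(0, y - radius), min(height, y + radius + 1))
                for nx in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            local_mean = sum(window) / len(window)
            row.append(1 if gray[y][x] < local_mean - offset else 0)
        result.append(row)
    return result


scan = [
    [200, 200, 200],
    [200, 50, 200],
    [200, 200, 200],
]
print(adaptive_binarise(scan, radius=1, offset=10))
```

Because the threshold follows the local background, a dark stroke on a bright area and a dark stroke on a shaded area can both be separated, which a single global threshold often cannot do.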
Step 2. Image pre-processing New V10: New Binarisation [Figure panels: original scan, previous binarisation, new binarisation]
Step 2. Image pre-processing New V10: Binarisation, Textured Background optimisations [Figure panels: original scan, previous binarisation, new binarisation]
Step 2. Image pre-processing New V10: Binarisation for the IMPACT project [Figure panels: original, previous binarisation, new binarisation (no text from the other page)]
Step 2. Image pre-processing New V10 Colour Filtering (stamps and marks)
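The idea of filtering out coloured stamps and marks before recognition can be sketched as follows. The example assumes a red stamp and a very simple rule (a pixel counts as "red" when its red channel exceeds both other channels by a margin); the slide does not say how FineReader Engine detects colours, and `filter_red_marks` and `margin` are hypothetical.

```python
WHITE = (255, 255, 255)


def filter_red_marks(pixels, margin=60):
    """Replace strongly red pixels with white; leave all other pixels untouched."""
    return [
        [WHITE if r - max(g, b) > margin else (r, g, b) for (r, g, b) in row]
        for row in pixels
    ]


row_with_stamp = [[(200, 30, 40), (20, 20, 20), (255, 255, 255)]]
print(filter_red_marks(row_with_stamp))
```

The red stamp pixel is whitened, while the dark text pixel and the white background keep their values, so a later binarisation step sees only the text.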
Step 2. Image pre-processing: Camera OCR New V10: Automatic correction of 3D perspective distortions [Figure: before / after]
Step 2. Image pre-processing: Camera OCR New V10: Blurred images correction [Figure: before / after]
Step 2. Image pre-processing: Camera OCR New V10: ISO noise reduction [Figure: before / after]
Step 3 Document & Layout Analysis
Step 3. Document & Layout Analysis Detect the sections of a document, analyse the layout and find barcodes
Step 3. Document & Layout Analysis
3 layout analysis modes are available:
- Document Analysis Normal
  - Returns text, tables, graphics (pictures), barcodes & patchcodes, lines (separators)
- Document Analysis for full text indexing
  - Graphics & pictures are OCRed as well
  - Returns text, tables, graphics (pictures), text inside of pictures and diagrams, barcodes & patchcodes, lines (separators)
- Document Analysis for invoices (DAI)
  - Optimized for small fonts
  - Returns text, tables as plain text, text inside of pictures and diagrams, barcodes & patchcodes, lines (separators)
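The differences between the three modes are easier to compare as a lookup table. The sketch below encodes only what the slide lists; the mode keys, block-type labels and the `modes_returning` helper are hypothetical names, not FineReader Engine identifiers.

```python
# What each analysis mode returns, as listed on the slide.
COMMON = {"text", "barcodes", "patchcodes", "separators"}

ANALYSIS_MODES = {
    "normal": COMMON | {"tables", "pictures"},
    "full_text_indexing": COMMON | {"tables", "pictures", "text_in_pictures"},
    "invoices": COMMON | {"tables_as_plain_text", "text_in_pictures"},
}


def modes_returning(block_type):
    """List the analysis modes whose results include `block_type`."""
    return sorted(
        mode for mode, types in ANALYSIS_MODES.items() if block_type in types
    )


print(modes_returning("text_in_pictures"))
print(modes_returning("pictures"))
```

A lookup like this makes the trade-off visible: only the full-text-indexing and invoice modes return text found inside pictures, and the invoice mode flattens tables to plain text.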
Step 3. Document & Layout Analysis New V10: Improved detection of charts and graphics [Figure: Old Technology vs. V10 Technology]; improved detection of pictures (photographs) [Figure: Old Technology vs. V10 Technology]
Step 3. Document & Layout Analysis New V10: Improvements for magazine-style pages [Figure: Old Technology, wrong detection of image and text blocks, vs. V10 Technology, correct detection of image and text blocks]
Step 4 Recognition
Step 4. Recognition
After line detection, character recognition is applied with different classifiers:
- Raster classifier
- Contour classifier
- Structure classifier
- Feature differentiating classifier
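The slide names four classifiers but does not say how their results are combined. One common way to combine several classifiers is to sum their per-character confidences and pick the best total. The sketch below shows that generic scheme under this assumption; it is not a description of FineReader Engine internals, and all names and scores in it are made up for illustration.

```python
from collections import defaultdict


def combine_votes(votes):
    """Sum confidences per candidate character and return the best candidate."""
    totals = defaultdict(float)
    for scores in votes:
        for char, confidence in scores.items():
            totals[char] += confidence
    return max(totals, key=totals.get)


# Hypothetical scores from four classifiers for one ambiguous glyph.
votes = [
    {"O": 0.6, "0": 0.4},    # raster
    {"O": 0.3, "0": 0.7},    # contour
    {"O": 0.55, "0": 0.45},  # structure
    {"O": 0.7, "0": 0.3},    # feature differentiating
]
print(combine_votes(votes))
```

Here one classifier prefers the digit, but the summed evidence favours the letter, which is the point of using classifiers that look at different properties of the glyph.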
Step 4. Recognition
Processing speed vs. accuracy balance
- The old conflict between recognition accuracy and processing speed still exists. Engine 10 tackles it with different approaches.
- Image quality does matter!
  - New Accurate Mode for low resolution/quality images (slightly slower)
  - Slightly improved accuracy in Normal Mode
  - Significant speed increase on good quality images in a new, enhanced Fast Mode
Step 4. Recognition
New V10: Accurate mode for low resolution scans
- Additional classifier trained on low resolution scans and faxes
- About 20% more accurate for low resolution scans
- About 10% slower than Normal mode
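A short worked example shows what the two percentages mean for a batch. It assumes that "20% more accurate" means 20% fewer recognition errors, which the slide does not state explicitly; the baseline figures (50 errors, 10 seconds) are invented for the arithmetic and `accurate_mode_estimate` is a hypothetical helper.

```python
def accurate_mode_estimate(normal_errors, normal_seconds,
                           error_reduction=0.20, slowdown=0.10):
    """Estimate errors and processing time in Accurate mode from Normal-mode figures."""
    errors = normal_errors * (1 - error_reduction)
    seconds = normal_seconds * (1 + slowdown)
    return errors, seconds


# Invented baseline: 50 errors and 10 seconds in Normal mode.
errors, seconds = accurate_mode_estimate(50, 10)
print(round(errors, 2), round(seconds, 2))
```

Under these assumptions the batch drops to about 40 errors at a cost of about one extra second, which is why the mode is aimed at low resolution scans and faxes rather than at every document.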
Step 4. Recognition Accuracy Improvements FRE10 Normal mode vs. FRE9 Normal mode *based on ABBYY internal tests; number of recognition errors normalized relative to FRE9_R1 values
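The footnote says the error counts are normalized relative to the FRE9_R1 values. A minimal sketch of that normalization follows; the raw counts are invented for illustration and are not ABBYY test results, and `normalise_errors` is a hypothetical helper.

```python
def normalise_errors(error_counts, baseline="FRE9_R1"):
    """Divide every error count by the baseline count, so the baseline becomes 1.0."""
    base = error_counts[baseline]
    return {name: count / base for name, count in error_counts.items()}


# Invented raw counts, used only to show the calculation.
raw = {"FRE9_R1": 200, "FRE10_R1": 150}
print(normalise_errors(raw))
```

A normalized value below 1.0 means fewer errors than FRE9_R1 on the same batch; the absolute counts are deliberately hidden by this presentation.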
Step 4. Recognition
Speed Improvements: important notes*
- The speed and accuracy values are meaningful only for comparing ABBYY OCR technologies under these particular conditions and on these particular test batches.
- Please DO NOT USE these numbers as absolute values, or compare them with results of other OCR technologies obtained on different batches!
- Background color keys:
*based on ABBYY internal tests
Step 4. Recognition Speed Comparison FRE 8, 9, 10 modes* *based on ABBYY internal tests
Step 4. Recognition Increased speed for European languages* *based on ABBYY internal tests
Chinese Simplified [Chart: recognition test, Chinese Simplified, Books 79; series: FRE10_R1, FRE9_R1, FRE9_R7]
Step 4. Recognition
Speed improvements through Multi-Core Support* & tuned Profiles
- Built-in multi-core support for multi-page documents
  - Added in V9
  - Improvements in V10
- New V10: New tuned processing profiles increase the overall performance for specific scenarios
  - 2 Sessions tomorrow!
[Chart: recognition performance increase rate for multi-core systems compared to a one-core system; y-axis: rate, times (1.0 to 4.0); x-axis: pages in a document (2 to 20); series: 2 cores, 4 cores]
*based on ABBYY internal tests
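The chart on this slide plots the speed-up rate against the number of pages in a document. A very rough model helps explain why the rate depends on page count: if whole pages are handed to cores and every page takes the same time, a document finishes after `ceil(pages / cores)` rounds. This idealised model is an assumption for illustration, ignores all overhead, and does not reproduce ABBYY's measurements.

```python
import math


def ideal_speedup(pages, cores):
    """Speed-up over one core when equal-cost pages are spread across cores."""
    rounds = math.ceil(pages / cores)  # batches of pages processed in parallel
    return pages / rounds


for pages in (2, 5, 20):
    print(pages, ideal_speedup(pages, 2), ideal_speedup(pages, 4))
```

With only two pages a four-core system cannot do better than a factor of two, while a twenty-page document can keep all four cores busy; real results will be lower because of shared work and overhead.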
Step 6 Synthesis & Export
Step 6. Document Export
New API for PDF Export in FineReader Engine 10

FRE 9.0 PDF Export Parameters (25 parameters):
- Author
- Bw Format
- Color Format
- Creator
- Embed Fonts
- Encryption Info
- Export Mode
- Font Mode
- Gray Format
- Keep Text And Background Color
- Keywords
- Paper Height
- Paper Width
- PDF Version
- Picture Format
- Picture Resolution
- Producer
- Quality
- Replace Uncertain Words With Image
- Running Title Mode
- Set Page Size By Layout Size
- Subject
- Title
- Write Links
- Write Tagged PDF
- MRC Params (READ ONLY)

FRE10 PDF Export Parameters (7 parameters):
- Scenario
- MRC Mode
- PDFA Compliance Mode
- Resolution
- Resolution Type
- Colority
- Text Export Mode
- PDF Features (READ ONLY)
- Picture Compression Params (READ ONLY)

Scenario profiles:
- MAX PDF Quality
- MIN PDF Size
- MAX Export Speed
- Balanced Quality-Size-Speed

Fast and easy adjustment of PDF export, with the ability to set up any or all parameters.

PDF Features:
- Embed Fonts
- Encryption Info
- Meta Data Writing Params
- Paper Size
- PDF Version
- Replace Uncertain Words With Image
- Running Title Mode
- Write Links
- Write Tagged PDF
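The design idea on this slide is "pick a scenario first, then override individual parameters only where needed". The sketch below models that idea with plain Python data classes. The class, field and enum names are hypothetical and only borrow the parameter names from the slide; they are not the FineReader Engine API, and the default scenario is an assumption.

```python
from dataclasses import dataclass, fields
from enum import Enum
from typing import Optional


class Scenario(Enum):
    MAX_QUALITY = "Max Quality"
    BALANCED = "Balanced"
    MIN_SIZE = "Min Size"
    MAX_SPEED = "Max Speed"


@dataclass
class PdfExportParams:
    # Assumption: Balanced is used when no scenario is chosen.
    scenario: Scenario = Scenario.BALANCED
    # None means "keep whatever the chosen scenario implies".
    mrc_mode: Optional[str] = None
    pdfa_compliance_mode: Optional[str] = None
    resolution: Optional[int] = None
    resolution_type: Optional[str] = None
    colority: Optional[str] = None
    text_export_mode: Optional[str] = None

    def overrides(self):
        """Return only the parameters that were set explicitly."""
        return {
            f.name: getattr(self, f.name)
            for f in fields(self)
            if f.name != "scenario" and getattr(self, f.name) is not None
        }


params = PdfExportParams(scenario=Scenario.MIN_SIZE, resolution=150)
print(params.scenario.value, params.overrides())
```

Most callers would set only the scenario; the `overrides()` view shows how few parameters need to be touched compared with the 25 independent settings of the previous API.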
Step 6. Synthesis & Export
2nd Generation of ADRT
- New elements and enhancements over the previous ADRT
  - New elements
  - Overall enhancement of ADRT 1.0 work
- Engine 10 offers a new API to the internal ADRT results
Step 6. Synthesis & Export
New XML Output Formats
- E-book Reader: FB2*, epub*
  - PDFs can be displayed, but the new formats allow much more flexible rendering when switching from portrait to landscape mode
- Libraries: AltoXML*
- Open Document Text format: .odt*
  - ISO Standard, XML based export format
  - More and more often required in public projects
*planned for a Maintenance Release of FRE 10
FineReader Engine 10 Jumpstart Samples and Source Code for Developers
FineReader Engine 10: The must-have SDK!
ABBYY made significant technology optimisations in Engine 10:
- Image Pre-processing: New Binarisation = better OCR = better Results
- Speed Improvements: New Fast Mode, improved Multi-core Support
- Quality Improvements: New mode for low resolution images, improved Fraktur OCR
- New and Improved Language Support
- Improved Document Analysis and ADRT
- New API Calls and Optimised Processing Profiles
- New and Improved Export formats
Any questions? Thank you for your attention!