ABBYY FineReader

FineReader PDF
Developer(s) ABBYY
Initial release July 1993; 31 years ago (1993-07)
Stable release 16.0.13.4766 [1] / 10 November 2022; 22 months ago (2022-11-10)
Operating system Windows, macOS, Linux
Type OCR
License Commercial proprietary software (Retail or volume licensing)
Website pdf.abbyy.com

ABBYY FineReader PDF is an optical character recognition (OCR) application developed by ABBYY. [2] [3] First released in 1993, the program runs on Microsoft Windows (Windows 7 or later) and Apple macOS (10.12 Sierra or later). Since version 15, the Windows version can also edit PDF files. [2]

The program converts image documents (photos, scans, PDF files) and screen captures into editable file formats, including Microsoft Word, Microsoft Excel, Microsoft PowerPoint, Rich Text Format, HTML, PDF/A, searchable PDF, CSV, and plain text (TXT) files. [3] Since version 11, files can be saved in the DjVu format. Since version 15, the program recognizes text in 192 languages and has a built-in spell check for 48 of them.

FineReader can be extended to recognize new characters in several ways. Users can train the program on individual characters, adding them to the recognition alphabet. They can also select characters from a list and add them to the alphabet of a selected language (for example, adding certain Icelandic characters to a German alphabet for a German text describing Iceland). Finally, users can add domain-specific vocabulary to FineReader's built-in lexicon. [4]
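
The effect of extending a language's alphabet can be illustrated with a minimal sketch. It is a conceptual example only and does not use or describe FineReader's actual interface: the names `GERMAN_ALPHABET`, `extend_alphabet`, and `pick_character`, as well as the candidate scores, are hypothetical and chosen purely for illustration. The sketch assumes a recognizer that proposes several scored candidates for each glyph and keeps only those permitted by the active alphabet.

```python
# Hypothetical illustration only; none of these names come from FineReader.
GERMAN_ALPHABET = set(
    "abcdefghijklmnopqrstuvwxyzäöüß"
    "ABCDEFGHIJKLMNOPQRSTUVWXYZÄÖÜ"
)


def extend_alphabet(base, extra_chars):
    """Return a new alphabet containing the base characters plus the extras."""
    return set(base) | set(extra_chars)


def pick_character(candidates, alphabet):
    """Choose the highest-scoring candidate that the alphabet permits.

    `candidates` is a list of (character, score) pairs, such as a recognizer
    might produce for a single glyph. Returns None if no candidate is allowed.
    """
    allowed = [(ch, score) for ch, score in candidates if ch in alphabet]
    if not allowed:
        return None
    return max(allowed, key=lambda pair: pair[1])[0]


# Made-up scores for a glyph that is really the Icelandic letter "ð".
glyph_candidates = [("ð", 0.90), ("d", 0.60), ("o", 0.30)]

german_only = pick_character(glyph_candidates, GERMAN_ALPHABET)
german_plus_icelandic = pick_character(
    glyph_candidates, extend_alphabet(GERMAN_ALPHABET, "ðþæÐÞÆ")
)
print(german_only, german_plus_icelandic)  # prints: d ð
```

With the German alphabet alone, the best permitted match is the visually similar "d"; once the Icelandic characters are added, the correct "ð" becomes available.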

The program also enables users to compare documents, add annotations and comments, and schedule batch processing. [5] [6]

As of 2015, there were more than 20 million users of ABBYY FineReader worldwide. [7] [2] [8] ABBYY licenses the embedded OCR technology to various companies including Fujitsu, Panasonic, Xerox, Plustek, and Samsung. [9] [10]

In February 2020, version 15 of the software was rated "Highest-quality OCR on the market" by PC Magazine. [11]

References

  1. "Release 2 build 16.0.13.4766". ABBYY. New versions added as released.
  2. "Вектор модернизации: обзор обновленного ABBYY FineReader 12" [Modernization vector: a review of the updated ABBYY FineReader 12]. 3DNews - Daily Digital Digest (in Russian). Retrieved 7 August 2024.
  3. "ABBYY FineReader Pro is an unparalleled OCR solution". Engadget. 16 June 2014. Retrieved 30 December 2021.
  4. Sporleder, Caroline; Bosch, Antal van den; Zervanou, Kalliopi (7 July 2011). Language Technology for Cultural Heritage: Selected Papers from the LaTeCH Workshop Series. Springer Science & Business Media. ISBN 978-3-642-20227-8.
  5. Nield, David; DeMuro, Jonas P.; Turner, Brian (11 October 2021). "Best OCR software of 2021: free and paid options". TechRadar. Retrieved 22 December 2021.
  6. Dalton, Will; DeMuro, Jonas P.; Turner, Brian (6 December 2021). "Best scanning software of 2022". TechRadar. Retrieved 22 December 2021.
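  7. "ABBYY выпустила 12 версию своего флагманского продукта FineReader" [ABBYY has released version 12 of its flagship product FineReader] (in Russian). 2015. Archived from the original on 18 May 2015.
  8. Группа компаний ABBYY [ABBYY Group of Companies], 2014 (in Russian).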
  9. Radyuhin, Vladimir (19 January 2008). "IT opportunities and challenges in Russia". The Hindu. Archived from the original on 16 July 2014.
  10. "С технологией ABBYY смартфон Samsung Galaxy S4 распознает текст с фотографий" [With ABBYY technology, the Samsung Galaxy S4 smartphone recognizes text from photos] (in Russian). Archived from the original on 14 May 2015.
  11. Mendelson, Edward (6 February 2020). "ABBYY FineReader Review". PC Magazine. Review updated from time to time.