Developer(s) | ABBYY |
---|---|
Initial release | July 1993 |
Stable release | 16.0.13.4766 [1] / 10 November 2022 |
Operating system | Windows, macOS, Linux |
Type | OCR |
License | Commercial proprietary software (Retail or volume licensing) |
Website | pdf |
ABBYY FineReader PDF is an optical character recognition (OCR) application developed by ABBYY. [2] [3] First released in 1993, the program runs on Microsoft Windows (Windows 7 or later) and Apple macOS (10.12 Sierra or later). Since v15, the Windows version can also edit PDF files. [2]
Users can use the program to convert image documents (photos, scans, PDF files) and screen captures into editable file formats, including Microsoft Word, Microsoft Excel, Microsoft PowerPoint, Rich Text Format, HTML, PDF/A, searchable PDF, CSV and txt (plain text) files. [3] Since Version 11, files can be saved in the DjVu format. Since Version 15, the program recognizes text in 192 languages and has a built-in spell check for 48 of them.
FineReader recognizes new characters in several ways. Users can train the app on characters, adding them to the recognition alphabet. Users can select characters from a list and add them to the alphabet of a selected language (for example, adding certain Icelandic characters to a German alphabet for a German text describing Iceland). Finally, users can add domain-specific vocabulary to the FineReader’s built-in lexicon. [4]
The program also enables users to compare documents, add annotations and comments, and schedule batch processing. [5] [6]
As of 2015 [update] , there were more than 20 million users of ABBYY FineReader worldwide. [7] [2] [8] ABBYY licenses the embedded OCR technology to various companies including Fujitsu, Panasonic, Xerox, Plustek, and Samsung. [9] [10]
In February 2020, version 15 of the software was rated "Highest-quality OCR on the market" by PC Magazine . [11]
Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo or from subtitle text superimposed on an image.
Mojibake is the garbled or gibberish text that is the result of text being decoded using an unintended character encoding. The result is a systematic replacement of symbols with completely unrelated ones, often from a different writing system.
Adobe Acrobat is a family of application software and Web services developed by Adobe Inc. to view, create, manipulate, print and manage Portable Document Format (PDF) files.
The Bat! is an email client for the Microsoft Windows operating system, developed by Moldovan software company Ritlabs. It is sold as shareware and offered in three editions: Home Edition, Professional Edition, and Voyager which is a portable version and is included with Professional Edition.
Microsoft OneNote is a note-taking software developed by Microsoft. It is available as part of the Microsoft 365 suite and since 2014 has been free on all platforms outside the suite. OneNote is designed for free-form information gathering and multi-user collaboration. It gathers users' notes, drawings, screen clippings, and audio commentaries. Notes can be shared with other OneNote users over the Internet or a network.
capella is a musical notation program or scorewriter developed by the German company capella-software AG, running on Microsoft Windows or corresponding emulators in other operating systems, like Wine on Linux and others on Apple Macintosh. Capella requires to be activated after a trial period of 30 days. The publisher writes the name in lower case letters only. The program was initially created by Hartmut Ring, and is now maintained and developed by Bernd Jungmann.
Microsoft Reader is a discontinued Microsoft application for reading e-books, first released in August 2000, that used its own .LIT format. It was available for Windows computers and Pocket PC PDAs. The name was also used later for an unrelated application for reading PDF and XPS files, first released with Windows 8 - this app was discontinued in 2018.
Evernote is a note-taking and task-management application developed by the Evernote Corporation. It is intended for archiving and creating notes with embedded photos, audio, and saved web content. Notes are stored in virtual "notebooks" and can be tagged, annotated, edited, searched, and exported.
PaperPort is commercial document management software published by Kofax, used for working with scanned documents. It uses a built-in optical character recognition to create files in searchable Portable Document Format (PDF); text in these files is indexed and can be searched for with appropriate software, such as Microsoft's Windows Search. Earlier versions of PaperPort used OmniPage to provide this function. It provides image editing tools for these files.
PostScript fonts are font files encoded in outline font specifications developed by Adobe Systems for professional digital typesetting. This system uses PostScript file format to encode font information.
The following is a comparison of e-book formats used to create and publish e-books.
This comparison of optical character recognition software includes:
STDU Viewer is computer software, a compact viewer for many computer file formats: Portable Document Format (PDF), World Wide Fund for Nature (WWF), DjVu, comic book archive, FB2, ePUB, XML Paper Specification (XPS), Text Compression for Reader (TCR), Mobipocket (MOBI), AZW, multi-page TIFF, text file (TXT), PalmDoc (PDB), Windows Metafile (EMF), Windows Metafile (WMF), bitmap (BMP), Graphics Interchange Format (GIF), JPEG-JPG, Portable Network Graphics (PNG), Photoshop Document (PSD), PiCture eXchange (PCX-DCX). It works under Microsoft Windows, and is free for non-commercial use.
Nota Bene is an integrated software suite of applications, including word processing, reference management, and document text analysis software that is focused on writers and scholars in the Humanities, Social Sciences, and the Arts. The integrated suite is referred to as the Nota Bene Workstation. It runs on Microsoft Windows and Macintosh.
OCRFeeder is an optical character recognition suite for GNOME, which also supports virtually any command-line OCR engine, such as CuneiForm, GOCR, Ocrad and Tesseract. It converts paper documents to digital document files and can serve to make them accessible to visually impaired users.
Microsoft Office shared tools are software components that are included in all Microsoft Office products.
Solid PDF Tools is a document reconstruction software product which allows users to convert PDFs into editable documents and create PDFs from a variety of file sources. The same technology used in the software's Solid Framework SDK is licensed by Adobe for Acrobat X
Asprise OCR is a commercial optical character recognition and barcode recognition SDK library that provides an API to recognize text as well as barcodes from images and output in formats like plain text, XML and searchable PDF.
ABBYY is an American technology company specializing in document processing, data capture, process mining and optical character recognition (OCR). Primarily focused on software as a service model, the company serves clients worldwide. One of ABBYY's best-known products is ABBYY FineReader, an optical character recognition (OCR) computer program.