OmniPage

Developer(s) Kofax
Stable release 19.2 (Windows platform), X (Macintosh platform)
Operating system
Type OCR
License Commercial proprietary software (Retail or volume licensing)
Website www.kofax.com/Products/omnipage

OmniPage is an optical character recognition (OCR) application available from Kofax Incorporated.

OmniPage was one of the first OCR programs to run on personal computers. [1] It was developed in the late 1980s and sold by Caere Corporation, a company headed by Robert Noyce. The original developers were Philip Bernzott, John Dilworth, David George, Bryan Higgins, and Jeremy Knight. [2] [3] [4] Caere was acquired by ScanSoft in 2000. [5] ScanSoft acquired Nuance Communications in 2005, and took over its name. [6] By 2019 OmniPage had been sold to Kofax Inc. [7]

OmniPage supports more than 120 different languages. [8] OmniPage provides software development kits for integrating OCR functionality into other applications, such as Microsoft Office Document Imaging and UiPath. [9] [10]

Related Research Articles

Optical character recognition: Computer recognition of visual text

Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image.

Handwriting recognition: Ability of a computer to receive and interpret intelligible handwritten input

Handwriting recognition (HWR), also known as handwritten text recognition (HTR), is the ability of a computer to receive and interpret intelligible handwritten input from sources such as paper documents, photographs, touch-screens and other devices. The image of the written text may be sensed "off line" from a piece of paper by optical scanning or intelligent word recognition. Alternatively, the movements of the pen tip may be sensed "on line", for example by a pen-based computer screen surface, a generally easier task as there are more clues available. A handwriting recognition system handles formatting, performs correct segmentation into characters, and finds the most plausible words.

Magnetic ink character recognition code, known in short as MICR code, is a character recognition technology used mainly by the banking industry to streamline the processing and clearance of cheques and other documents. MICR encoding, called the MICR line, is at the bottom of cheques and other vouchers and typically includes the document-type indicator, bank code, bank account number, cheque number, cheque amount and a control indicator. The format for the bank code and bank account number is country-specific.

Nuance Communications, Inc. is an American multinational computer software technology corporation, headquartered in Burlington, Massachusetts, that markets speech recognition and artificial intelligence software.

MacSpeech: Speech recognition etc. software company

MacSpeech, Inc. was a New Hampshire-based technology company that produced software-based speech recognition and voice dictation solutions for the Apple ecosystem. The company's products included iListen, MacSpeech Dictate, MacSpeech Dictate Medical, MacSpeech Dictate Legal, MacSpeech Dictate International, and MacSpeech Scribe. On February 12, 2010, Nuance Communications, Inc. acquired MacSpeech.

VueScan

VueScan is a computer program for image scanning, especially of photographs, including negatives. It supports optical character recognition (OCR) of text documents. The software can be downloaded and used free of charge, but adds a watermark on scans until a license is purchased.

Dragon NaturallySpeaking: Speech recognition software package

Dragon NaturallySpeaking is a speech recognition software package developed by Dragon Systems of Newton, Massachusetts, which was acquired in turn by Lernout & Hauspie Speech Products, Nuance Communications, and Microsoft. It runs on Windows personal computers. Version 15, which supports 32-bit and 64-bit editions of Windows 7, 8 and 10, was released in August 2016.

Kofax: American intelligent automation software provider

Kofax Inc. is an Irvine, California-based intelligent automation software provider. Founded in 1985, the company's software allows businesses to automate and improve business workflows by simplifying the handling of data and documents.

Intelligent character recognition (ICR), also referred to as intelligent OCR, is used to extract handwritten text from images. It is a more sophisticated type of OCR technology that recognizes different handwriting styles and fonts to intelligently interpret data on forms and physical documents.

PaperPort is commercial document management software published by Kofax, used for working with scanned documents. It uses built-in optical character recognition to create files in searchable Portable Document Format (PDF); text in these files is indexed and can be searched for with appropriate software, such as Microsoft's Windows Search. Earlier versions of PaperPort used OmniPage to provide this function. It also provides image editing tools for these files.

ABBYY FineReader PDF is an optical character recognition (OCR) application developed by ABBYY, with support for PDF file editing since v15. The program runs under Microsoft Windows 7 or later, and Apple macOS 10.12 Sierra or later. The first version was released in 1993.

OCR-A: Typeface designed for early computer OCR

OCR-A is a font issued in 1966 and first implemented in 1968. A special font was needed in the early days of computer optical character recognition, when there was a need for a font that could be recognized not only by the computers of that day, but also by humans. OCR-A uses simple, thick strokes to form recognizable characters. The font is monospaced (fixed-width), with the printer required to place glyphs 0.254 cm apart, and the reader required to accept any spacing between 0.2286 cm and 0.4572 cm.

This comparison of optical character recognition software includes:

Forms processing is a process by which one can capture information entered into data fields and convert it into an electronic format. This can be done manually or automatically, but the general process is that hard copy data is filled out by humans and then "captured" from their respective fields and entered into a database or other electronic format.

OCR-B: Typeface

OCR-B is a monospace font developed in 1968 by Adrian Frutiger for Monotype by following the European Computer Manufacturers Association standard. Its function was to facilitate optical character recognition operations by specific electronic devices, originally for financial and bank-oriented uses. It was accepted as the world standard in 1973. It follows the ISO 1073-2:1976 (E) standard, refined in 1979. It includes all ASCII symbols, and other symbols needed in the bank environment. It is widely used for the human-readable digits in UPC/EAN barcodes. It is also used for machine-readable passports. It shares that purpose with OCR-A, but it is easier for the human eye and brain to read and it has a less technical look than OCR-A.

Solid Converter PDF

Solid Converter PDF is document reconstruction software from Solid Documents which converts PDF files to editable formats. Originally released for the Microsoft Windows operating system, a Mac OS X version was released in 2010. The current versions are Solid Converter PDF 9.0 for Windows and Solid PDF to Word for Mac 2.1. The same technology used by the product's Solid Framework SDK is licensed by Adobe for Acrobat X.

Microsoft Office shared tools are software components that are included in all Microsoft Office products.

Solid PDF Tools

Solid PDF Tools is a document reconstruction software product which allows users to convert PDFs into editable documents and create PDFs from a variety of file sources. The same technology used in the software's Solid Framework SDK is licensed by Adobe for Acrobat X.

OCR Systems: American computing company

OCR Systems, Inc., was an American computer hardware manufacturer and software publisher dedicated to optical character recognition technologies. The company's first product, the System 1000 in 1970, was used by numerous large corporations for bill processing and mail sorting. Following a series of pitfalls in the 1970s and early 1980s, founder Theodor Herzl Levine put the company in the hands of Gregory Boleslavsky and Vadim Brikman, the company's vice presidents and recent immigrants from the Soviet Ukraine, who were able to turn OCR Systems' fortunes around and expand its employee base. The company released the software-based OCR application ReadRight for DOS, later ported to Windows, in the late 1980s. Adobe Inc. bought the company in 1992.

References

  1. Markoff, John (August 17, 1988). "Now, PC's That Read A Page and Store It". New York Times. Retrieved March 29, 2011.
  2. US 5131053, Bernzott, Philip; Dilworth, John & George, David et al., "Optical character recognition method and apparatus", issued Jul 14, 1992.
  3. US 5278918, Bernzott, Philip; Dilworth, John & George, David et al., "Optical character recognition method and apparatus using context analysis", issued Jan 11, 1994.
  4. US 6038342, Bernzott, Philip; Dilworth, John & George, David et al., "Optical character recognition method and apparatus", issued Mar 14, 2000.
  5. Junnarkar, Sandeep (January 18, 2000). "ScanSoft buys rival Caere for more than $140 million". CNET News. CBS Interactive. Retrieved March 30, 2011.
  6. Nuance Press Releases – ScanSoft Changes Name to Nuance Communications, Inc.
  7. Kofax. "OmniPage Family". www.kofax.com.
  8. Nuance Communications (September 2010). "OmniPage". Retrieved 23 September 2010.
  9. "Microsoft Announces New Office Document Imaging Tool Included in Office XP". Stories. 2001-05-31. Retrieved 2023-05-30.
  10. docs.uipath.com. https://docs.uipath.com/activities/other/latest/user-guide/omnipage-ocr (retrieved 2023-05-30).