OCR Systems

Last updated
OCR Systems, Inc.
TypePrivate
IndustryComputers
Founded1969;53 years ago (1969) in Bensalem Township, Pennsylvania, United States
FounderTheodor Herzl Levine
DefunctJuly 1992;30 years ago (1992-07)
FateAcquired by Adobe Inc.
Headquarters Huntingdon Valley, Pennsylvania, United States
Key people
  • Gregory Boleslavsky, VP
  • Vadim Brikman, VP
Products
  • System 1000
  • ReadRight
Number of employees
30+ (1988)

OCR Systems, Inc., was an American computer hardware manufacturer and software publisher dedicated to optical character recognition technologies. The company's first product, the System 1000 in 1970, was used by numerous large corporations for bill processing and mail sorting. Following a series of pitfalls in the 1970s and early 1980s, founder Theodor Herzl Levine put the company in the hands of Gregory Boleslavsky and Vadim Brikman, the company's vice presidents and recent immigrants from the Soviet Ukraine, who were able to turn OCR System's fortunes around and expand its employee base. The company released the software-based OCR application ReadRight for DOS, later ported to Windows, in the late 1980s. Adobe Inc. bought the company in 1992.

Contents

History

OCR Systems was co-founded by Theodor Herzl Levine (c. 1923 – May 30, 2005). Levine served in the U.S. Army Signal Corps during World War II in the Solomon Islands, where he helped develop a sonar to find ejected pilots in the ocean. After the war, Levine spent 22 years at the University of Pennsylvania, earning his bachelor's degree in 1951, his master's degree in electrical engineering in 1957, and his doctorate in 1968. [1] Alongside his studies, Levine taught statistics and calculus at Temple University, Rutgers University, La Salle University and Penn State Abington. [1] Sometime in the 1960s, Levine was hired at Philco. He and two of his co-workers decided to form their own company dedicated to optical character recognition, founding OCR Systems in 1969 in Bensalem, Pennsylvania. [2]

OCR Systems's first product, the System 1000, was announced in 1970. [3] OCR Systems entered a partnership with 3M to resell the System 1000 throughout the United States in March 1973. This was 3M's entry into the data entry field, managed by the company's Microfilm Products Division and accompanying 3M's suite of data retrieval systems. [4] It soon found use among Texas Instruments, AT&T, Ricoh, Panasonic and Canon for bill processing and mail sorting. [2] Later in the mid-1970s an unspecified Fortune 500 company reneged on a contract to distribute the System 1000; later still a Canadian company distributing the System 1000 in Canada went defunct. Both incidents led OCR Systems to go nearly bankrupt, although it eventually recovered. [5]

By the early 1980s, however, the company was almost insolvent. In 1983 Levine had only $8,000 in his savings and became bedridden with an illness. He left the company in the hands of Gregory Boleslavsky and Vadim Brikman, two Soviet Ukraine expats whom Levine had hired earlier in the 1980s. Boleslavsky was hired as a wire wrapper for the System 1000 and as a programmer and beta tester for ReadRight [5] —a software package developed by Levine implementing patents from Nonlinear Technology, another OCR-centric company from Greenbelt, Maryland. [6] Boleslavsky in turn recommended Brikman to Levine. The two soon became vice presidents of the company while Levine was bedridden; in Boleslavsky's case, he worked 14-hour work days for over half a year in pursuit of the title. The two presented OCR Systems' products to the National Computer Conference in Chicago, where they were massively popular. The company soon gained such clients as Allegheny Energy in Pennsylvania and the postal service of Belgium and received an influx of employees—mostly expats from Russia but also Poland and South Korea, as well as American-born workers. [5] To accommodate the company's employee base, which had grown to over 30 in 1988, [2] Levine moved OCR System's headquarters from Bensalem to the Masons Mill Business Park in Bryn Athyn. [7]

Chinon Industries of Japan signed an agreement with OCR Systems in 1987 to distribute OCR's ReadRight 1.0 software with Chinon's scanners, starting with their N-205 overhead scanner. [8] In 1988, OCR opened their agreement to distribute ReadRight to other scanner manufacturers, including Canon, Hewlett-Packard, Skyworld, Taxan, Diamond Flower and Abaton. [9] That year, the company posted a revenue of $3 million. [5] OCR Systems extended their agreement with Chinon in 1989 and introduced version 2.0 of ReadRight. [10]

OCR Systems faced stiff competition in the software OCR market in the turn of the 1990s. [2] The Toronto-based software firm Delrina signed a letter of intent to purchase the company in November 1991, expecting the deal to close in December and have OCR software available by Christmas. [11] OCR was to receive $3 million worth of Delrina shares in a stock swap, but the deal collapsed in January 1992. [12] Delrine later marketed its own Extended Character Recognition, or XCR, software package to compete with ReadRight. [13] In July 1992, OCR Systems was purchased by Adobe Inc. for an undisclosed sum. [14]

Products

System 1000

The System 1000 was based on the 16-bit Varian Data 620/i minicomputer with 4 KB of core memory. The system used the 620/i for controlling the paper feed, interpreting the format of the documents, the optical character recognition process itself, error detection, sequencing and output. [15] The System was initially programmed to recognize 1428 OCR (used by Selectrics); IBM 407 print; and the full character sets of OCR-A, OCR-B and Farrington 7B; as well as optical marks and handwritten numbers. OCR Systems promised added compatibility with more fonts available down the line—per request—in 1970. [16] The number of fonts supported was limited by the amount of core memory, which was expandable in 4 KB increments up to 32 KB. [17] The System 1000 later supported generalized typewriter and photocopier fonts. [18]

The rest of the System 1000 comprised the document transport, one or more scanner elements, a CRT display and a Teletype Model 33 or 35. [19] Pages are fed via friction with a rubber belt. [3] Up to three lines could be scanned per document, while the rest of the scanned document could be laid out in any manner granted there was enough space around the fields to be read. The reader initially supported pages as small as 3.25 in by 3.5 in dimension (later supporting 2.6 in by 3.5 in utility cash stubs) all the way to the standard ANSI letter size (8.5 in by 11 in; later 8.5 in by 12 in as used in stock certificates). [20] The initial System 1000 had a maximum throughput of 420 documents per minute per transport (later 500 documents per minute), contingent on document size and content. [21]

A feature unique to the System 1000 over other optical character recognition systems of the time was its ability to alert the operator when a field was unreadable or otherwise invalid. [16] This feature, called Document Referral, placed the document in front of the operator and displayed a blank field on the screen of the included CRT monitor for manual re-entry via keyboard. Once input, data could be output to 7- or 9-track tape, paper tape, punched cards and other mass storage media or to System/360 mainframes for further processing. [19]

The complete System 1000 could be purchased for US$69,000. Options for renting were $1,800 per month on a three-year lease or $1,600 per month for five years. [3] Computerworld wrote that it was less than half the cost of its competitors while more capable and user-friendly. Competing systems included the Recognition Equipment Retina, the Scan-Optics IC/20 and the Scan-Data 250/350. [19]

ReadRight

ReadRight processes individual letters topographically: it breaks down the scanned letter into parts—strokes, curves, angles, ascenders and descenders—and follows a tree structure of letters broken down into these parts to determine the corresponding character code. [22] ReadRight was entirely software-based, requiring no expansion card to work. [23] Version 2.01, the last version released for DOS, [24] runs in real mode in under 640 KB of RAM. [25] OCR Systems released the Windows-only version 3.0 in 1991 while offering version 2.01 alongside it. [26] The company unveiled a sister product, ReadRight Personal, dedicated to handheld scanners and for Windows only in October 1991. [27] This version adds real-time scanning—each word is updated to the screen while lines are being scanned. [28] ReadRight proper was later made a Windows-only product with version 3.1 in 1992. [29]

The inclusion of ReadRight 2.0 with Canon's IX-12F flatbed scanner led PC Magazine to award it an Editor's Choice rating in 1989. [30] Despite this, reviewer Robert Kendall found qualification with ReadRight's ability to parse proportional typefaces such as Helvetica and Times New Roman. [31] Mitt Jones of the same publication found version 2.01 to have improved its ability to read such typefaces and praised its ease of use and low resource intensiveness. [30] Jones disliked the inability to handle uneven page paragraph column widths and graphics, noting that the manual recommended the user block out graphics with a Post-it Note. [32]

Version 3.1 for Windows received mixed reviews. Mike Heck of InfoWorld wrote that its "low cost and rich collection of features are hard to ignore" but rated its speed and accuracy average. [23] Barry Simon of PC Magazine called it economical but inaccurate, unable to correct errors it did not detect, and found its spellchecker flawed and its speed lacking compared to Calera's WordScan Plus. [33] Gary Berline of the same publication wrote that "ReadRight produced serviceable accuracy on clean files with simple layouts, but at a less than sprightly pace", finding it unable to process small type and multicolumn text with small margins between columns. [29] The software also regularly interpreted graphical illustrations as text in his experience. [34] OCR Systems announced a follow-up release promising to correcting these issues in July 1992, which never came to fruition on account of Adobe buying the company. [35]

Citations

  1. 1 2 University of Pennsylvania 2005.
  2. 1 2 3 4 Sims 2005, p. B9.
  3. 1 2 3 Staff writer 1970a, p. 111.
  4. Staff writer 1973a, p. 148; Staff writer 1973c, p. 45.
  5. 1 2 3 4 Perfidio 1989, p. H4.
  6. Perfidio 1989, p. H4; Cauley 1992.
  7. Giles 1988, p. H23.
  8. Staff writer 1987, p. 35.
  9. Staff writer 1988, p. 102.
  10. Endrijonas 1989.
  11. Leitch 1991, p. B2.
  12. Staff writer 1992, p. B9.
  13. Staff writer 2001, p. 13.
  14. Cauley 1992, p. 3B.
  15. Staff writer 1970c, p. 26.
  16. 1 2 Staff writer 1971, p. 62.
  17. Staff writer 1971, p. 62; Staff writer 1970a, p. 111; Staff writer 1970b, p. 24.
  18. Staff writer 1973b, p. 61.
  19. 1 2 3 Staff writer 1970b, p. 24.
  20. Staff writer 1970b, p. 24; Staff writer 1970a, p. 111; Staff writer 1971, p. 62.
  21. Staff writer 1970a, p. 111; Staff writer 1971, p. 62.
  22. Garza et al. 1990, p. 74.
  23. 1 2 Heck 1991, p. 51.
  24. Nakamura 1990, p. 17.
  25. Garza et al. 1990, p. 78.
  26. Simon 1991, p. 52; Graggs 1991, p. 17.
  27. Graggs 1991, p. 17.
  28. Waters 1991, p. 16.
  29. 1 2 Jones 1992, p. 287.
  30. 1 2 Grunin 1990, p. 337.
  31. Stanton 1989, p. 205.
  32. Grunin 1990, p. 338.
  33. Simon 1991, p. 52.
  34. Jones 1992, pp. 287–288.
  35. Jones 1992, p. 288.

Related Research Articles

Optical character recognition Computer recognition of visual text

Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image.

Image scanner Device that optically scans images, printed text

An image scanner—often abbreviated to just scanner—is a device that optically scans images, printed text, handwriting or an object and converts it to a digital image. Commonly used in offices are variations of the desktop flatbed scanner where the document is placed on a glass window for scanning. Hand-held scanners, where the device is moved by hand, have evolved from text scanning "wands" to 3D scanners used for industrial design, reverse engineering, test and measurement, orthotics, gaming and other applications. Mechanically driven scanners that move the document are typically used for large-format documents, where a flatbed design would be impractical.

Optical mark recognition is the process of reading information that people mark on surveys, tests and other paper documents.

Data entry clerk

A data entry clerk is a member of staff employed to enter or update data into a computer system. Data is often entered into a computer from paper documents using a keyboard. The keyboards used can often have special keys and multiple colors to help in the task and speed up the work. Proper ergonomics at the workstation is a common topic considered.

VueScan

VueScan is a computer program for image scanning, especially of photographs, including negatives. It supports optical character recognition (OCR) of text documents. The software can be downloaded and used free of charge, but adds a watermark on scans until a license is purchased.

Delrina Canadian software company founded in 1988

Delrina was an electronic form company in Canada that was acquired by the American software firm Symantec in 1995. The company was best known for WinFax, a software package which enabled computers equipped with fax modems to transmit copies of documents to standalone fax machines or other similarly equipped computers. It also sold PerForm and FormFlow.

WinFax is a discontinued Microsoft Windows-based software product designed to let computers equipped with fax-modems communicate directly to stand-alone fax machines, or other similarly equipped computers.

SmartScore X2 is a music OCR and scorewriter program, developed, published and distributed by Musitek Corporation based in Ojai, California.

CommSuite 95 was a communications software suite of products launched by the Canadian software company Delrina in late 1995.

PaperPort is commercial document management software published by Kofax, used for working with scanned documents. It uses a built-in optical character recognition to create files in searchable Portable Document Format (PDF); text in these files is indexed and can be searched for with appropriate software, such as Microsoft's Windows Search. Earlier versions of PaperPort used OmniPage to provide this function. It provides image editing tools for these files.

Microtek

Microtek International Inc. is a Taiwan-based multinational manufacturer of digital imaging products and other consumer electronics. It produces imaging equipment for medical, biological and industrial fields. It occupies 20 percent of the global imaging market and holds 450 patents worldwide.

TeleForm is a forms processing application originally developed by Cardiff Software, but now owned by the company OpenText.

A paperless office is a work environment in which the use of paper is eliminated or greatly reduced. This is done by converting documents and other papers into digital form, a process known as digitization. Proponents claim that "going paperless" can save money, boost productivity, save space, make documentation and information sharing easier, keep personal information more secure, and help the environment. The concept can be extended to communications outside the office as well.

Image translation is the machine translation of images of printed text. This is done by applying optical character recognition (OCR) technology to an image to extract any text contained in the image, and then have this text translated into a language of their choice, and the applying digital image processing on the original image to get the translated image with a new language.

Scan-Optics LLC, founded in 1968, is an enterprise content management services company and optical character recognition (OCR) and image scanner manufacturer headquartered in Manchester, Connecticut.

Dr. Halo

Dr. Halo is a raster graphics editor developed by Media Cybernetics and released for computers running MS-DOS. It was among the first graphics editors available for MS-DOS with its initial release in 1984. Media Cybernetics boasted about three million users of Dr. Halo between 1984 and 1993.

IBM PCradio Notebook computer released in 1991

The PCradio was a notebook computer released by International Business Machines (IBM) in late 1991. Designed primarily for mobile workers such as service technicians, salespersons and public safety workers, the PCradio featured a ruggedized build with no internal hard disk drive and was optioned with either a cellular or ARDIS RF modem, in addition to a standard landline modem.

Canon Computer Systems American subsidiary (1992–2001)

Canon Computer Systems, Inc. (CCSI), sometimes shortened to Canon Computer, was an American subsidiary of Canon Inc. formed in 1992 to develop and market the parent company's personal computers and workstations. The subsidiary also assumed the responsibility of marketing Canon's printers and photocopiers, which were formerly sold by other Canon divisions. It went defunct in January 2001.

DTK Computer

DTK Computer is the name for international branches of Datatech Enterprises, a Taiwanese computer manufacturer. Founded in 1981, the company was an early supplier of peripherals for IBM PCs as well as PC compatible motherboards. In the late 1980s, the company switched to developing complete systems under the DTK name as well as serving as an OEM for motherboards and cases, as bought by other small computer companies and systems integrators. The company was little-known in its own time but performed well in the marketplace. DTK was the 10th and 11th biggest personal computer manufacturer in the world in 1991 and 1992 respectively, according to Electronics magazine.

Advanced Logic Research American computer company

Advanced Logic Research, Inc. (ALR), was an American computer company founded in 1984 in Irvine, California by Gene Lu. The company marketed IBM PC compatibles across that standard's evolution until 1997, when it was acquired by Gateway 2000. ALR had a reputation for beating its larger competitors to market with compatibles featuring cutting-edge technologies but struggled with brand recognition among the fiercely competitive market of low-end PCs in the mid-1990s. According to computer journalist and collector Michael Nadeau, "ALR's business strategy was to be the first to market with the latest and fastest possible PC-compatible designs", a strategy that "often succeeded".

References