Data processing

Data processing is the collection and manipulation of digital data to produce meaningful information. [1] Data processing is a form of information processing, which is the modification (processing) of information in any manner detectable by an observer. [note 1]

Functions

Data processing may involve various processes, including:

History

The history of the United States Census Bureau illustrates the evolution of data processing from manual through electronic procedures.

Manual data processing

Although widespread use of the term data processing dates only from the 1950s, [2] data processing functions have been performed manually for millennia. For example, bookkeeping involves functions such as posting transactions and producing reports like the balance sheet and the cash flow statement. Completely manual methods were augmented by the application of mechanical or electronic calculators. A person whose job was to perform calculations manually or using a calculator was called a "computer."

The 1890 United States Census schedule was the first to gather data by individual rather than household. A number of questions could be answered by making a check in the appropriate box on the form. From 1850 to 1880 the Census Bureau employed "a system of tallying, which, by reason of the increasing number of combinations of classifications required, became increasingly complex. Only a limited number of combinations could be recorded in one tally, so it was necessary to handle the schedules 5 or 6 times, for as many independent tallies." [3] "It took over 7 years to publish the results of the 1880 census" [4] using manual processing methods.

Automatic data processing

The term automatic data processing was applied to operations performed by means of unit record equipment, such as Herman Hollerith's application of punched card equipment for the 1890 United States Census. "Using Hollerith's punchcard equipment, the Census Office was able to complete tabulating most of the 1890 census data in 2 to 3 years, compared with 7 to 8 years for the 1880 census. It is estimated that using Hollerith's system saved some $5 million in processing costs" [4] in 1890 dollars even though there were twice as many questions as in 1880.

Computerized data processing

Computerized data processing, or electronic data processing, represents a later development, with a computer used instead of several independent pieces of equipment. The Census Bureau first made limited use of electronic computers for the 1950 United States Census, using a UNIVAC I system, [3] delivered in 1952.

Other developments

The term data processing has mostly been subsumed by the more general term information technology (IT). [5] The older term "data processing" is suggestive of older technologies. For example, in 1996 the Data Processing Management Association (DPMA) changed its name to the Association of Information Technology Professionals. Nevertheless, the terms are approximately synonymous.

Applications

Commercial data processing

Commercial data processing involves a large volume of input data, relatively few computational operations, and a large volume of output. For example, an insurance company needs to keep records on tens or hundreds of thousands of policies, print and mail bills, and receive and post payments.
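The insurance example can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Policy` record, its field names, the bill format, and the sample data are all hypothetical and do not describe any real billing system. It shows the typical shape of commercial data processing, in which every record is touched, but the arithmetic per record is a single addition or subtraction.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class Policy:
    # All fields are hypothetical, chosen only for this illustration.
    policy_id: str
    holder: str
    premium_due: Decimal                 # amount billed each cycle
    balance: Decimal = Decimal("0.00")   # amount currently owed


def run_billing_cycle(policies):
    """Add each policy's premium to its balance and return one bill line per policy."""
    bills = []
    for p in policies:
        p.balance += p.premium_due
        bills.append(f"{p.policy_id} {p.holder}: amount due {p.balance}")
    return bills


def post_payments(policies, payments):
    """Subtract each (policy_id, amount) payment from the matching policy's balance."""
    by_id = {p.policy_id: p for p in policies}
    for policy_id, amount in payments:
        by_id[policy_id].balance -= amount


policies = [
    Policy("P-001", "A. Rivera", Decimal("120.00")),
    Policy("P-002", "B. Chen", Decimal("85.50")),
]
bills = run_billing_cycle(policies)                       # "print and mail bills"
post_payments(policies, [("P-001", Decimal("120.00"))])   # "receive and post payments"
```

With tens or hundreds of thousands of policies the loop bodies stay the same; only the volume of input and output grows.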

Data analysis

In science and engineering, the terms data processing and information systems are considered too broad. There, data processing typically refers to the initial stage of overall data handling, which is followed by data analysis as the second stage.

Data analysis uses specialized algorithms and statistical calculations that are less often observed in a typical general business environment. For data analysis, software suites like SPSS or SAS, or their free counterparts such as DAP, gretl, or PSPP, are often used.
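The two-stage split can be illustrated with a small Python sketch. It is not tied to any of the packages named above; the raw readings, the function names `process` and `analyze`, and the choice of statistics are assumptions made for this example. The first stage turns raw text into usable numbers, and the second stage performs the statistical calculation.

```python
import statistics

# Hypothetical raw instrument log, including a blank and a non-numeric marker.
raw_readings = ["20.1", "19.8", "", "20.4", "n/a", "19.9"]


def process(readings):
    """Stage 1 (data processing): parse the raw strings and drop unusable entries."""
    cleaned = []
    for text in readings:
        try:
            cleaned.append(float(text))
        except ValueError:
            continue  # skip blanks and non-numeric markers
    return cleaned


def analyze(values):
    """Stage 2 (data analysis): compute summary statistics on the cleaned values."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
    }


values = process(raw_readings)
summary = analyze(values)
```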

Systems

A data processing system is a combination of machines, people, and processes that, for a set of inputs, produces a defined set of outputs. The inputs and outputs are interpreted as data, facts, information, etc., depending on the interpreter's relation to the system.

A term commonly used synonymously with data or storage (codes) processing system is information system. [6] With regard to electronic data processing in particular, the corresponding concept is called an electronic data processing system.

Examples

Simple example

A very simple example of a data processing system is the process of maintaining a check register. Transactions (checks and deposits) are recorded as they occur, and the transactions are summarized to determine a current balance. Each month, the data recorded in the register is reconciled with a list of transactions processed by the bank, which should be identical.

A more sophisticated record-keeping system might further classify the transactions, for example deposits by source or checks by type, such as charitable contributions. This classification might be used to obtain information like the total of all contributions for the year.

The important thing about this example is that it is a system in which all transactions are recorded consistently and the same method of bank reconciliation is used each time.
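The check register described above can be sketched in Python. This is a minimal illustration, not a prescribed method: the entry layout, the category labels, the sign convention (deposits positive, checks negative), and the sample figures are all assumptions. In particular, `reconcile` matches entries by amount only, which is a simplification of real bank reconciliation.

```python
from collections import defaultdict
from decimal import Decimal

# Each register entry: (description, category, amount).
# Deposits are positive and checks are negative (an assumed convention).
register = [
    ("Paycheck", "deposit:salary", Decimal("1500.00")),
    ("Check 101, rent", "check:housing", Decimal("-900.00")),
    ("Check 102, food bank", "check:charity", Decimal("-50.00")),
    ("Check 103, animal shelter", "check:charity", Decimal("-25.00")),
]


def current_balance(entries, opening=Decimal("0.00")):
    """Summarize the recorded transactions into a current balance."""
    return opening + sum((amount for _, _, amount in entries), Decimal("0.00"))


def totals_by_category(entries):
    """Total the amounts in each category, e.g. all charitable contributions."""
    totals = defaultdict(Decimal)
    for _, category, amount in entries:
        totals[category] += amount
    return dict(totals)


def reconcile(entries, bank_amounts):
    """Return amounts in the register but not at the bank, and vice versa."""
    unmatched_bank = list(bank_amounts)
    missing_from_bank = []
    for _, _, amount in entries:
        if amount in unmatched_bank:
            unmatched_bank.remove(amount)      # matched; consume one bank entry
        else:
            missing_from_bank.append(amount)   # recorded, but not yet at the bank
    return missing_from_bank, unmatched_bank


balance = current_balance(register)
charity_total = totals_by_category(register)["check:charity"]
bank_statement = [Decimal("1500.00"), Decimal("-900.00"), Decimal("-50.00")]
not_cleared, unknown_to_register = reconcile(register, bank_statement)
```

Because every entry has the same three fields and the same functions are applied each month, the sketch reflects the point of the example: consistency of recording and of method is what makes it a system.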

Real-world example

Figure (Stockbridge system flowchart example): a flowchart of a data processing system combining manual and computerized processing to handle accounts receivable, billing, and general ledger.

Notes

  1. Data processing is distinct from word processing, which is manipulation of text specifically rather than data generally. "data processing". Webopedia. September 1996. Retrieved June 24, 2013.

References

  1. French, Carl (1996). Data Processing and Information Technology (10th ed.). Thomson. p. 2. ISBN 1844801004.
  2. Google Ngram Viewer. Retrieved June 26, 2013.
  3. Truesdell, Leon E. (1965). The development of punch card tabulation in the Bureau of the Census, 1890. United States Department of Commerce.
  4. Bohme, Frederick; Wyatt, J. Paul; Curry, James P. (1991). 100 Years of Data Processing: The Punchcard Century. United States Bureau of the Census.
  5. Google Ngram Viewer. Retrieved April 28, 2018.
  6. Ralston, Anthony; et al., eds. (2000). Encyclopedia of Computer Science (4th ed.). Nature Publishing Group. p. 865.

Further reading