Symbolic Optimal Assembly Program

Last updated
SOAP II Coding form, 1957 SOAPII CodingForm.png
SOAP II Coding form, 1957

The Symbolic Optimal Assembly Program (SOAP) is an assembler for the IBM 650 Magnetic Drum Data-Processing Machine, an early computer first used in 1954. It was developed by Stan Poley at the IBM Thomas J. Watson Research Center. [1] SOAP is called "Optimal" (or "Optimum") because it attempts to store generated instructions on the storage drum to minimize the access time from one instruction to the next. SOAP is a multi-pass assembler, that is, it processes the source program more than once in order to generate the object program.

Contents

The first version of SOAP was succeeded by SOAP II in 1957, [2] which supported additional hardware features such as index registers and magnetic core memory, then SOAP IIA in 1958, [3] SOAP 2L, SOAP 2L Tape, SOAP 4000, and SOAP 42 in 1961. [4] SOAP was used as a backend to the FOR TRANSIT compiler in 1957. [1]

Donald Knuth independently produced versions named SOAP III in 1958 [5] and SUPERSOAP in 1959 [6] at Case Institute of Technology, now part of Case Western Reserve University in Cleveland, Ohio. The US National Bureau of Standards, under the direction of Herbert Howe, also wrote a version of SOAP, called ISOPAR, said to significantly improve optimization. [7]

Optimization

The IBM 650 uses a magnetic drum as main storage. The drum holds up to 4,000 words and rotates at 12,500 revolutions per minute (RPM), 4.8 msec per rotation, or average access time of 2.4  msec. "If you placed your instructions sequentially onto the drum, you would have to wait for a complete rotation of the drum before the CPU could obtain the next instruction. Since many instructions on the 650 could execute in around 3 milliseconds, you would try to optimize your code by placing the instructions on the drum in such a way that it would not take the drum a full revolution to access your next instruction." [8] Each instruction contains the address of the next instruction to execute; instead of a sequential array of instructions the memory appears to be a linked list. SOAP optimizes by placing instructions to be executed sequentially in locations around the drum such that the next instruction is available as soon as possible after the current instruction finishes. This optimization was said to make the assembled programs "run as much as six or seven times faster." [1]

SOAP features

Pseudo-operations

SOAP II supports the following pseudo-operations (assembly directives): [2]

SOAP II Pseudo-operations
OPNameDescription
HEDHeadingSeparates programs, possibly written separately, which are being assembled together. It can specify a character to be appended to symbol names in this section to avoid naming conflicts.
RELRelocatable Library ProgramDefines the start of a relocatable library program being assembled ahead of the main program. It specifies the amounts by which assembled addresses should be relocated.
RBRRelocatable Block ReservationReserves a range of drum locations as "erasable blocks within the routine," to avoid using a number of cards to specify a data area.
REQRelocatable EquivalenceEquates a symbol to the address of a relocatable subroutine, and optionally specifies a relocation amount.
BLRBlock ReservationMarks a range of drum addresses as unavailable for assignment by SOAP.
BLABlock AvailabilityMarks a range of drum addresses as available for assignment.
REGRegional SpecificationDefines a range of drum addresses as a "region", identified by a one-character region id. Locations within the region are specified as <region id>nnnn where nnnn is a number from 1 to the number of locations specified for the region.
ALFAlphabetic DataSpecifies up to five alphabetic characters to be assembled at the specified location.
PATPunch Assembly TableTells SOAP to punch a deck of fifty Type Y cards listing used and available drum locations at any point during assembly. This table can be examined manually and fed back into SOAP on a subsequent assembly to restore "the availability status which existed at some point of a prior assembly."
BOPBeginning of ProgramIndicates the start of a new program to be assembled separately from previous programs in the same card deck.
EQUEquivalenceEquates a symbol to an absolute, regional, or symbolic value.
SYNSynonymEquates a symbol to an absolute drum address. The address is marked unavailable for assignment

Comments

Each source card can contain up to ten characters of comments in columns 63–72. For longer comments, up to thirty characters may be entered on a Comments Card (Type 1, '1' in column 41) in columns 43–72

Related Research Articles

<span class="mw-page-title-main">Assembly language</span> Low-level programming language

In computer programming, assembly language, often referred to simply as assembly and commonly abbreviated as ASM or asm, is any low-level programming language with a very strong correspondence between the instructions in the language and the architecture's machine code instructions. Assembly language usually has one statement per machine instruction (1:1), but constants, comments, assembler directives, symbolic labels of, e.g., memory locations, registers, and macros are generally also supported.

<span class="mw-page-title-main">Linker (computing)</span> Computer program which combines multiple object files into a single file

In computing, a linker or link editor is a computer system program that takes one or more object files and combines them into a single executable file, library file, or another "object" file.

In processor design, microcode serves as an intermediary layer situated between the central processing unit (CPU) hardware and the programmer-visible instruction set architecture of a computer. It consists of a set of hardware-level instructions that implement higher-level machine code instructions or control internal finite-state machine sequencing in many digital processing components. While microcode is utilized in general-purpose CPUs in contemporary desktops, it also functions as a fallback path for scenarios that the faster hardwired control unit is unable to manage.

<span class="mw-page-title-main">Random access</span>

Random access is the ability to access an arbitrary element of a sequence in equal time or any datum from a population of addressable elements roughly as easily and efficiently as any other, no matter how many elements may be in the set. In computer science it is typically contrasted to sequential access which requires data to be retrieved in the order it was stored.

<span class="mw-page-title-main">IBM 650</span> Vacuum tube computer system

The IBM 650 Magnetic Drum Data-Processing Machine is an early digital computer produced by IBM in the mid-1950s. It was the first mass produced computer in the world. Almost 2,000 systems were produced, the last in 1962, and it was the first computer to make a meaningful profit. The first one was installed in late 1954 and it was the most-popular computer of the 1950s.

<span class="mw-page-title-main">UNIVAC</span> Series of mainframe computer models

UNIVAC was a line of electronic digital stored-program computers starting with the products of the Eckert–Mauchly Computer Corporation. Later the name was applied to a division of the Remington Rand company and successor organizations.

<span class="mw-page-title-main">Drum memory</span> Magnetic data storage device

Drum memory was a magnetic data storage device invented by Gustav Tauschek in 1932 in Austria. Drums were widely used in the 1950s and into the 1960s as computer memory.

In computer science, program optimization, code optimization, or software optimization is the process of modifying a software system to make some aspect of it work more efficiently or use fewer resources. In general, a computer program may be optimized so that it executes more rapidly, or to make it capable of operating with less memory storage or other resources, or draw less power.

<span class="mw-page-title-main">IBM 1130</span> 16-bit IBM minicomputer introduced in 1965

The IBM 1130 Computing System, introduced in 1965, was IBM's least expensive computer at that time. A binary 16-bit machine, it was marketed to price-sensitive, computing-intensive technical markets, like education and engineering, succeeding the decimal IBM 1620 in that market segment. Typical installations included a 1 megabyte disk drive that stored the operating system, compilers and object programs, with program source generated and maintained on punched cards. Fortran was the most common programming language used, but several others, including APL, were available.

Addressing modes are an aspect of the instruction set architecture in most central processing unit (CPU) designs. The various addressing modes that are defined in a given instruction set architecture define how the machine language instructions in that architecture identify the operand(s) of each instruction. An addressing mode specifies how to calculate the effective memory address of an operand by using information held in registers and/or constants contained within a machine instruction or elsewhere.

<span class="mw-page-title-main">IBM 305 RAMAC</span> First computer to use magnetic disk storage

The IBM 305 RAMAC was the first commercial computer that used a moving-head hard disk drive for secondary storage. The system was publicly announced on September 14, 1956, with test units already installed at the U.S. Navy and at private corporations. RAMAC stood for "Random Access Method of Accounting and Control", as its design was motivated by the need for real-time accounting in business.

<span class="mw-page-title-main">English Electric DEUCE</span>

The DEUCE was one of the earliest British commercially available computers, built by English Electric from 1955. It was the production version of the Pilot ACE, itself a cut-down version of Alan Turing's ACE.

<span class="mw-page-title-main">Bendix G-15</span> 1956 computer design

The Bendix G-15 is a computer introduced in 1956 by the Bendix Corporation, Computer Division, Los Angeles, California. It is about 5 by 3 by 3 feet and weighs about 966 pounds (438 kg). The G-15 has a drum memory of 2,160 29-bit words, along with 20 words used for special purposes and rapid-access storage. The base system, without peripherals, cost $49,500. A working model cost around $60,000. It could also be rented for $1,485 per month. It was meant for scientific and industrial markets. The series was gradually discontinued when Control Data Corporation took over the Bendix computer division in 1963.

Autocoder is any of a group of assemblers for a number of IBM computers of the 1950s and 1960s. The first Autocoders appear to have been the earliest assemblers to provide a macro facility.

In the history of computing, optimum programming, or optimum coding is the practice of arranging a computer program's instructions in memory so as to minimize the time the machine spends waiting for instructions. It is of historical interest mainly due to the design of many early digital computers.

The IBM Basic assembly language and successors is a series of assembly languages and assemblers made for the IBM System/360 mainframe system and its successors through the IBM Z.

<span class="mw-page-title-main">History of compiler construction</span>

In computing, a compiler is a computer program that transforms source code written in a programming language or computer language, into another computer language. The most common reason for transforming source code is to create an executable program.

<span class="mw-page-title-main">Control table</span> Data structures that control the execution order of computer commands

Control tables are tables that control the control flow or play a major part in program control. There are no rigid rules about the structure or content of a control table—its qualifying attribute is its ability to direct control flow in some way through "execution" by a processor or interpreter. The design of such tables is sometimes referred to as table-driven design. In some cases, control tables can be specific implementations of finite-state-machine-based automata-based programming. If there are several hierarchical levels of control table they may behave in a manner equivalent to UML state machines

A hot spot in computer science is most usually defined as a region of a computer program where a high proportion of executed instructions occur or where most time is spent during the program's execution.

FOR TRANSIT is a subset of the FORTRAN programming language for the IBM 650 Magnetic Drum Data-Processing Machine, developed by a group led by Bob Bemer. FORTRAN, the first high-level programming language, was developed for the IBM 704 in 1957, at the same time IBM wanted to provide something similar for customers of the older, less powerful, but popular 650—eventually over 2000 650s were sold.

References

  1. 1 2 3 Lorenzo, Mark Jones (2019). The History of the Fortran Programming Language. p. 162. ISBN   9781082395949 . Retrieved Sep 28, 2020.
  2. 1 2 IBM Corporation (1957). SOAP II for the IBM 650 Data Processing System (PDF). Retrieved Sep 27, 2020.
  3. IBM Corporation (1958). IBM 650 Data Processing System Bulletin . Retrieved Sep 27, 2020.}
  4. IBM Corporation (1961). SOAP 2L, SOAP 2L Tape, SOAP 4000, and SOAP 42 (PDF). Retrieved Sep 27, 2020.
  5. "SOAP III". Online Historical Encyclopaedia of Programming Languages. Retrieved Sep 27, 2020.
  6. Knuth, Donald E. (1959). SUPERSOAP Assembly System for the Augmented 650 (PDF). Retrieved Sep 27, 2020.
  7. Howe, Herbert. ISOPAR: a new and improved symbolic optimizing assembly routine for the IBM 650 (PDF). Retrieved Sep 28, 2020.
  8. Kugel, Herb (October 22, 2001). "The IBM 650". Dr. Dobb's. Retrieved Sep 28, 2020.