A Million Random Digits with 100,000 Normal Deviates

Last updated
Lines 10580-10594, columns 21-40, from A Million Random Digits with 100,000 Normal Deviates Random digits.png
Lines 10580–10594, columns 21–40, from A Million Random Digits with 100,000 Normal Deviates

A Million Random Digits with 100,000 Normal Deviates is a random number book by the RAND Corporation, originally published in 1955. The book, consisting primarily of a random number table, was an important 20th century work in the field of statistics and random numbers.

Contents

Production and background

It was produced starting in 1947 by an electronic simulation of a roulette wheel attached to a computer, the results of which were then carefully filtered and tested before being used to generate the table. The RAND table was an important breakthrough in delivering random numbers, because such a large and carefully prepared table had never before been available. In addition to being available in book form, one could also order the digits on a series of punched cards.

The table is formatted as 400 pages, each containing 50 lines of 50 digits. Columns and lines are grouped in fives, and the lines are numbered 00000 through 19999. The standard normal deviates are another 200 pages (10 per line, lines 0000 through 9999), with each deviate given to three decimal places. There are 28 additional pages of front matter. [1] [2] [3]

Utility

The main use of the tables was in statistics and the experimental design of scientific experiments, especially those that used the Monte Carlo method; in cryptography, they have also been used as nothing up my sleeve numbers, for example in the design of the Khafre cipher. The book was one of the last of a series of random number tables produced from the mid-1920s to the 1950s, after which the development of high-speed computers allowed faster operation through the generation of pseudorandom numbers rather than reading them from tables.

2001 edition

The book was reissued in 2001 ( ISBN   0-8330-3047-7) with a new foreword by RAND Executive Vice President Michael D. Rich. It has generated many humorous user reviews on Amazon.com. [4]

Sample

The digits (sequence A002205 in the OEIS ) begin:

10097 32533  76520 13586  34673 54876  80959 09117  39292 74945

Related Research Articles

A pseudorandom sequence of numbers is one that appears to be statistically random, despite having been produced by a completely deterministic and repeatable process. Simply put, the problem is that many of the sources of randomness available to humans rely on physical processes not readily available to computer programs.

<span class="mw-page-title-main">Benford's law</span> Observation that in many real-life datasets, the leading digit is likely to be small

Benford's law, also known as the Newcomb–Benford law, the law of anomalous numbers, or the first-digit law, is an observation that in many real-life sets of numerical data, the leading digit is likely to be small. In sets that obey the law, the number 1 appears as the leading significant digit about 30% of the time, while 9 appears as the leading significant digit less than 5% of the time. Uniformly distributed digits would each occur about 11.1% of the time. Benford's law also makes predictions about the distribution of second digits, third digits, digit combinations, and so on.

A pseudorandom number generator (PRNG), also known as a deterministic random bit generator (DRBG), is an algorithm for generating a sequence of numbers whose properties approximate the properties of sequences of random numbers. The PRNG-generated sequence is not truly random, because it is completely determined by an initial value, called the PRNG's seed. Although sequences that are closer to truly random can be generated using hardware random number generators, pseudorandom number generators are important in practice for their speed in number generation and their reproducibility.

<span class="mw-page-title-main">Monte Carlo method</span> Probabilistic problem-solving algorithm

Monte Carlo methods, or Monte Carlo experiments, are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. The underlying concept is to use randomness to solve problems that might be deterministic in principle. The name comes from the Monte Carlo Casino in Monaco, where the primary developer of the method, physicist Stanislaw Ulam, was inspired by his uncle's gambling habits.

<span class="mw-page-title-main">Box–Muller transform</span> Statistical transform

The Box–Muller transform, by George Edward Pelham Box and Mervin Edgar Muller, is a random number sampling method for generating pairs of independent, standard, normally distributed random numbers, given a source of uniformly distributed random numbers. The method was first mentioned explicitly by Raymond E. A. C. Paley and Norbert Wiener in their 1934 treatise on Fourier transforms in the complex domain. Given the status of these latter authors and the widespread availability and use of their treatise, it is almost certain that Box and Muller were well aware of its contents.

<span class="mw-page-title-main">Hardware random number generator</span> Cryptographic device

In computing, a hardware random number generator (HRNG), true random number generator (TRNG), non-deterministic random bit generator (NRBG), or physical random number generator is a device that generates random numbers from a physical process capable of producing entropy, unlike the pseudorandom number generator that utilizes a deterministic algorithm and non-physical nondeterministic random bit generators that do not include hardware dedicated to generation of entropy.

<span class="mw-page-title-main">Mathematical table</span> List of values of a mathematical function

Mathematical tables are lists of numbers showing the results of a calculation with varying arguments. Trigonometric tables were used in ancient Greece and India for applications to astronomy and celestial navigation, and continued to be widely used until electronic calculators became cheap and plentiful in the 1970s, in order to simplify and drastically speed up computation. Tables of logarithms and trigonometric functions were common in math and science textbooks, and specialized tables were published for numerous applications.

Rand McNally is an American technology and publishing company that provides mapping, software and hardware for consumer electronics, commercial transportation and education markets. The company is headquartered in Chicago, with a distribution center in Richmond, Kentucky.

<span class="mw-page-title-main">Maurice Kendall</span> British statistician

Sir Maurice George Kendall, FBA was a prominent British statistician. The Kendall tau rank correlation is named after him.

<span class="mw-page-title-main">Middle-square method</span>

In mathematics and computer science, the middle-square method is a method of generating pseudorandom numbers. In practice it is a highly flawed method for many practical purposes, since its period is usually very short and it has some severe weaknesses; repeated enough times, the middle-square method will either begin repeatedly generating the same number or cycle to a previous number in the sequence and loop indefinitely.

Random number tables have been used in statistics for tasks such as selected random samples. This was much more effective than manually selecting the random samples. Nowadays, tables of random numbers have been replaced by computational random number generators.

<span class="mw-page-title-main">Random number generation</span> Producing a sequence that cannot be predicted better than by random chance

Random number generation is a process by which, often by means of a random number generator (RNG), a sequence of numbers or symbols that cannot be reasonably predicted better than by random chance is generated. This means that the particular outcome sequence will contain some patterns detectable in hindsight but impossible to foresee. True random number generators can be hardware random-number generators (HRNGs), wherein each generation is a function of the current value of a physical environment's attribute that is constantly changing in a manner that is practically impossible to model. This would be in contrast to so-called "random number generations" done by pseudorandom number generators (PRNGs), which generate numbers that only look random but are in fact pre-determined—these generations can be reproduced simply by knowing the state of the PRNG.

<span class="mw-page-title-main">Computational statistics</span> Interface between statistics and computer science

Computational statistics, or statistical computing, is the study which is the intersection of statistics and computer science, and refers to the statistical methods that are enabled by using computational methods. It is the area of computational science specific to the mathematical science of statistics. This area is fast developing. The view that the broader concept of computing must be taught as part of general statistical education is gaining momentum.

<span class="mw-page-title-main">Randomness</span> Apparent lack of pattern or predictability in events

In common usage, randomness is the apparent or actual lack of definite pattern or predictability in information. A random sequence of events, symbols or steps often has no order and does not follow an intelligible pattern or combination. Individual random events are, by definition, unpredictable, but if there is a known probability distribution, the frequency of different outcomes over repeated events is predictable. For example, when throwing two dice, the outcome of any particular roll is unpredictable, but a sum of 7 will tend to occur twice as often as 4. In this view, randomness is not haphazardness; it is a measure of uncertainty of an outcome. Randomness applies to concepts of chance, probability, and information entropy.

A standard normal deviate is a normally distributed deviate. It is a realization of a standard normal random variable, defined as a random variable with expected value 0 and variance 1. Where collections of such random variables are used, there is often an associated assumption that members of such collections are statistically independent.

Norman Zalmon Shapiro was an American mathematician, who was the co-author of the Rice–Shapiro theorem.

Non-uniform random variate generation or pseudo-random number sampling is the numerical practice of generating pseudo-random numbers (PRN) that follow a given probability distribution. Methods are typically based on the availability of a uniformly distributed PRN generator. Computational algorithms are then used to manipulate a single random variate, X, or often several such variates, into a new random variate Y such that these values have the required distribution. The first methods were developed for Monte-Carlo simulations in the Manhattan project, published by John von Neumann in the early 1950s.

<span class="mw-page-title-main">Random number book</span>

A random number book is a book whose main content is a large number of random numbers or random digits. Such books were used in early cryptography and experimental design, and were published by the Rand Corporation and others. The Rand corporation book A Million Random Digits with 100,000 Normal Deviates was first published in 1955 and was reissued in 2001. A sequel, A Million And One Random Digits was published in 2022.

Bernard Babington Smith, OBE was a British academic, wartime intelligence officer and amateur athlete.

References

  1. Tukey, John W. (1955). "Reviewed work: A Million Random Digits with 100,000 Normal Deviates, the Rand Corporation". Journal of the Operations Research Society of America. 3 (4): 568–571. JSTOR   166772.
  2. Moore, P. G. (1955). "Reviewed work: A Million Random Digits with 100,000 Normal Deviates., the Rand Corporation". Biometrika. 42 (3/4): 543. doi:10.2307/2333414. JSTOR   2333414.
  3. Davis, Jordan (2007). "A Million Random Digits". New England Review (1990-). 28 (4): 161–163. JSTOR   40245032.
  4. Heffernan, Virginia (January 15, 2010). "The Reviewing Stand". The New York Times Magazine . Retrieved 2011-03-09.

Additional sources