The Complexity of Songs

"The Complexity of Songs" is a scholarly article by computer scientist Donald Knuth in 1977, [1] as an in-joke about computational complexity theory. The article capitalizes on the tendency of popular songs to devolve from long and content-rich ballads to highly repetitive texts with little or no meaningful content. [2] The article notes that a song of length N words may be produced remembering, e.g., only O(log N) words ("space complexity" of the song) or even less.

Article summary

Knuth writes that "our ancient ancestors invented the concept of refrain" to reduce the space complexity of songs, which becomes crucial when a large number of songs is to be committed to one's memory. Knuth's Lemma 1 states that if N is the length of a song, then the refrain decreases the song complexity to cN, where the factor c < 1. [1]
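A refrain reduces memory because its text is learned once but sung after every verse. The following Python sketch illustrates Lemma 1 with hypothetical placeholder verses and refrain (none of the text or names below come from Knuth's article): the number of words that must be memorized is roughly cN with c < 1, where N is the number of words actually sung.

    # Illustration of Lemma 1: a refrain of r words sung after each of v verses
    # (each of w words) gives a song of N = v*(w + r) words while only
    # v*w + r words need to be memorized, i.e. about c*N words with
    # c = w/(w + r) < 1.  Verse and refrain text are placeholders.

    def song_with_refrain(verses, refrain):
        """Return the full song: each verse followed by the refrain."""
        lines = []
        for verse in verses:
            lines.append(verse)
            lines.append(refrain)
        return "\n".join(lines)

    verses = [f"verse number {i} with some unique words" for i in range(1, 6)]
    refrain = "and the refrain comes around again"

    song = song_with_refrain(verses, refrain)
    sung = len(song.split())                                   # words actually sung
    memorized = sum(len(v.split()) for v in verses) + len(refrain.split())
    print(f"sung: {sung}, memorized: {memorized}, c = {memorized / sung:.2f}")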

Knuth further demonstrates a way of producing songs with O(√N) complexity, an approach "further improved by a Scottish farmer named O. MacDonald". [1]
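The construction can be sketched in Python as follows (a simplified rendering with a placeholder animal list, not the lyrics Knuth analyses): verse k introduces one new animal but repeats the sounds of all animals introduced so far, so m memorized animal/sound pairs generate a song of roughly m² lines, i.e. O(√N) memory for a song of length N.

    # Cumulative "Old MacDonald"-style song: m memorized (animal, sound) pairs
    # yield a song of Theta(m^2) length, hence complexity O(sqrt(N)).
    # The animal list is a placeholder and the lyric pattern is simplified.

    ANIMALS = [("cow", "moo"), ("duck", "quack"), ("pig", "oink"), ("dog", "woof")]

    def old_macdonald(m):
        """Return the first m cumulative verses as one string."""
        verses = []
        for k in range(m):
            animal, _ = ANIMALS[k]
            lines = ["Old MacDonald had a farm, E-I-E-I-O,",
                     f"and on that farm he had a {animal}, E-I-E-I-O,"]
            # repeat the sound of every animal introduced so far, newest first
            for _, sound in reversed(ANIMALS[:k + 1]):
                lines.append(f"with a {sound} {sound} here and a {sound} {sound} there,")
            verses.append("\n".join(lines))
        return "\n\n".join(verses)

    print(old_macdonald(len(ANIMALS)))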

More ingenious approaches yield songs of complexity O(log N), a class known as "m bottles of beer on the wall".
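This class can be sketched as follows (a minimal Python rendering; the verse wording is approximate and pluralization is ignored): the whole song is generated from a fixed template plus the single number m, so the memory required is the template plus the O(log m) digits of m, which is O(log N) for a song of length N.

    # "m bottles of beer on the wall": the song is determined by a constant
    # template and the number m alone, so the memory needed is O(log N)
    # (the digits of m), while the song length N grows linearly in m.

    def bottles_of_beer(m):
        """Yield the verses of the song, counting down from m."""
        for n in range(m, 0, -1):
            yield (f"{n} bottles of beer on the wall, {n} bottles of beer,\n"
                   f"take one down and pass it around, "
                   f"{n - 1} bottles of beer on the wall.")

    for verse in bottles_of_beer(3):
        print(verse)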

Finally, the progress during the 20th century—stimulated by the fact that "the advent of modern drugs has led to demands for still less memory"—leads to the ultimate improvement: Arbitrarily long songs with space complexity O(1) exist, e.g. a song defined by the recurrence relation [1]

S_k = V_k S_{k-1},   V_k = 'That's the way,' U 'I like it,' U,   for all k ≥ 1,
U = 'uh huh,' 'uh huh'
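A minimal Python sketch of this recurrence (variable and function names are the editor's, not Knuth's): the song S_k is produced by prepending the same constant block V to S_{k-1}, so only a constant amount of text ever has to be remembered, however long the song becomes.

    # O(1)-memory song: S_0 is empty and S_k = V S_{k-1}, with the block V the
    # same for every k, so an arbitrarily long song is generated from a
    # constant amount of memorized text.

    U = "uh huh, uh huh,"
    V = f"That's the way, {U} I like it, {U}"

    def song(k):
        """Return S_k, i.e. the constant block V repeated k times."""
        s = ""                        # S_0 = empty song
        for _ in range(k):
            s = V + "\n" + s          # S_k = V_k S_{k-1}, with V_k = V
        return s

    print(song(3))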

Further developments

Prof. Kurt Eisemann of San Diego State University, in his letter to the Communications of the ACM, [3] further improves the latter seemingly unbeatable estimate. He begins with the observation that for practical applications the value of the "hidden constant" c in the Big Oh notation may be crucial in making the difference between feasibility and infeasibility: for example, a constant value of 10^80 would exceed the capacity of any known device. He further notices that a technique was already known in Mediaeval Europe whereby the textual content of an arbitrary tune can be recorded based on the recurrence relation S_k = C_2 S_{k-1}, where C_2 = 'la', yielding a value of the big-Oh constant c equal to 2. However, it turns out that another culture achieved the absolute lower bound of O(0). As Prof. Eisemann puts it:

"When the Mayflower voyagers first descended on these shores, the native Americans proud of their achievement in the theory of information storage and retrieval, at first welcomed the strangers with the complete silence. This was meant to convey their peak achievement in the complexity of songs, namely the demonstration that a limit as low as c = 0 is indeed obtainable."

However, the Europeans were unprepared to grasp this notion, and the chiefs, in order to establish a common ground for conveying their achievements, later proceeded to demonstrate an approach described by the recurrence relation S_k = C_1 S_{k-1}, where C_1 = 'i', with a suboptimal complexity given by c = 1. [2] [3]
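Both schemes use the same recurrence and differ only in the memorized constant; a brief Python sketch, assuming the reconstructed constants 'la' (two characters, c = 2) and 'i' (one character, c = 1):

    # S_k = C S_{k-1} with a fixed constant C: the hidden constant c (here,
    # characters memorized per repetition) depends only on the length of C,
    # while the memory needed stays O(1) however long the song grows.

    def constant_song(constant, k):
        """Return S_k for the recurrence S_k = C S_{k-1} with S_0 empty."""
        return (constant + " ") * k

    for constant in ("la", "i"):      # c = 2 and c = 1 characters, respectively
        print(f"C = {constant!r}: {constant_song(constant, 5).strip()}")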

The O(1) space complexity result was also implemented by Guy L. Steele, Jr., perhaps challenged by Knuth's article. [4] Dr. Steele's TELNET Song used a completely different algorithm based on exponential recursion, a parody of some implementations of TELNET. [5] [6] [7]

Darrah Chavey suggested that the complexity analysis of human songs can be a useful pedagogic device for teaching students complexity theory. [8]

In the article "On Superpolylogarithmic Subexponential Functions", [9] Prof. Alan Sherman writes that Knuth's article was seminal for the analysis of a special class of functions.

References

  1. Knuth, Donald (Summer 1977). "The Complexity of Songs". SIGACT News. 9 (2): 17–24. doi:10.1145/1008354.1008355. S2CID 17533775. Reprinted in: Knuth, Donald (1984). "The Complexity of Songs". Communications of the ACM. 27 (4): 344–346. doi:10.1145/358027.358042. MR 0784131.
  2. Steven Krantz (2005). Mathematical Apocrypha Redux. ISBN 0-88385-554-2. pp. 2–3.
  3. Kurt Eisemann, "Further Results on the Complexity of Songs", Communications of the ACM, vol. 28 (1985), no. 3, p. 235.
  4. Peter G. Neumann, "A further view of the first quarter century", Communications of the ACM, vol. 27, no. 4, April 1984, p. 343.
  5. Guy L. Steele, Jr., "The Telnet Song", Communications of the ACM, April 1984.
  6. Text of the TELNET Song (retrieved January 5, 2012).
  7. Telnet song in MIDI format.
  8. Chavey, Darrah (1996). "Songs and the analysis of algorithms". Proceedings of the twenty-seventh SIGCSE technical symposium on Computer science education. pp. 4–8. doi:10.1145/236452.236475. ISBN 089791757X. S2CID 148247. Retrieved 7 January 2013.
  9. Alan Sherman, "On Superpolylogarithmic Subexponential Functions" (PostScript), ACM SIGACT News, vol. 22, no. 1, 1991, p. 65.