# Cantor function

Last updated

In mathematics, the Cantor function is an example of a function that is continuous, but not absolutely continuous. It is a notorious counterexample in analysis, because it challenges naive intuitions about continuity, derivative, and measure. Though it is continuous everywhere and has zero derivative almost everywhere, its value still goes from 0 to 1 as its argument reaches from 0 to 1. Thus, in one sense the function seems very much like a constant one which cannot grow, and in another, it does indeed monotonically grow, by construction.

## Contents

It is also referred to as the Cantor ternary function, the Lebesgue function, [1] Lebesgue's singular function, the Cantor–Vitali function, the Devil's staircase, [2] the Cantor staircase function, [3] and the Cantor–Lebesgue function. [4] GeorgCantor  ( 1884 ) introduced the Cantor function and mentioned that Scheeffer pointed out that it was a counterexample to an extension of the fundamental theorem of calculus claimed by Harnack. The Cantor function was discussed and popularized by Scheeffer (1884), Lebesgue (1904) and Vitali (1905).

## Definition

See figure. To formally define the Cantor function c : [0,1] → [0,1], let x be in [0,1] and obtain c(x) by the following steps:

1. Express x in base 3.
2. If x contains a 1, replace every digit strictly after the first 1 by 0.
3. Replace any remaining 2s with 1s.
4. Interpret the result as a binary number. The result is c(x).

For example:

• 1/4 is 0.02020202... in base 3. There are no 1s so the next stage is still 0.02020202... This is rewritten as 0.01010101... When read in base 2, this corresponds to 1/3, so c(1/4) = 1/3.
• 1/5 is 0.01210121... in base 3. The digits after the first 1 are replaced by 0s to produce 0.01000000... This is not rewritten since there are no 2s. When read in base 2, this corresponds to 1/4, so c(1/5) = 1/4.
• 200/243 is 0.21102 (or 0.211012222...) in base 3. The digits after the first 1 are replaced by 0s to produce 0.21. This is rewritten as 0.11. When read in base 2, this corresponds to 3/4, so c(200/243) = 3/4.

Equivalently, if ${\displaystyle {\mathcal {C}}}$ is the Cantor set on [0,1], then the Cantor function c : [0,1] → [0,1] can be defined as

${\displaystyle c(x)={\begin{cases}\sum _{n=1}^{\infty }{\frac {a_{n}}{2^{n}}},&x=\sum _{n=1}^{\infty }{\frac {2a_{n}}{3^{n}}}\in {\mathcal {C}}\ \mathrm {for} \ a_{n}\in \{0,1\};\\\sup _{y\leq x,\,y\in {\mathcal {C}}}c(y),&x\in [0,1]\setminus {\mathcal {C}}.\\\end{cases}}}$

This formula is well-defined, since every member of the Cantor set has a unique base 3 representation that only contains the digits 0 or 2. (For some members of ${\displaystyle {\mathcal {C}}}$, the ternary expansion is repeating with trailing 2's and there is an alternative non-repeating expansion ending in 1. For example, 1/3 = 0.13 = 0.02222...3 is a member of the Cantor set). Since c(0) = 0 and c(1) = 1, and c is monotonic on ${\displaystyle {\mathcal {C}}}$, it is clear that 0 ≤ c(x) ≤ 1 also holds for all ${\displaystyle x\in [0,1]\setminus {\mathcal {C}}}$.

## Properties

The Cantor function challenges naive intuitions about continuity and measure; though it is continuous everywhere and has zero derivative almost everywhere, ${\textstyle c(x)}$ goes from 0 to 1 as ${\textstyle x}$ goes from 0 to 1, and takes on every value in between. The Cantor function is the most frequently cited example of a real function that is uniformly continuous (precisely, it is Hölder continuous of exponent α = log 2/log 3) but not absolutely continuous. It is constant on intervals of the form (0.x1x2x3...xn022222..., 0.x1x2x3...xn200000...), and every point not in the Cantor set is in one of these intervals, so its derivative is 0 outside of the Cantor set. On the other hand, it has no derivative at any point in an uncountable subset of the Cantor set containing the interval endpoints described above.

The Cantor function can also be seen as the cumulative probability distribution function of the 1/2-1/2 Bernoulli measure μ supported on the Cantor set: ${\textstyle c(x)=\mu ([0,x])}$. This probability distribution, called the Cantor distribution, has no discrete part. That is, the corresponding measure is atomless. This is why there are no jump discontinuities in the function; any such jump would correspond to an atom in the measure.

However, no non-constant part of the Cantor function can be represented as an integral of a probability density function; integrating any putative probability density function that is not almost everywhere zero over any interval will give positive probability to some interval to which this distribution assigns probability zero. In particular, as Vitali (1905) pointed out, the function is not the integral of its derivative even though the derivative exists almost everywhere.

The Cantor function is the standard example of a singular function.

The Cantor function is non-decreasing, and so in particular its graph defines a rectifiable curve. Scheeffer (1884) showed that the arc length of its graph is 2.

### Lack of absolute continuity

Because the Lebesgue measure of the uncountably infinite Cantor set is 0, for any positive ε < 1 and δ, there exists a finite sequence of pairwise disjoint sub-intervals with total length < δ over which the Cantor function cumulatively rises more than ε.

In fact, for every δ > 0 there are finitely many pairwise disjoint intervals (xk,yk) (1  k  M) with ${\displaystyle \sum \limits _{k=1}^{M}(y_{k}-x_{k})<\delta }$ and ${\displaystyle \sum \limits _{k=1}^{M}(c(y_{k})-c(x_{k}))=1}$.

## Alternative definitions

### Iterative construction

Below we define a sequence {fn} of functions on the unit interval that converges to the Cantor function.

Let f0(x) = x.

Then, for every integer n 0, the next function fn+1(x) will be defined in terms of fn(x) as follows:

Let fn+1(x) = 1/2 ×fn(3x),  when 0 ≤ x ≤ 1/3;

Let fn+1(x) = 1/2,  when 1/3 ≤ x ≤ 2/3;

Let fn+1(x) = 1/2 + 1/2 ×fn(3x 2),  when 2/3 ≤ x ≤ 1.

The three definitions are compatible at the end-points 1/3 and 2/3, because fn(0) = 0 and fn(1) = 1 for every n, by induction. One may check that fn converges pointwise to the Cantor function defined above. Furthermore, the convergence is uniform. Indeed, separating into three cases, according to the definition of fn+1, one sees that

${\displaystyle \max _{x\in [0,1]}|f_{n+1}(x)-f_{n}(x)|\leq {\frac {1}{2}}\,\max _{x\in [0,1]}|f_{n}(x)-f_{n-1}(x)|,\quad n\geq 1.}$

If f denotes the limit function, it follows that, for every n  0,

${\displaystyle \max _{x\in [0,1]}|f(x)-f_{n}(x)|\leq 2^{-n+1}\,\max _{x\in [0,1]}|f_{1}(x)-f_{0}(x)|.}$

Also the choice of starting function does not really matter, provided f0(0) = 0, f0(1) = 1 and f0 is bounded [ citation needed ].

### Fractal volume

The Cantor function is closely related to the Cantor set. The Cantor set C can be defined as the set of those numbers in the interval [0, 1] that do not contain the digit 1 in their base-3 (triadic) expansion, except if the 1 is followed by zeros only (in which case the tail 1000${\displaystyle \ldots }$ can be replaced by 0222${\displaystyle \ldots }$ to get rid of any 1). It turns out that the Cantor set is a fractal with (uncountably) infinitely many points (zero-dimensional volume), but zero length (one-dimensional volume). Only the D-dimensional volume ${\displaystyle H_{D}}$ (in the sense of a Hausdorff-measure) takes a finite value, where ${\displaystyle D=\log(2)/\log(3)}$ is the fractal dimension of C. We may define the Cantor function alternatively as the D-dimensional volume of sections of the Cantor set

${\displaystyle f(x)=H_{D}(C\cap (0,x)).}$

## Self-similarity

The Cantor function possesses several symmetries. For ${\displaystyle 0\leq x\leq 1}$, there is a reflection symmetry

${\displaystyle c(x)=1-c(1-x)}$

and a pair of magnifications, one on the left and one on the right:

${\displaystyle c\left({\frac {x}{3}}\right)={\frac {c(x)}{2}}}$

and

${\displaystyle c\left({\frac {x+2}{3}}\right)={\frac {1+c(x)}{2}}}$

The magnifications can be cascaded; they generate the dyadic monoid. This is exhibited by defining several helper functions. Define the reflection as

${\displaystyle r(x)=1-x}$

The first self-symmetry can be expressed as

${\displaystyle r\circ c=c\circ r}$

where the symbol ${\displaystyle \circ }$ denotes function composition. That is, ${\displaystyle (r\circ c)(x)=r(c(x))=1-c(x)}$ and likewise for the other cases. For the left and right magnifications, write the left-mappings

${\displaystyle L_{D}(x)={\frac {x}{2}}}$ and ${\displaystyle L_{C}(x)={\frac {x}{3}}}$

Then the Cantor function obeys

${\displaystyle L_{D}\circ c=c\circ L_{C}}$

Similarly, define the right mappings as

${\displaystyle R_{D}(x)={\frac {1+x}{2}}}$ and ${\displaystyle R_{C}(x)={\frac {2+x}{3}}}$

Then, likewise,

${\displaystyle R_{D}\circ c=c\circ R_{C}}$

The two sides can be mirrored one onto the other, in that

${\displaystyle L_{D}\circ r=r\circ R_{D}}$

and likewise,

${\displaystyle L_{C}\circ r=r\circ R_{C}}$

These operations can be stacked arbitrarily. Consider, for example, the sequence of left-right moves ${\displaystyle LRLLR.}$ Adding the subscripts C and D, and, for clarity, dropping the composition operator ${\displaystyle \circ }$ in all but a few places, one has:

${\displaystyle L_{D}R_{D}L_{D}L_{D}R_{D}\circ c=c\circ L_{C}R_{C}L_{C}L_{C}R_{C}}$

Arbitrary finite-length strings in the letters L and R correspond to the dyadic rationals, in that every dyadic rational can be written as both ${\displaystyle y=n/2^{m}}$ for integer n and m and as finite length of bits ${\displaystyle y=0.b_{1}b_{2}b_{3}\cdots b_{m}}$ with ${\displaystyle b_{k}\in \{0,1\}.}$ Thus, every dyadic rational is in one-to-one correspondence with some self-symmetry of the Cantor function.

Some notational rearrangements can make the above slightly easier to express. Let ${\displaystyle g_{0}}$ and ${\displaystyle g_{1}}$ stand for L and R. Function composition extends this to a monoid, in that one can write ${\displaystyle g_{010}=g_{0}g_{1}g_{0}}$ and generally, ${\displaystyle g_{A}g_{B}=g_{AB}}$ for some binary strings of digits A, B, where AB is just the ordinary concatenation of such strings. The dyadic monoid M is then the monoid of all such finite-length left-right moves. Writing ${\displaystyle \gamma \in M}$ as a general element of the monoid, there is a corresponding self-symmetry of the Cantor function:

${\displaystyle \gamma _{D}\circ c=c\circ \gamma _{C}}$

The dyadic monoid itself has several interesting properties. It can be viewed as a finite number of left-right moves down an infinite binary tree; the infinitely distant "leaves" on the tree correspond to the points on the Cantor set, and so, the monoid also represents the self-symmetries of the Cantor set. In fact, a large class of commonly occurring fractals are described by the dyadic monoid; additional examples can be found in the article on de Rham curves. Other fractals possessing self-similarity are described with other kinds of monoids. The dyadic monoid is itself a sub-monoid of the modular group ${\displaystyle SL(2,\mathbb {Z} ).}$

Note that the Cantor function bears more than a passing resemblance to Minkowski's question-mark function. In particular, it obeys the exact same symmetry relations, although in an altered form.

## Generalizations

Let

${\displaystyle y=\sum _{k=1}^{\infty }b_{k}2^{-k}}$

be the dyadic (binary) expansion of the real number 0 ≤ y ≤ 1 in terms of binary digits bk {0,1}. This expansion is discussed in greater detail in the article on the dyadic transformation. Then consider the function

${\displaystyle C_{z}(y)=\sum _{k=1}^{\infty }b_{k}z^{k}.}$

For z = 1/3, the inverse of the function x = 2 C1/3(y) is the Cantor function. That is, y = y(x) is the Cantor function. In general, for any z < 1/2, Cz(y) looks like the Cantor function turned on its side, with the width of the steps getting wider as z approaches zero.

As mentioned above, the Cantor function is also the cumulative distribution function of a measure on the Cantor set. Different Cantor functions, or Devil's Staircases, can be obtained by considering different atom-less probability measures supported on the Cantor set or other fractals. While the Cantor function has derivative 0 almost everywhere, current research focusses on the question of the size of the set of points where the upper right derivative is distinct from the lower right derivative, causing the derivative to not exist. This analysis of differentiability is usually given in terms of fractal dimension, with the Hausdorff dimension the most popular choice. This line of research was started in the 1990s by Darst, [5] who showed that the Hausdorff dimension of the set of non-differentiability of the Cantor function is the square of the dimension of the Cantor set, ${\displaystyle (\log 2/\log 3)^{2}}$. Subsequently Falconer [6] showed that this squaring relationship holds for all Ahlfor's regular, singular measures, i.e.

${\displaystyle \dim _{H}\left\{x:f'(x)=\lim _{h\to 0^{+}}{\frac {\mu ([x,x+h])}{h}}{\text{ does not exist}}\right\}=\left(\dim _{H}\operatorname {supp} (\mu )\right)^{2}}$

Later, Troscheit [7] obtain a more comprehensive picture of the set where the derivative does not exist for more general normalized Gibb's measures supported on self-conformal and self-similar sets.

Hermann Minkowski's question mark function loosely resembles the Cantor function visually, appearing as a "smoothed out" form of the latter; it can be constructed by passing from a continued fraction expansion to a binary expansion, just as the Cantor function can be constructed by passing from a ternary expansion to a binary expansion. The question mark function has the interesting property of having vanishing derivatives at all rational numbers.

## Notes

1. Vestrup 2003 , Section 4.6.
2. Bass 2013 , p. 28.
3. Darst, Richard (1993-09-01). "The Hausdorff Dimension of the Nondifferentiability Set of the Cantor Function is [ ln(2)/ln(3) ]2". Proceedings of the American Mathematical Society. 119 (1): 105–108. doi:10.2307/2159830. JSTOR   2159830.
4. Falconer, Kenneth J. (2004-01-01). "One-sided multifractal analysis and points of non-differentiability of devil's staircases". Mathematical Proceedings of the Cambridge Philosophical Society. 136 (1): 167–174. Bibcode:2004MPCPS.136..167F. doi:10.1017/S0305004103006960. ISSN   1469-8064.
5. Troscheit, Sascha (2014-03-01). "Hölder differentiability of self-conformal devil's staircases". Mathematical Proceedings of the Cambridge Philosophical Society. 156 (2): 295–311. arXiv:. Bibcode:2014MPCPS.156..295T. doi:10.1017/S0305004113000698. ISSN   1469-8064. S2CID   56402751.

## Related Research Articles

In calculus, the chain rule is a formula that expresses the derivative of the composition of two differentiable functions f and g in terms of the derivatives f and g. More precisely, if is the function such that for every x, then the chain rule is, in Lagrange's notation,

In mathematics, the Cantor set is a set of points lying on a single line segment that has a number of unintuitive properties. It was discovered in 1874 by Henry John Stephen Smith and introduced by German mathematician Georg Cantor in 1883.

In vector calculus, the gradient of a scalar-valued differentiable function f of several variables is the vector field whose value at a point is the vector whose components are the partial derivatives of at . That is, for , its gradient is defined at the point in n-dimensional space as the vector:

In mathematical analysis, the Haar measure assigns an "invariant volume" to subsets of locally compact topological groups, consequently defining an integral for functions on those groups.

In mathematical optimization, the method of Lagrange multipliers is a strategy for finding the local maxima and minima of a function subject to equality constraints. It is named after the mathematician Joseph-Louis Lagrange. The basic idea is to convert a constrained problem into a form such that the derivative test of an unconstrained problem can still be applied. The relationship between the gradient of the function and gradients of the constraints rather naturally leads to a reformulation of the original problem, known as the Lagrangian function.

In mathematics, K-theory is, roughly speaking, the study of a ring generated by vector bundles over a topological space or scheme. In algebraic topology, it is a cohomology theory known as topological K-theory. In algebra and algebraic geometry, it is referred to as algebraic K-theory. It is also a fundamental tool in the field of operator algebras. It can be seen as the study of certain kinds of invariants of large matrices.

In differential geometry, the Lie derivative, named after Sophus Lie by Władysław Ślebodziński, evaluates the change of a tensor field, along the flow defined by another vector field. This change is coordinate invariant and therefore the Lie derivative is defined on any differentiable manifold.

In the calculus of variations and classical mechanics, the Euler-Lagrange equations is a system of second-order ordinary differential equations whose solutions are stationary points of the given action functional. The equations were discovered in the 1750s by Swiss mathematician Leonhard Euler and Italian mathematician Joseph-Louis Lagrange.

In measure theory, Lebesgue's dominated convergence theorem provides sufficient conditions under which almost everywhere convergence of a sequence of functions implies convergence in the L1 norm. Its power and utility are two of the primary theoretical advantages of Lebesgue integration over Riemann integration.

In category theory, a branch of mathematics, a monad is an endofunctor, together with two natural transformations required to fulfill certain coherence conditions. Monads are used in the theory of pairs of adjoint functors, and they generalize closure operators on partially ordered sets to arbitrary categories.

Faà di Bruno's formula is an identity in mathematics generalizing the chain rule to higher derivatives. It is named after Francesco Faà di Bruno, although he was not the first to state or prove the formula. In 1800, more than 50 years before Faà di Bruno, the French mathematician Louis François Antoine Arbogast had stated the formula in a calculus textbook, which is considered to be the first published reference on the subject.

In mathematics, the Minkowski question-mark function denoted by ?(x), is a function possessing various unusual fractal properties, defined by Hermann Minkowski. It maps quadratic irrationals to rational numbers on the unit interval, via an expression relating the continued fraction expansions of the quadratics to the binary expansions of the rationals, given by Arnaud Denjoy in 1938. In addition, it maps rational numbers to dyadic rationals, as can be seen by a recursive definition closely related to the Stern–Brocot tree.

The dyadic transformation is the mapping

In mathematics, an operad is concerned with prototypical algebras that model properties such as commutativity or anticommutativity as well as various amounts of associativity. Operads generalize the various associativity properties already observed in algebras and coalgebras such as Lie algebras or Poisson algebras by modeling computational trees within the algebra. Algebras are to operads as group representations are to groups. An operad can be seen as a set of operations, each one having a fixed finite number of inputs (arguments) and one output, which can be composed one with others. They form a category-theoretic analog of universal algebra.

In mathematics, the Frölicher–Nijenhuis bracket is an extension of the Lie bracket of vector fields to vector-valued differential forms on a differentiable manifold.

In mathematics, the blancmange curve is a self-affine curve constructible by midpoint subdivision. It is also known as the Takagi curve, after Teiji Takagi who described it in 1901, or as the Takagi–Landsberg curve, a generalization of the curve named after Takagi and Georg Landsberg. The name blancmange comes from its resemblance to a pudding of the same name. It is a special case of the more general de Rham curve; see also fractal curve.

In mathematics, a de Rham curve is a certain type of fractal curve named in honor of Georges de Rham.

In the mathematical field of differential topology, the Lie bracket of vector fields, also known as the Jacobi–Lie bracket or the commutator of vector fields, is an operator that assigns to any two vector fields X and Y on a smooth manifold M a third vector field denoted [X, Y].

In complex analysis of one and several complex variables, Wirtinger derivatives, named after Wilhelm Wirtinger who introduced them in 1927 in the course of his studies on the theory of functions of several complex variables, are partial differential operators of the first order which behave in a very similar manner to the ordinary derivatives with respect to one real variable, when applied to holomorphic functions, antiholomorphic functions or simply differentiable functions on complex domains. These operators permit the construction of a differential calculus for such functions that is entirely analogous to the ordinary differential calculus for functions of real variables.

In mathematics, infinite compositions of analytic functions (ICAF) offer alternative formulations of analytic continued fractions, series, products and other infinite expansions, and the theory evolving from such compositions may shed light on the convergence/divergence of these expansions. Some functions can actually be expanded directly as infinite compositions. In addition, it is possible to use ICAF to evaluate solutions of fixed point equations involving infinite expansions. Complex dynamics offers another venue for iteration of systems of functions rather than a single function. For infinite compositions of a single function see Iterated function. For compositions of a finite number of functions, useful in fractal theory, see Iterated function system.