Complexity index

Last updated December 03, 2022

In modern computer science and statistics, the complexity index of a function denotes the level of informational content, which in turn affects the difficulty of learning the function from examples. This is different from computational complexity, which is the difficulty to compute a function. Complexity indices characterize the entire class of functions to which the one we are interested in belongs. Focusing on Boolean functions, the detail of a class ${\mathsf {C}}$ of Boolean functions c essentially denotes how deeply the class is articulated.

Technical definition

To identify this index we must first define a sentry function of ${\mathsf {C}}$ . Let us focus for a moment on a single function c, call it a concept defined on a set ${\mathcal {X}}$ of elements that we may figure as points in a Euclidean space. In this framework, the above function associates to c a set of points that, since are defined to be external to the concept, prevent it from expanding into another function of ${\mathsf {C}}$ . We may dually define these points in terms of sentinelling a given concept c from being fully enclosed (invaded) by another concept within the class. Therefore, we call these points either sentinels or sentry points; they are assigned by the sentry function ${\boldsymbol {S}}$ to each concept of ${\mathsf {C}}$ in such a way that:

the sentry points are external to the concept c to be sentineled and internal to at least one other including it,
each concept $c'$ including c has at least one of the sentry points of c either in the gap between c and $c'$ , or outside $c'$ and distinct from the sentry points of $c'$ , and
they constitute a minimal set with these properties.

The technical definition coming from ( Apolloni 2006 ) is rooted in the inclusion of an augmented concept $c^{+}$ made up of c plus its sentry points by another $\left(c'\right)^{+}$ in the same class.

Definition of sentry function

For a concept class ${\mathsf {C}}$ on a space ${\mathfrak {X}}$ , a sentry function is a total function ${\boldsymbol {S}}:{\mathsf {C}}\cup \{\emptyset ,{\mathfrak {X}}\}\mapsto 2^{\mathfrak {X}}$ satisfying the following conditions:

Sentinels are outside the sentineled concept ( $c\cap {\boldsymbol {S}}(c)=\emptyset$ for all $c\in {\mathsf {C}}$ ).
Sentinels are inside the invading concept (Having introduced the sets $c^{+}=c\cup {\boldsymbol {S}}(c)$ , an invading concept $c'\in {\mathsf {C}}$ is such that $c'\not \subseteq c$ and $c^{+}\subseteq \left(c'\right)^{+}$ . Denoting $\mathrm {up} (c)$ the set of concepts invading c, we must have that if $c_{2}\in \mathrm {up} (c_{1})$ , then $c_{2}\cap {\boldsymbol {S}}(c_{1})\neq \emptyset$ ).
${\boldsymbol {S}}(c)$ is a minimal set with the above properties (No ${\boldsymbol {S}}'\neq {\boldsymbol {S}}$ exists satisfying (1) and (2) and having the property that ${\boldsymbol {S}}'(c)\subseteq {\boldsymbol {S}}(c)$ for every $c\in {\mathsf {C}}$ ).
Sentinels are honest guardians. It may be that $c\subseteq \left(c'\right)^{+}$ but ${\boldsymbol {S}}(c)\cap c'=\emptyset$ so that $c'\not \in \mathrm {up} (c)$ . This however must be a consequence of the fact that all points of ${\boldsymbol {S}}(c)$ are involved in really sentineling c against other concepts in $\mathrm {up} (c)$ and not just in avoiding inclusion of $c^{+}$ by $(c')^{+}$ . Thus if we remove $c',{\boldsymbol {S}}(c)$ remains unchanged (Whenever $c_{1}$ and $c_{2}$ are such that $c_{1}\subset c_{2}\cup {\boldsymbol {S}}(c_{2})$ and $c_{2}\cap {\boldsymbol {S}}(c_{1})=\emptyset$ , then the restriction of ${\boldsymbol {S}}$ to $\{c_{1}\}\cup \mathrm {up} (c_{1})-\{c_{2}\}$ is a sentry function on this set).

${\boldsymbol {S}}(c)$ is the frontier of c upon ${\boldsymbol {S}}$ .

A schematic outlook of outer sentineling functionality SentinelEx.png — A schematic outlook of outer sentineling functionality

With reference to the picture on the right, $\{x_{1},x_{2},x_{3}\}$ is a candidate frontier of $c_{0}$ against $c_{1},c_{2},c_{3},c_{4}$ . All points are in the gap between a $c_{i}$ and $c_{0}$ . They avoid inclusion of $c_{0}\cup \{x_{1},x_{2},x_{3}\}$ in $c_{3}$ , provided that these points are not used by the latter for sentineling itself against other concepts. Vice versa we expect that $c_{1}$ uses $x_{1}$ and $x_{3}$ as its own sentinels, $c_{2}$ uses $x_{2}$ and $x_{3}$ and $c_{4}$ uses $x_{1}$ and $x_{2}$ analogously. Point $x_{4}$ is not allowed as a $c_{0}$ sentry point since, like any diplomatic seat, it should be located outside all other concepts just to ensure that it is not occupied in case of invasion by $c_{0}$ .

Definition of detail

The frontier size of the most expensive concept to be sentineled with the least efficient sentineling function, i.e. the quantity

\mathrm {D} _{\mathsf {C}}=\sup _{{\boldsymbol {S}},c}\#{\boldsymbol {S}}(c)

,

is called detail of ${\mathsf {C}}$ . ${\boldsymbol {S}}$ spans also over sentry functions on subsets of ${\mathfrak {X}}$ sentineling in this case the intersections of the concepts with these subsets. Actually, proper subsets of ${\mathfrak {X}}$ may host sentineling tasks that prove harder than those emerging with ${\mathfrak {X}}$ itself.

The detail $\mathrm {D} _{\mathsf {C}}$ is a complexity measure of concept classes dual to the VC dimension $\mathrm {D} _{{\mathsf {V}}C}$ . The former uses points to separate sets of concepts, the latter concepts for partitioning sets of points. In particular the following inequality holds ( Apolloni 1997 )

\mathrm {D} _{\mathsf {C}}\leq \mathrm {D} _{{\mathsf {V}}C}+1

See also Rademacher complexity for a recently introduced class complexity index.

Example: continuous spaces

Class C of circles in $\mathbb {R} ^{2}$ has detail $\mathrm {D} _{\mathsf {C}}=2$ , as shown in the picture on left below. Similarly, for the class of segments on $\mathbb {R}$ , as shown in the picture on right.

Two points
x
1
,
x
2
{\displaystyle x_{1},x_{2}}
outside c (thick circle) are sufficient to prevent a larger circle not containing them from including it SentinelCircle.png — Two points $x_{1},x_{2}$ outside c (thick circle) are sufficient to prevent a larger circle not containing them from including it

$The class of segments in R {\displaystyle \mathbb {R} } and two points needed to sentinel its concepts SentinelSegment.png$

The class of segments in

\mathbb {R}

and two points needed to sentinel its concepts

Example: discrete spaces

The class ${\mathsf {C}}=\{c_{1},c_{2},c_{3},c_{4}\}$ on ${\mathfrak {X}}=\{x_{1},x_{2},x_{3}\}$ whose concepts are illustrated in the following scheme, where "+" denotes an element $x_{j}$ belonging to $c_{i}$ , "-" an element outside $c_{i}$ , and ⃝ a sentry point:

	$x_{1}$	$x_{2}$	$x_{3}$
$c_{1}=$	-⃝	-⃝	-
$c_{2}=$	-⃝	+	+
$c_{3}=$	+	-⃝	+
$c_{4}=$	+	+	+

This class has $\mathrm {D} _{\mathsf {C}}=2$ . As usual we may have different sentineling functions. A worst case $S$ , as illustrated, is: $\mathbf {S} (c_{1})=\{x_{1},x_{2}\},\mathbf {S} (c_{2})=\{x_{1}\},\mathbf {S} (c_{3})=\{x_{2}\},\mathbf {S} (c_{4})=\emptyset$ . However a cheaper one is $\mathbf {S} (c_{1})=\{x_{3}\},\mathbf {S} (c_{2})=\{x_{1}\},\mathbf {S} (c_{3})=\{x_{2}\},\mathbf {S} (c_{4})=\emptyset$ :

	$x_{1}$	$x_{2}$	$x_{3}$
$c_{1}=$	-	-	-⃝
$c_{2}=$	-⃝	+	+
$c_{3}=$	+	-⃝	+
$c_{4}=$	+	+	+

Related Research Articles

In computational complexity theory, bounded-error quantum polynomial time (BQP) is the class of decision problems solvable by a quantum computer in polynomial time, with an error probability of at most 1/3 for all instances. It is the quantum analogue to the complexity class BPP.

In mathematics, specifically category theory, a functor is a mapping between categories. Functors were first considered in algebraic topology, where algebraic objects are associated to topological spaces, and maps between these algebraic objects are associated to continuous maps between spaces. Nowadays, functors are used throughout modern mathematics to relate various categories. Thus, functors are important in all areas within mathematics to which category theory is applied.

In computational complexity theory, the class NC is the set of decision problems decidable in polylogarithmic time on a parallel computer with a polynomial number of processors. In other words, a problem with input size n is in NC if there exist constants c and k such that it can be solved in time $O (log c n)$ using $O (n k)$ parallel processors. Stephen Cook coined the name "Nick's class" after Nick Pippenger, who had done extensive research on circuits with polylogarithmic depth and polynomial size.

In the mathematical discipline of set theory, forcing is a technique for proving consistency and independence results. It was first used by Paul Cohen in 1963, to prove the independence of the axiom of choice and the continuum hypothesis from Zermelo–Fraenkel set theory.

<span class="mw-page-title-main">Hamiltonian mechanics</span> Formulation of classical mechanics using momenta

Hamiltonian mechanics emerged in 1833 as a reformulation of Lagrangian mechanics. Introduced by Sir William Rowan Hamilton, Hamiltonian mechanics replaces (generalized) velocities $used in Lagrangian mechanics with (generalized) momenta . Both theories provide interpretations of classical mechanics and describe the same physical phenomena.$

In topology and related branches of mathematics, the Kuratowski closure axioms are a set of axioms that can be used to define a topological structure on a set. They are equivalent to the more commonly used open set definition. They were first formalized by Kazimierz Kuratowski, and the idea was further studied by mathematicians such as Wacław Sierpiński and António Monteiro, among others.

In computational complexity theory, a complexity class is a set of computational problems of related resource-based complexity. The two most commonly analyzed resources are time and memory.

In computational complexity theory, P, also known as PTIME or DTIME(n^O), is a fundamental complexity class. It contains all decision problems that can be solved by a deterministic Turing machine using a polynomial amount of computation time, or polynomial time.

In mathematics, a norm is a function from a real or complex vector space to the non-negative real numbers that behaves in certain ways like the distance from the origin: it commutes with scaling, obeys a form of the triangle inequality, and is zero only at the origin. In particular, the Euclidean distance of a vector from the origin is a norm, called the Euclidean norm, or 2-norm, which may also be defined as the square root of the inner product of a vector with itself.

The Gauss–Newton algorithm is used to solve non-linear least squares problems, which is equivalent to minimizing a sum of squared function values. It is an extension of Newton's method for finding a minimum of a non-linear function. Since a sum of squares must be nonnegative, the algorithm can be viewed as using Newton's method to iteratively approximate zeroes of the sum, and thus minimizing the sum. It has the advantage that second derivatives, which can be challenging to compute, are not required.

In mathematics, fuzzy measure theory considers generalized measures in which the additive property is replaced by the weaker property of monotonicity. The central concept of fuzzy measure theory is the fuzzy measure which was introduced by Choquet in 1953 and independently defined by Sugeno in 1974 in the context of fuzzy integrals. There exists a number of different classes of fuzzy measures including plausibility/belief measures; possibility/necessity measures; and probability measures which are a subset of classical measures.

In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model by the principle of least squares: minimizing the sum of the squares of the differences between the observed dependent variable in the input dataset and the output of the (linear) function of the independent variable.

<span class="mw-page-title-main">Flow (mathematics)</span>

In mathematics, a flow formalizes the idea of the motion of particles in a fluid. Flows are ubiquitous in science, including engineering and physics. The notion of flow is basic to the study of ordinary differential equations. Informally, a flow may be viewed as a continuous motion of points over time. More formally, a flow is a group action of the real numbers on a set.

Bayesian linear regression is a type of conditional modeling in which the mean of one variable is described by a linear combination of other variables, with the goal of obtaining the posterior probability of the regression coefficients and ultimately allowing the out-of-sample prediction of the regressandconditional on observed values of the regressors. The simplest and most widely used version of this model is the normal linear model, in which $given is distributed Gaussian. In this model, and under a particular choice of prior probabilities for the parameters—so-called conjugate priors—the posterior can be found analytically. With more arbitrarily chosen priors, the posteriors generally have to be approximated.$

In mathematics, the axis–angle representation of a rotation parameterizes a rotation in a three-dimensional Euclidean space by two quantities: a unit vector $e$ indicating the direction of an axis of rotation, and an angle $θ$ describing the magnitude of the rotation about the axis. Only two numbers, not three, are needed to define the direction of a unit vector $e$ rooted at the origin because the magnitude of $e$ is constrained. For example, the elevation and azimuth angles of $e$ suffice to locate it in any particular Cartesian coordinate frame.

Non-linear least squares is the form of least squares analysis used to fit a set of m observations with a model that is non-linear in n unknown parameters (m ≥ n). It is used in some forms of nonlinear regression. The basis of the method is to approximate the model by a linear one and to refine the parameters by successive iterations. There are many similarities to linear least squares, but also some significant differences. In economic theory, the non-linear least squares method is applied in (i) the probit regression, (ii) threshold regression, (iii) smooth regression, (iv) logistic link regression, (v) Box-Cox transformed regressors.

Stokes's theorem, also known as the Kelvin–Stokes theorem after Lord Kelvin and George Stokes, the fundamental theorem for curls or simply the curl theorem, is a theorem in vector calculus on $R 3$ . Given a vector field, the theorem relates the integral of the curl of the vector field over some surface, to the line integral of the vector field around the boundary of the surface. The classical Stokes' theorem can be stated in one sentence: The line integral of a vector field over a loop is equal to the flux of its curl through the enclosed surface.

Linear least squares (LLS) is the least squares approximation of linear functions to data. It is a set of formulations for solving statistical problems involved in linear regression, including variants for ordinary (unweighted), weighted, and generalized (correlated) residuals. Numerical methods for linear least squares include inverting the matrix of the normal equations and orthogonal decomposition methods.

In mathematics, a polyadic space is a topological space that is the image under a continuous function of a topological power of an Alexandroff one-point compactification of a discrete space.

Short integer solution (SIS) and ring-SIS problems are two average-case problems that are used in lattice-based cryptography constructions. Lattice-based cryptography began in 1996 from a seminal work by Miklós Ajtai who presented a family of one-way functions based on SIS problem. He showed that it is secure in an average case if the shortest vector problem $is hard in a worst-case scenario.$

References

Apolloni, B.; Malchiodi, D.; Gaito, S. (2006). Algorithmic Inference in Machine Learning. International Series on Advanced Intelligence. Vol. 5 (2nd ed.). Adelaide: Magill. Advanced Knowledge International
Apolloni, B.; Chiaravalli, S. (1997). "PAC learning of concept classes through the boundaries of their items". Theoretical Computer Science. 172 (1–2): 91–120. doi: 10.1016/S0304-3975(95)00240-5 .

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.