Armstrong's axioms

Last updated December 06, 2024

Armstrong's axioms are a set of axioms (or, more precisely, inference rules) used to infer all the functional dependencies on a relational database. They were developed by William W. Armstrong in his 1974 paper.^[1] The axioms are sound in generating only functional dependencies in the closure of a set of functional dependencies (denoted as $F^{+}$ ) when applied to that set (denoted as $F$ ). They are also complete in that repeated application of these rules will generate all functional dependencies in the closure $F^{+}$ .

More formally, let $\langle R(U),F\rangle$ denote a relational scheme over the set of attributes $U$ with a set of functional dependencies $F$ . We say that a functional dependency $f$ is logically implied by $F$ , and denote it with $F\models f$ if and only if for every instance $r$ of $R$ that satisfies the functional dependencies in $F$ , $r$ also satisfies $f$ . We denote by $F^{+}$ the set of all functional dependencies that are logically implied by $F$ .

Furthermore, with respect to a set of inference rules $A$ , we say that a functional dependency $f$ is derivable from the functional dependencies in $F$ by the set of inference rules $A$ , and we denote it by $F\vdash _{A}f$ if and only if $f$ is obtainable by means of repeatedly applying the inference rules in $A$ to functional dependencies in $F$ . We denote by $F_{A}^{*}$ the set of all functional dependencies that are derivable from $F$ by inference rules in $A$ .

Then, a set of inference rules $A$ is sound if and only if the following holds:

$F_{A}^{*}\subseteq F^{+}$

that is to say, we cannot derive by means of $A$ functional dependencies that are not logically implied by $F$ . The set of inference rules $A$ is said to be complete if the following holds:

$F^{+}\subseteq F_{A}^{*}$

more simply put, we are able to derive by $A$ all the functional dependencies that are logically implied by $F$ .

Axioms (primary rules)

Let $R(U)$ be a relation scheme over the set of attributes $U$ . Henceforth we will denote by letters $X$ , $Y$ , $Z$ any subset of $U$ and, for short, the union of two sets of attributes $X$ and $Y$ by $XY$ instead of the usual $X\cup Y$ ; this notation is rather standard in database theory when dealing with sets of attributes.

Axiom of reflexivity

If $X$ is a set of attributes and $Y$ is a subset of $X$ , then $X$ holds $Y$ . Hereby, $X$ holds $Y$ [ $X\to Y$ ] means that $X$ functionally determines $Y$ .

If

Y\subseteq X

then

X\to Y

.

Axiom of augmentation

If $X$ holds $Y$ and $Z$ is a set of attributes, then $XZ$ holds $YZ$ . It means that attribute in dependencies does not change the basic dependencies.

If

X\to Y

, then

XZ\to YZ

for any

Z

.

Axiom of transitivity

If $X$ holds $Y$ and $Y$ holds $Z$ , then $X$ holds $Z$ .

If

X\to Y

and

Y\to Z

, then

X\to Z

.

Additional rules (Secondary Rules)

These rules can be derived from the above axioms.

Decomposition

If $X\to YZ$ then $X\to Y$ and $X\to Z$ .

Proof

1. $X\to YZ$	(Given)
2. $YZ\to Y$	(Reflexivity)
3. $X\to Y$	(Transitivity of 1 & 2)

Composition

If $X\to Y$ and $A\to B$ then $XA\to YB$ .

Proof

1. $X\to Y$	(Given)
2. $A\to B$	(Given)
3. $XA\to YA$	(Augmentation of 1 & A)
4. $YA\to YB$	(Augmentation of 2 & Y)
5. $XA\to YB$	(Transitivity of 3 and 4)

Union

If $X\to Y$ and $X\to Z$ then $X\to YZ$ .

Proof

1. $X\to Y$	(Given)
2. $X\to Z$	(Given)
3. $X\to XZ$	(Augmentation of 2 & X)
4. $XZ\to YZ$	(Augmentation of 1 & Z)
5. $X\to YZ$	(Transitivity of 3 and 4)

Pseudo transitivity

If $X\to Y$ and $YZ\to W$ then $XZ\to W$ .

Proof

1. $X\to Y$	(Given)
2. $YZ\to W$	(Given)
3. $XZ\to YZ$	(Augmentation of 1 & Z)
4. $XZ\to W$	(Transitivity of 3 and 2)

Self determination

$I\to I$ for any $I$ . This follows directly from the axiom of reflexivity.

Extensivity

The following property is a special case of augmentation when $Z=X$ .

If

X\to Y

, then

X\to XY

.

Extensivity can replace augmentation as axiom in the sense that augmentation can be proved from extensivity together with the other axioms.

Proof

1. $XZ\to X$	(Reflexivity)
2. $X\to Y$	(Given)
3. $XZ\to Y$	(Transitivity of 1 & 2)
4. $XZ\to XYZ$	(Extensivity of 3)
5. $XYZ\to YZ$	(Reflexivity)
6. $XZ\to YZ$	(Transitivity of 4 & 5)

Armstrong relation

Given a set of functional dependencies $F$ , an Armstrong relation is a relation which satisfies all the functional dependencies in the closure $F^{+}$ and only those dependencies. Unfortunately, the minimum-size Armstrong relation for a given set of dependencies can have a size which is an exponential function of the number of attributes in the dependencies considered.^[2]

Related Research Articles

First-order logic—also called predicate logic, predicate calculus, quantificational logic—is a collection of formal systems used in mathematics, philosophy, linguistics, and computer science. First-order logic uses quantified variables over non-logical objects, and allows the use of sentences that contain variables. Rather than propositions such as "all men are mortal", in first-order logic one can have expressions in the form "for all x, if x is a man, then x is mortal"; where "for all x" is a quantifier, x is a variable, and "... is a man" and "... is mortal" are predicates. This distinguishes it from propositional logic, which does not use quantifiers or relations; in this sense, propositional logic is the foundation of first-order logic.

The relational model (RM) is an approach to managing data using a structure and language consistent with first-order predicate logic, first described in 1969 by English computer scientist Edgar F. Codd, where all data is represented in terms of tuples, grouped into relations. A database organized in terms of the relational model is a relational database.

In topology, the closure of a subset $S$ of points in a topological space consists of all points in $S$ together with all limit points of $S$ . The closure of $S$ may equivalently be defined as the union of $S$ and its boundary, and also as the intersection of all closed sets containing $S$ . Intuitively, the closure can be thought of as all the points that are either in $S$ or "very near" $S$ . A point which is in the closure of $S$ is a point of closure of $S$ . The notion of closure is in many ways dual to the notion of interior.

Cox's theorem, named after the physicist Richard Threlkeld Cox, is a derivation of the laws of probability theory from a certain set of postulates. This derivation justifies the so-called "logical" interpretation of probability, as the laws of probability derived by Cox's theorem are applicable to any proposition. Logical probability is a type of Bayesian probability. Other forms of Bayesianism, such as the subjective interpretation, are given other justifications.

In set theory, Zermelo–Fraenkel set theory, named after mathematicians Ernst Zermelo and Abraham Fraenkel, is an axiomatic system that was proposed in the early twentieth century in order to formulate a theory of sets free of paradoxes such as Russell's paradox. Today, Zermelo–Fraenkel set theory, with the historically controversial axiom of choice (AC) included, is the standard form of axiomatic set theory and as such is the most common foundation of mathematics. Zermelo–Fraenkel set theory with the axiom of choice included is abbreviated ZFC, where C stands for "choice", and ZF refers to the axioms of Zermelo–Fraenkel set theory with the axiom of choice excluded.

In relational database theory, a functional dependency is the following constraint between two attribute sets in a relation: Given a relation R and attribute sets $, X is said to functionally determine Y if each X value is associated with precisely one Y value. R is then said to satisfy the functional dependency X \to Y . Equivalently, the projection is a function, that is, Y is a function of X . In simple words, if the values for the X attributes are known, then the values for the Y attributes corresponding to x can be determined by looking them up in any tuple of R containing x . Customarily X is called the determinant set and Y the dependent set. A functional dependency FD: X \to Y is called trivial if Y is a subset of X .$

In category theory, a category is Cartesian closed if, roughly speaking, any morphism defined on a product of two objects can be naturally identified with a morphism defined on one of the factors. These categories are particularly important in mathematical logic and the theory of programming, in that their internal language is the simply typed lambda calculus. They are generalized by closed monoidal categories, whose internal language, linear type systems, are suitable for both quantum and classical computation.

In mathematics, there are several equivalent ways of defining the real numbers. One of them is that they form a complete ordered field that does not contain any smaller complete ordered field. Such a definition does not prove that such a complete ordered field exists, and the existence proof consists of constructing a mathematical structure that satisfies the definition.

In the foundations of mathematics, von Neumann–Bernays–Gödel set theory (NBG) is an axiomatic set theory that is a conservative extension of Zermelo–Fraenkel–choice set theory (ZFC). NBG introduces the notion of class, which is a collection of sets defined by a formula whose quantifiers range only over sets. NBG can define classes that are larger than sets, such as the class of all sets and the class of all ordinals. Morse–Kelley set theory (MK) allows classes to be defined by formulas whose quantifiers range over classes. NBG is finitely axiomatizable, while ZFC and MK are not.

In functional analysis and related branches of mathematics, the Banach–Alaoglu theorem states that the closed unit ball of the dual space of a normed vector space is compact in the weak* topology. A common proof identifies the unit ball with the weak-* topology as a closed subset of a product of compact sets with the product topology. As a consequence of Tychonoff's theorem, this product, and hence the unit ball within, is compact.

In mathematics, a Moufang loop is a special kind of algebraic structure. It is similar to a group in many ways but need not be associative. Moufang loops were introduced by Ruth Moufang. Smooth Moufang loops have an associated algebra, the Malcev algebra, similar in some ways to how a Lie group has an associated Lie algebra.

Tarski's axioms are an axiom system for Euclidean geometry, specifically for that portion of Euclidean geometry that is formulable in first-order logic with identity. As such, it does not require an underlying set theory. The only primitive objects of the system are "points" and the only primitive predicates are "betweenness" and "congruence". The system contains infinitely many axioms.

In set theory, $-induction$ , also called epsilon-induction or set-induction, is a principle that can be used to prove that all sets satisfy a given property. Considered as an axiomatic principle, it is called the axiom schema of set induction.

Axiomatic constructive set theory is an approach to mathematical constructivism following the program of axiomatic set theory. The same first-order language with " $" and " " of classical set theory is usually used, so this is not to be confused with a constructive types approach. On the other hand, some constructive theories are indeed motivated by their interpretability in type theories.$

Tarski–Grothendieck set theory is an axiomatic set theory. It is a non-conservative extension of Zermelo–Fraenkel set theory (ZFC) and is distinguished from other axiomatic set theories by the inclusion of Tarski's axiom, which states that for each set there is a "Tarski universe" it belongs to. Tarski's axiom implies the existence of inaccessible cardinals, providing a richer ontology than ZFC. For example, adding this axiom supports category theory.

In database theory, a multivalued dependency is a full constraint between two sets of attributes in a relation.

In mathematics, an ordered vector space or partially ordered vector space is a vector space equipped with a partial order that is compatible with the vector space operations.

A canonical cover $for F is a set of dependencies such that F logically implies all dependencies in, and logically implies all dependencies in F.$

The counting lemmas this article discusses are statements in combinatorics and graph theory. The first one extracts information from $-regular pairs of subsets of vertices in a graph, in order to guarantee patterns in the entire graph; more explicitly, these patterns correspond to the count of copies of a certain graph in . The second counting lemma provides a similar yet more general notion on the space of graphons, in which a scalar of the cut distance between two graphs is correlated to the homomorphism density between them and .$

In mathematics, specifically the field of abstract algebra, Bergman's Diamond Lemma is a method for confirming whether a given set of monomials of an algebra forms a $-basis. It is an extension of Gröbner bases to non-commutative rings. The proof of the lemma gives rise to an algorithm for obtaining a non-commutative Gröbner basis of the algebra from its defining relations. However, in contrast to Buchberger's algorithm, in the non-commutative case, this algorithm may not terminate.$

References

↑ William Ward Armstrong: Dependency Structures of Data Base Relationships , page 580-583. IFIP Congress, 1974.
↑ Beeri, C.; Dowd, M.; Fagin, R.; Statman, R. (1984). "On the Structure of Armstrong Relations for Functional Dependencies" (PDF). Journal of the ACM. 31: 30–46. CiteSeerX 10.1.1.68.9320 . doi:10.1145/2422.322414. Archived from the original (PDF) on 2018-07-23.

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] William Ward Armstrong: Dependency Structures of Data Base Relationships , page 580-583. IFIP Congress, 1974.

[2] Beeri, C.; Dowd, M.; Fagin, R.; Statman, R. (1984). "On the Structure of Armstrong Relations for Functional Dependencies" (PDF). Journal of the ACM. 31: 30–46. CiteSeerX 10.1.1.68.9320 . doi:10.1145/2422.322414. Archived from the original (PDF) on 2018-07-23.

[1]

[2]

v t e Database management systems
Types	Object-oriented comparison Relational list comparison Key–value Column-oriented list Document-oriented Wide-column store Graph NoSQL NewSQL In-memory list Multi-model comparison Cloud Blockchain-based database
Concepts	Database ACID Armstrong's axioms Codd's 12 rules CAP theorem CRUD Null Candidate key Foreign key PACELC theorem Superkey Surrogate key Unique key
Objects	Relation table column row View Transaction Transaction log Trigger Index Stored procedure Cursor Partition
Components	Concurrency control Data dictionary JDBC XQJ ODBC Query language Query optimizer Query rewriting system Query plan
Functions	Administration Query optimization Replication Sharding
Related topics	Database models Database normalization Database storage Distributed database Federated database system Referential integrity Relational algebra Relational calculus Relational model Object–relational database Transaction processing
Category Outline

v t e Database normalization
Unnormalized form (UNF) First normal form (1NF) Second normal form (2NF) Third normal form (3NF) Elementary key normal form (EKNF) Boyce–Codd normal form (3.5NF / BCNF) Fourth normal form (4NF) Fifth normal form (5NF / PJNF) Domain-key normal form (DKNF) Sixth normal form (6NF)
Dependencies Functional dependency Multivalued dependency Join dependency Lossless join decomposition Temporal database
Denormalization

Armstrong's axioms

Contents

Axioms (primary rules)

Axiom of reflexivity

Axiom of augmentation

Axiom of transitivity

Additional rules (Secondary Rules)

Decomposition

Proof

Composition

Proof

Union

Proof

Pseudo transitivity

Proof

Self determination

Extensivity

Proof

Armstrong relation

Related Research Articles

References

External links