AVL tree

Type: Tree
Invented: 1962
Invented by: Georgy Adelson-Velsky and Evgenii Landis

Complexities in big O notation
Space: O(n)
Function | Amortized | Worst case
Search | O(log n) [1] | O(log n) [1]
Insert | O(log n) [1] | O(log n) [1]
Delete | O(log n) [1] | O(log n) [1]
Animation showing the insertion of several elements into an AVL tree. It includes left, right, left-right and right-left rotations.

Fig. 1: AVL tree with balance factors (green)

In computer science, an AVL tree (named after inventors Adelson-Velsky and Landis) is a self-balancing binary search tree. In an AVL tree, the heights of the two child subtrees of any node differ by at most one; if at any time they differ by more than one, rebalancing is done to restore this property. Lookup, insertion, and deletion all take O(log n) time in both the average and worst cases, where n is the number of nodes in the tree prior to the operation. Insertions and deletions may require the tree to be rebalanced by one or more tree rotations.

History

The AVL tree is named after its two Soviet inventors, Georgy Adelson-Velsky and Evgenii Landis, who published it in their 1962 paper "An algorithm for the organization of information". [2] It is the oldest self-balancing binary search tree data structure to be invented. [3]

AVL trees are often compared with red–black trees because both support the same set of operations and take O(log n) time for the basic operations. For lookup-intensive applications, AVL trees are faster than red–black trees because they are more strictly balanced. [4] Similar to red–black trees, AVL trees are height-balanced. Both are, in general, neither weight-balanced nor μ-balanced for any μ ≤ ½; [5] that is, sibling nodes can have hugely differing numbers of descendants.

Definition

Balance factor

In a binary tree the balance factor of a node X is defined to be the height difference

BF(X) := Height(RightSubtree(X)) − Height(LeftSubtree(X)) [6] :459

of its two child sub-trees. A binary tree is defined to be an AVL tree if the invariant

BF(X) ∈ {−1, 0, +1} [7]

holds for every node X in the tree.

A node X with BF(X) < 0 is called "left-heavy", one with BF(X) > 0 is called "right-heavy", and one with BF(X) = 0 is sometimes simply called "balanced".
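The definition translates directly into code. The following is a minimal C sketch (the node layout and function names are illustrative, not taken from the snippets later in this article) that computes heights recursively and checks the invariant; real implementations store the balance factor in each node instead of recomputing it:

#include <stddef.h>

struct node {
    struct node *left, *right;
    int key;
};

/* Height counted as the maximal number of levels: empty tree = 0. */
static int height(const struct node *n) {
    if (n == NULL) return 0;
    int hl = height(n->left), hr = height(n->right);
    return 1 + (hl > hr ? hl : hr);
}

/* Balance factor: Height(RightSubtree) - Height(LeftSubtree). */
static int balance_factor(const struct node *n) {
    return height(n->right) - height(n->left);
}

/* AVL invariant: BF(X) is in {-1, 0, +1} for every node X. */
static int is_avl(const struct node *n) {
    if (n == NULL) return 1;
    int bf = balance_factor(n);
    return bf >= -1 && bf <= 1 && is_avl(n->left) && is_avl(n->right);
}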

Properties

Balance factors can be kept up-to-date by knowing the previous balance factors and the change in height – it is not necessary to know the absolute height. For holding the AVL balance information, two bits per node are sufficient. [8]
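As one possible illustration of the two-bit claim: the stored balance factor only ever takes the values −1, 0 and +1, since the temporary ±2 that arises during insertion or deletion is repaired by a rotation before it would be written back. A hypothetical node layout (a sketch, not a canonical one):

struct avl_node {
    struct avl_node *left, *right, *parent;
    int key;
    signed int bf : 2;  /* stored balance factor in {-1, 0, +1};
                           the temporary +-2 that triggers a rotation
                           is never written back into the node */
};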

The height h (counted as the maximal number of levels) of an AVL tree with n nodes lies in the interval: [6] :460

log₂(n+1) ≤ h < c·log₂(n+2) + b

where φ := (1+√5)/2 ≈ 1.618 is the golden ratio, c := 1/log₂φ ≈ 1.44 and b := (c/2)·log₂5 − 2 ≈ −0.328. This is because an AVL tree of height h contains at least F(h+2) − 1 nodes, where F is the Fibonacci sequence with the seed values F(1) = F(2) = 1.
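The Fibonacci connection is easy to check numerically. The sketch below (names are illustrative) computes the minimal node count N(h) of an AVL tree of height h from the recurrence N(h) = 1 + N(h−1) + N(h−2), which equals F(h+2) − 1:

#include <stdio.h>

/* Minimal node count of an AVL tree of height h (levels):
   N(0) = 0, N(1) = 1, N(h) = 1 + N(h-1) + N(h-2) = F(h+2) - 1. */
static unsigned long min_nodes(unsigned h) {
    if (h == 0) return 0;
    if (h == 1) return 1;
    return 1 + min_nodes(h - 1) + min_nodes(h - 2);
}

int main(void) {
    /* prints 1, 2, 4, 7, 12, 20, ... = F(h+2) - 1 */
    for (unsigned h = 1; h <= 10; h++)
        printf("h=%2u  min nodes = %lu\n", h, min_nodes(h));
    return 0;
}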

Operations

Read-only operations of an AVL tree involve carrying out the same actions as would be carried out on an unbalanced binary search tree, but modifications have to observe and restore the height balance of the sub-trees.

Searching

Searching for a specific key in an AVL tree can be done the same way as that of any balanced or unbalanced binary search tree. [9] :ch. 8 In order for search to work effectively it has to employ a comparison function which establishes a total order (or at least a total preorder) on the set of keys. [10] :23 The number of comparisons required for successful search is limited by the height h and for unsuccessful search is very close to h, so both are in O(log n). [11] :216
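A minimal C sketch of such a search (node layout and names are illustrative); it is the standard binary-search-tree descent, with the AVL invariant only guaranteeing that the path followed has O(log n) length:

#include <stddef.h>

struct node {
    struct node *left, *right;
    int key;
};

/* Search exactly as in any binary search tree; integer comparison
   here stands in for an arbitrary total-order comparison function. */
static struct node *avl_search(struct node *root, int key) {
    while (root != NULL) {
        if (key < root->key)
            root = root->left;
        else if (key > root->key)
            root = root->right;
        else
            return root;   /* successful search */
    }
    return NULL;           /* unsuccessful search */
}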

Traversal

As a read-only operation the traversal of an AVL tree functions the same way as on any other binary tree. Exploring all n nodes of the tree visits each link exactly twice: one downward visit to enter the subtree rooted by that node, another visit upward to leave that node's subtree after having explored it.

Once a node has been found in an AVL tree, the next or previous node can be accessed in amortized constant time. [12] :58 Some instances of exploring these "nearby" nodes require traversing up to h ∝ log(n) links (particularly when navigating from the rightmost leaf of the root's left subtree to the root or from the root to the leftmost leaf of the root's right subtree; in the AVL tree of figure 1, navigating from node P to the next-to-the-right node Q takes 3 steps). Since there are n−1 links in any tree, the amortized cost is 2×(n−1)/n, or approximately 2.
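The following C sketch (assuming parent pointers, which the insert and delete snippets below also use; names are illustrative) finds the in-order successor. A single call may climb up to h links, but a complete traversal crosses each of the n−1 edges exactly twice, which gives the amortized constant bound stated above:

#include <stddef.h>

struct node {
    struct node *left, *right, *parent;
    int key;
};

/* In-order successor via parent pointers. */
static struct node *successor(struct node *x) {
    if (x->right != NULL) {              /* go right once, ... */
        x = x->right;
        while (x->left != NULL)          /* ... then left all the way down */
            x = x->left;
        return x;
    }
    struct node *p = x->parent;
    while (p != NULL && x == p->right) { /* climb while coming from the right */
        x = p;
        p = p->parent;
    }
    return p;                            /* NULL if x was the maximum */
}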

Insert

When inserting a node into an AVL tree, you initially follow the same process as inserting into a binary search tree. If the tree is empty, then the node is inserted as the root of the tree. If the tree is not empty, then we descend from the root, recursively searching for the location to insert the new node. This traversal is guided by the comparison function. In this case, the node always replaces a NULL reference (left or right) of an external node in the tree, i.e., the node is either made a left-child or a right-child of the external node.
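A hedged C sketch of this descent (helper names and the tree representation are illustrative; duplicate keys are sent to the right for simplicity). It attaches the new leaf Z in place of a NULL reference, after which the retracing loop shown further below starts at parent(Z):

#include <stddef.h>
#include <stdlib.h>

struct node {
    struct node *left, *right, *parent;
    int key;
    int bf;  /* balance factor; 0 for a fresh leaf */
};

/* Descend from the root, guided by the comparison function, and attach
   the new node Z where a NULL reference is found. */
static struct node *bst_attach(struct node **root, int key) {
    struct node *parent = NULL, **link = root;
    while (*link != NULL) {
        parent = *link;
        link = key < parent->key ? &parent->left : &parent->right;
    }
    struct node *Z = calloc(1, sizeof *Z); /* leaf: bf == 0, children NULL */
    Z->key = key;
    Z->parent = parent;
    *link = Z;                             /* Z replaces a NULL reference */
    return Z;                              /* retracing starts at parent(Z) */
}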

After this insertion, if a tree becomes unbalanced, only ancestors of the newly inserted node are unbalanced. This is because only those nodes have their sub-trees altered. [13] So it is necessary to check each of the node's ancestors for consistency with the invariants of AVL trees: this is called "retracing". This is achieved by considering the balance factor of each node. [6] :458–481 [12] :108

Since with a single insertion the height of an AVL subtree cannot increase by more than one, the temporary balance factor of a node after an insertion will be in the range [–2,+2]. For each node checked, if the temporary balance factor remains in the range from –1 to +1 then only an update of the balance factor and no rotation is necessary. However, if the temporary balance factor is ±2, the subtree rooted at this node is AVL unbalanced, and a rotation is needed. [10] :52 With insertion, as the code below shows, the adequate rotation immediately and perfectly rebalances the tree.

In figure 1, by inserting the new node Z as a child of node X the height of that subtree Z increases from 0 to 1.

Invariant of the retracing loop for an insertion

The height of the subtree rooted by Z has increased by 1. It is already in AVL shape.

Example code for an insert operation
for (X = parent(Z); X != null; X = parent(Z)) { // Loop (possibly up to the root)
    // BF(X) has to be updated:
    if (Z == right_child(X)) { // The right subtree increases
        if (BF(X) > 0) { // X is right-heavy
            // ==> the temporary BF(X) == +2
            // ==> rebalancing is required.
            G = parent(X); // Save parent of X around rotations
            if (BF(Z) < 0)                  // Right Left Case  (see figure 3)
                N = rotate_RightLeft(X, Z); // Double rotation: Right(Z) then Left(X)
            else                            // Right Right Case (see figure 2)
                N = rotate_Left(X, Z);      // Single rotation Left(X)
            // After rotation adapt parent link
        } else {
            if (BF(X) < 0) {
                BF(X) = 0; // Z’s height increase is absorbed at X.
                break;     // Leave the loop
            }
            BF(X) = +1;
            Z = X; // Height(Z) increases by 1
            continue;
        }
    } else { // Z == left_child(X): the left subtree increases
        if (BF(X) < 0) { // X is left-heavy
            // ==> the temporary BF(X) == -2
            // ==> rebalancing is required.
            G = parent(X); // Save parent of X around rotations
            if (BF(Z) > 0)                  // Left Right Case
                N = rotate_LeftRight(X, Z); // Double rotation: Left(Z) then Right(X)
            else                            // Left Left Case
                N = rotate_Right(X, Z);     // Single rotation Right(X)
            // After rotation adapt parent link
        } else {
            if (BF(X) > 0) {
                BF(X) = 0; // Z’s height increase is absorbed at X.
                break;     // Leave the loop
            }
            BF(X) = -1;
            Z = X; // Height(Z) increases by 1
            continue;
        }
    }
    // After a rotation adapt parent link:
    // N is the new root of the rotated subtree
    // Height does not change: Height(N) == old Height(X)
    parent(N) = G;
    if (G != null) {
        if (X == left_child(G))
            left_child(G) = N;
        else
            right_child(G) = N;
    } else
        tree->root = N; // N is the new root of the total tree
    break;
    // There is no fall thru, only break; or continue;
}
// Unless the loop is left via break, the height of the total tree increases by 1.

In order to update the balance factors of all nodes, first observe that all nodes requiring correction lie from child to parent along the path of the inserted leaf. If the above procedure is applied to nodes along this path, starting from the leaf, then every node in the tree will again have a balance factor of −1, 0, or 1.

The retracing can stop if the balance factor becomes 0 implying that the height of that subtree remains unchanged.

If the balance factor becomes ±1 then the height of the subtree increases by one and the retracing needs to continue.

If the balance factor temporarily becomes ±2, this has to be repaired by an appropriate rotation after which the subtree has the same height as before (and its root the balance factor 0).

The time required is O(log n) for lookup, plus a maximum of O(log n) retracing levels (O(1) on average) on the way back to the root, so the operation can be completed in O(log n) time. [10] :53

Delete

The preliminary steps for deleting a node are described in section Binary search tree#Deletion. There, the effective deletion of the subject node or the replacement node decreases the height of the corresponding child tree either from 1 to 0 or from 2 to 1, if that node had a child.
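A hedged C sketch of these preliminary steps (illustrative names; the key-copying variant shown is one of several possibilities): a node with two children is first exchanged with its in-order successor, so that the node actually unlinked has at most one child, whose subtree then loses height as described:

#include <stddef.h>

struct node {
    struct node *left, *right, *parent;
    int key;
    int bf;
};

/* Select the node to physically unlink when deleting X. */
static struct node *node_to_unlink(struct node *X) {
    if (X->left == NULL || X->right == NULL)
        return X;              /* at most one child: unlink X itself */
    struct node *s = X->right; /* in-order successor ... */
    while (s->left != NULL)
        s = s->left;           /* ... is the leftmost node on the right */
    X->key = s->key;           /* move successor's payload into X */
    return s;                  /* unlink the successor instead */
}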

Starting at this subtree, it is necessary to check each of the ancestors for consistency with the invariants of AVL trees. This is called "retracing".

Since with a single deletion the height of an AVL subtree cannot decrease by more than one, the temporary balance factor of a node will be in the range from −2 to +2. If the balance factor remains in the range from −1 to +1 it can be adjusted in accord with the AVL rules. If it becomes ±2 then the subtree is unbalanced and needs to be rotated. (Unlike insertion where a rotation always balances the tree, after delete, there may be BF(Z) ≠ 0 (see figures 2 and 3), so that after the appropriate single or double rotation the height of the rebalanced subtree decreases by one meaning that the tree has to be rebalanced again on the next higher level.) The various cases of rotations are described in section Rebalancing.

Invariant of the retracing loop for a deletion

The height of the subtree rooted by N has decreased by 1. It is already in AVL shape.

Example code for a delete operation
for (X = parent(N); X != null; X = G) { // Loop (possibly up to the root)
    G = parent(X); // Save parent of X around rotations
    // BF(X) has not yet been updated!
    if (N == left_child(X)) { // the left subtree decreases
        if (BF(X) > 0) { // X is right-heavy
            // ==> the temporary BF(X) == +2
            // ==> rebalancing is required.
            Z = right_child(X); // Sibling of N (higher by 2)
            b = BF(Z);
            if (b < 0)                      // Right Left Case  (see figure 3)
                N = rotate_RightLeft(X, Z); // Double rotation: Right(Z) then Left(X)
            else                            // Right Right Case (see figure 2)
                N = rotate_Left(X, Z);      // Single rotation Left(X)
            // After rotation adapt parent link
        } else {
            if (BF(X) == 0) {
                BF(X) = +1; // N’s height decrease is absorbed at X.
                break;      // Leave the loop
            }
            N = X;
            BF(N) = 0; // Height(N) decreases by 1
            continue;
        }
    } else { // (N == right_child(X)): The right subtree decreases
        if (BF(X) < 0) { // X is left-heavy
            // ==> the temporary BF(X) == -2
            // ==> rebalancing is required.
            Z = left_child(X); // Sibling of N (higher by 2)
            b = BF(Z);
            if (b > 0)                      // Left Right Case
                N = rotate_LeftRight(X, Z); // Double rotation: Left(Z) then Right(X)
            else                            // Left Left Case
                N = rotate_Right(X, Z);     // Single rotation Right(X)
            // After rotation adapt parent link
        } else {
            if (BF(X) == 0) {
                BF(X) = -1; // N’s height decrease is absorbed at X.
                break;      // Leave the loop
            }
            N = X;
            BF(N) = 0; // Height(N) decreases by 1
            continue;
        }
    }
    // After a rotation adapt parent link:
    // N is the new root of the rotated subtree
    parent(N) = G;
    if (G != null) {
        if (X == left_child(G))
            left_child(G) = N;
        else
            right_child(G) = N;
    } else
        tree->root = N; // N is the new root of the total tree
    if (b == 0)
        break; // Height does not change: Leave the loop
    // Height(N) decreases by 1 (== old Height(X)-1)
}
// If (b != 0) the height of the total tree decreases by 1.

The retracing can stop if the balance factor becomes ±1 (it must have been 0) meaning that the height of that subtree remains unchanged.

If the balance factor becomes 0 (it must have been ±1) then the height of the subtree decreases by one and the retracing needs to continue.

If the balance factor temporarily becomes ±2, this has to be repaired by an appropriate rotation. It depends on the balance factor of the sibling Z (the higher child tree in figure 2) whether the height of the subtree decreases by one –and the retracing needs to continue– or does not change (if Z has the balance factor 0) and the whole tree is in AVL-shape.

The time required is O(log n) for lookup, plus a maximum of O(log n) retracing levels (O(1) on average) on the way back to the root, so the operation can be completed in O(log n) time.

Set operations and bulk operations

In addition to the single-element insert, delete and lookup operations, several set operations have been defined on AVL trees: union, intersection and set difference. Then fast bulk operations on insertions or deletions can be implemented based on these set functions. These set operations rely on two helper operations, Split and Join. With the new operations, the implementation of AVL trees can be more efficient and highly-parallelizable. [14]

The function Join on two AVL trees t1 and t2 and a key k will return a tree containing all elements in t1 and t2, as well as k. It requires k to be greater than all keys in t1 and smaller than all keys in t2. If the two trees differ in height by at most one, Join simply creates a new node with left subtree t1, root k and right subtree t2. Otherwise, suppose that t1 is higher than t2 by more than one (the other case is symmetric). Join follows the right spine of t1 until a node c which is balanced with t2. At this point a new node with left child c, root k and right child t2 is created to replace c. The new node satisfies the AVL invariant, and its height is one greater than c's. The increase in height can increase the height of its ancestors, possibly invalidating the AVL invariant of those nodes. This can be fixed either with a double rotation if invalid at the parent or a single left rotation if invalid higher in the tree, in both cases restoring the height for any further ancestor nodes. Join will therefore require at most two rotations. The cost of this function is the difference of the heights between the two input trees.

Pseudocode implementation for the Join algorithm
function JoinRightAVL(TL, k, TR)
    (l, k', c) = expose(TL)
    if (Height(c) <= Height(TR)+1)
        T' = Node(c, k, TR)
        if (Height(T') <= Height(l)+1) then return Node(l, k', T')
        else return rotateLeft(Node(l, k', rotateRight(T')))
    else
        T' = JoinRightAVL(c, k, TR)
        T'' = Node(l, k', T')
        if (Height(T') <= Height(l)+1) return T''
        else return rotateLeft(T'')
function JoinLeftAVL(TL, k, TR)   /* symmetric to JoinRightAVL */
function Join(TL, k, TR)
    if (Height(TL) > Height(TR)+1) return JoinRightAVL(TL, k, TR)
    if (Height(TR) > Height(TL)+1) return JoinLeftAVL(TL, k, TR)
    return Node(TL, k, TR)

Here Height(v) is the height of a subtree (node) v. (l,k,r) = expose(v) extracts v's left child l, the key k of v's root, and the right child r. Node(l,k,r) means to create a node of left child l, key k, and right child r.

To split an AVL tree into two smaller trees, those smaller than key k and those greater than key k, first draw a path from the root by inserting k into the AVL tree. After this insertion, all values less than k will be found on the left of the path, and all values greater than k will be found on the right. By applying Join, all the subtrees on the left side are merged bottom-up, using keys on the path as intermediate nodes, to form the left tree; the right part is handled symmetrically. The cost of Split is O(log n), the order of the height of the tree.

Pseudocode implementation for the Split algorithm
function Split(T, k)
    if (T = nil) return (nil, false, nil)
    (L, m, R) = expose(T)
    if (k = m) return (L, true, R)
    if (k < m)
        (L', b, R') = Split(L, k)
        return (L', b, Join(R', m, R))
    if (k > m)
        (L', b, R') = Split(R, k)
        return (Join(L, m, L'), b, R')

The union of two AVL trees t1 and t2 representing sets A and B is an AVL tree t that represents A ∪ B.

Pseudocode implementation for the Union algorithm
function Union(t1, t2):
    if t1 = nil:
        return t2
    if t2 = nil:
        return t1
    (t<, b, t>) = Split(t2, t1.root)
    return Join(Union(left(t1), t<), t1.root, Union(right(t1), t>))

Here, Split is presumed to return two trees: one holding the keys less than its input key, one holding the greater keys. (The algorithm is non-destructive, but an in-place destructive version exists as well.)

The algorithm for intersection or difference is similar, but requires the Join2 helper routine that is the same as Join but without the middle key. Based on the new functions for union, intersection or difference, either one key or multiple keys can be inserted to or deleted from the AVL tree. Since Split calls Join but does not deal with the balancing criteria of AVL trees directly, such an implementation is usually called the "join-based" implementation.
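A possible sketch of these derived operations, in the pseudocode style of the snippets above; this follows the join-based framework of [14] rather than code from this article, and SplitLast is an assumed helper that removes and returns the greatest key of a tree:

function SplitLast(T)
    (L, k, R) = expose(T)
    if (R = nil) return (L, k)
    (T', k') = SplitLast(R)
    return (Join(L, k, T'), k')

function Join2(TL, TR)      /* Join without the middle key */
    if (TL = nil) return TR
    (TL', k') = SplitLast(TL)
    return Join(TL', k', TR)

function Insert(T, k)
    (L, b, R) = Split(T, k)
    return Join(L, k, R)

function Delete(T, k)
    (L, b, R) = Split(T, k)
    return Join2(L, R)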

The complexity of each of union, intersection and difference is O(m·log(n/m + 1)) for AVL trees of sizes m and n (m ≤ n). More importantly, since the recursive calls to union, intersection or difference are independent of each other, they can be executed in parallel with a parallel depth O(log m · log n). [14] When m = 1, the join-based implementation has the same computational DAG as single-element insertion and deletion.

Rebalancing

If during a modifying operation the height difference between two child subtrees changes, this may, as long as it is < 2, be reflected by an adaptation of the balance information at the parent. During insert and delete operations a (temporary) height difference of 2 may arise, which means that the parent subtree has to be "rebalanced". The given repair tools are the so-called tree rotations, because they move the keys only "vertically", so that the ("horizontal") in-order sequence of the keys is fully preserved (which is essential for a binary-search tree). [6] :458–481 [12] :33

Let X be the node that has a (temporary) balance factor of −2 or +2. Its left or right subtree was modified. Let Z be the child with the higher subtree (see figures 2 and 3). Note that both children are in AVL shape by induction hypothesis.

In case of insertion, the insertion has happened to one of Z's children in a way that Z's height has increased. In case of deletion, the deletion has happened to the sibling t1 of Z in such a way that t1's height, already being lower, has decreased. (This is the only case where Z's balance factor may also be 0.)

There are four possible variants of the violation:

Right Right ⟹ Z is a right child of its parent X and BF(Z) ≥ 0
Left Left ⟹ Z is a left child of its parent X and BF(Z) ≤ 0
Right Left ⟹ Z is a right child of its parent X and BF(Z) < 0
Left Right ⟹ Z is a left child of its parent X and BF(Z) > 0

And the rebalancing is performed differently:

Right Right ⟹ X is rebalanced with a simple rotation rotate_Left (see figure 2)
Left Left ⟹ X is rebalanced with a simple rotation rotate_Right (mirror-image of figure 2)
Right Left ⟹ X is rebalanced with a double rotation rotate_RightLeft (see figure 3)
Left Right ⟹ X is rebalanced with a double rotation rotate_LeftRight (mirror-image of figure 3)

Thereby, the situations are denoted as C B, where C (= child direction) and B (= balance) come from the set { Left, Right} with Right := −Left. The balance violation of case C == B is repaired by a simple rotation rotate_(−C), whereas the case C != B is repaired by a double rotation rotate_CB.

The cost of a rotation, either simple or double, is constant.

Simple rotation

Figure 2 shows a Right Right situation. In its upper half, node X has two child trees and a balance factor of +2. Moreover, the inner child t23 of Z (i.e., left child when Z is right child, or right child when Z is left child) is not higher than its sibling t4. This can happen by a height increase of subtree t4 or by a height decrease of subtree t1. In the latter case, also the pale situation where t23 has the same height as t4 may occur.

The result of the left rotation is shown in the lower half of the figure. Three links (thick edges in figure 2) and two balance factors are to be updated.

As the figure shows, before an insertion, the leaf layer was at level h+1, temporarily at level h+2 and after the rotation again at level h+1. In case of a deletion, the leaf layer was at level h+2, where it is again, when t23 and t4 were of same height. Otherwise the leaf layer reaches level h+1, so that the height of the rotated tree decreases.

Fig. 2: Simple rotation rotate_Left(X,Z)
Code snippet of a simple left rotation
Input:  X = root of subtree to be rotated left
        Z = right child of X, Z is right-heavy
            with height == Height(LeftSubtree(X))+2
Result: new root of rebalanced subtree
node* rotate_Left(node* X, node* Z) {
    // Z is by 2 higher than its sibling
    t23 = left_child(Z); // Inner child of Z
    right_child(X) = t23;
    if (t23 != null)
        parent(t23) = X;
    left_child(Z) = X;
    parent(X) = Z;
    // 1st case, BF(Z) == 0,
    //   only happens with deletion, not insertion:
    if (BF(Z) == 0) { // t23 has been of same height as t4
        BF(X) = +1;   // t23 now higher
        BF(Z) = -1;   // t4 now lower than X
    } else {
        // 2nd case happens with insertion or deletion:
        BF(X) = 0;
        BF(Z) = 0;
    }
    return Z; // return new root of rotated subtree
}

Double rotation

Figure 3 shows a Right Left situation. In its upper third, node X has two child trees and a balance factor of +2. But unlike figure 2, the inner child Y of Z is higher than its sibling t4. This can happen by the insertion of Y itself or a height increase of one of its subtrees t2 or t3 (with the consequence that they are of different height) or by a height decrease of subtree t1. In the latter case, it may also occur that t2 and t3 are of the same height.

The result of the first, the right, rotation is shown in the middle third of the figure. (With respect to the balance factors, this rotation is not of the same kind as the other AVL single rotations, because the height difference between Y and t4 is only 1.) The result of the final left rotation is shown in the lower third of the figure. Five links (thick edges in figure 3) and three balance factors are to be updated.

As the figure shows, before an insertion, the leaf layer was at level h+1, temporarily at level h+2 and after the double rotation again at level h+1. In case of a deletion, the leaf layer was at level h+2 and after the double rotation it is at level h+1, so that the height of the rotated tree decreases.

Fig. 3: Double rotation rotate_RightLeft(X,Z)
= rotate_Right around Z followed by
rotate_Left around X
Code snippet of a right-left double rotation
Input:  X = root of subtree to be rotated
        Z = its right child, left-heavy
            with height == Height(LeftSubtree(X))+2
Result: new root of rebalanced subtree
node* rotate_RightLeft(node* X, node* Z) {
    // Z is by 2 higher than its sibling
    Y = left_child(Z); // Inner child of Z
    // Y is by 1 higher than sibling
    t3 = right_child(Y);
    left_child(Z) = t3;
    if (t3 != null)
        parent(t3) = Z;
    right_child(Y) = Z;
    parent(Z) = Y;
    t2 = left_child(Y);
    right_child(X) = t2;
    if (t2 != null)
        parent(t2) = X;
    left_child(Y) = X;
    parent(X) = Y;
    // 1st case, BF(Y) == 0
    if (BF(Y) == 0) {
        BF(X) = 0;
        BF(Z) = 0;
    } else if (BF(Y) > 0) { // t3 was higher
        BF(X) = -1;         // t1 now higher
        BF(Z) = 0;
    } else {                // t2 was higher
        BF(X) = 0;
        BF(Z) = +1;         // t4 now higher
    }
    BF(Y) = 0;
    return Y; // return new root of rotated subtree
}

Comparison to other structures

Both AVL trees and red–black (RB) trees are self-balancing binary search trees and they are related mathematically. Indeed, every AVL tree can be colored red–black, [15] but there are RB trees which are not AVL balanced. For maintaining the AVL (or RB) tree's invariants, rotations play an important role. In the worst case, even without rotations, AVL or RB insertions or deletions require O(log n) inspections and/or updates to AVL balance factors (or RB colors). RB insertions and deletions and AVL insertions require from zero to three tail-recursive rotations and run in amortized O(1) time, [16] :pp.165,158 [17] thus equally constant on average. AVL deletions requiring O(log n) rotations in the worst case are also O(1) on average. RB trees require storing one bit of information (the color) in each node, while AVL trees mostly use two bits for the balance factor, although, when stored at the children, one bit with meaning «lower than sibling» suffices. The bigger difference between the two data structures is their height limit.

For a tree of size n ≥ 1:

an AVL tree's height is at most c·log₂(n+2) + b,
a red–black tree's height is at most 2·log₂(n+1), [18]

where φ := (1+√5)/2 is the golden ratio, c := 1/log₂φ ≈ 1.44 and b := (c/2)·log₂5 − 2 ≈ −0.328.

AVL trees are more rigidly balanced than RB trees with an asymptotic relation AVL/RB ≈0.720 of the maximal heights. For insertions and deletions, Ben Pfaff shows in 79 measurements a relation of AVL/RB between 0.677 and 1.077 with median ≈0.947 and geometric mean ≈0.910. [4]

See also

Binary search tree
Binary tree
B-tree
Red–black tree
Splay tree
Treap
Self-balancing binary search tree
Search tree
T-tree
Scapegoat tree
Interval tree
AA tree
Leftist tree
Tree rotation (left and right rotations)
Weight-balanced tree
Fractal tree index
WAVL tree
List-labeling problem
Join-based tree algorithms

References

  1. Eric Alexander. "AVL Trees". Archived from the original on July 31, 2019.
  2. Adelson-Velsky, Georgy; Landis, Evgenii (1962). "An algorithm for the organization of information". Proceedings of the USSR Academy of Sciences (in Russian). 146: 263–266. English translation by Myron J. Ricci in Soviet Mathematics - Doklady, 3:1259–1263, 1962.
  3. Sedgewick, Robert (1983). "Balanced Trees". Algorithms. Addison-Wesley. p. 199. ISBN 0-201-06672-6.
  4. Pfaff, Ben (June 2004). "Performance Analysis of BSTs in System Software" (PDF). Stanford University.
  5. AVL trees are not weight-balanced? (meaning: AVL trees are not μ-balanced?)
     Thereby: a binary tree is called μ-balanced, with 0 ≤ μ ≤ ½, if for every node N the inequality
     ½ − μ ≤ (|Nl| + 1) / (|N| + 1) ≤ ½ + μ
     holds and μ is minimal with this property, where |N| is the number of nodes below the tree with N as root (including the root) and Nl is the left child node of N.
  6. Knuth, Donald E. (2000). Sorting and Searching (2nd ed., 6th printing, newly updated and revised). Boston: Addison-Wesley. ISBN 0-201-89685-0.
  7. Rajinikanth. "AVL Tree : Data Structures". btechsmartclass.com. Retrieved 2018-03-09.
  8. However, the balance information can be kept in the child nodes as one bit indicating whether the parent is higher by 1 or by 2; thereby higher by 2 cannot occur for both children. This way the AVL tree is a "rank balanced" tree, as coined by Haeupler, Sen and Tarjan.
  9. Dixit, J. B. (2010). Mastering data structures through 'C' language. New Delhi, India: University Science Press, an imprint of Laxmi Publications Pvt. Ltd. ISBN   9789380386720. OCLC   939446542.
  10. Brass, Peter (2008). Advanced Data Structures. Cambridge: Cambridge University Press. ISBN 9780511438202. OCLC 312435417.
  11. Hubbard, John Rast (2000). Schaum's Outline of Theory and Problems of Data Structures with Java. New York: McGraw-Hill. ISBN 0071378707. OCLC 48139308.
  12. Pfaff, Ben (2004). An Introduction to Binary Search Trees and Balanced Trees. Free Software Foundation, Inc.
  13. Weiss, Mark Allen (2006). Data Structures and Algorithm Analysis in C++ (3rd ed.). Boston: Pearson Addison-Wesley. p. 145. ISBN 0-321-37531-9. OCLC 61278554.
  14. Blelloch, Guy E.; Ferizovic, Daniel; Sun, Yihan (2016). "Just join for parallel ordered sets". Symposium on Parallel Algorithms and Architectures. ACM. pp. 253–264. arXiv:1602.02120. doi:10.1145/2935764.2935768. ISBN 978-1-4503-4210-0. S2CID 2897793.
  15. Paul E. Black (2015-04-13). "AVL tree". Dictionary of Algorithms and Data Structures. National Institute of Standards and Technology. Retrieved 2016-07-02.
  16. Mehlhorn, Kurt; Sanders, Peter (2008). Algorithms and Data Structures. Berlin, Heidelberg: Springer. doi:10.1007/978-3-540-77978-0. ISBN 978-3-540-77977-3.
  17. Mehta, Dinesh P.; Sahni, Sartaj, eds. (2017-12-15). Handbook of Data Structures and Applications (2nd ed.). New York: Chapman and Hall/CRC. doi:10.1201/9781315119335. ISBN 978-1-315-11933-5.
  18. Red–black tree#Proof of bounds
