Loop inversion

Last updated February 20, 2025

In computer science, loop inversion is a compiler optimization and loop transformation in which a while loop is replaced by an if block containing a do..while loop.^[1] When used correctly,^{[ clarification needed ]} it may improve performance due to instruction pipelining ^{[ citation needed ]} or avoiding jump instructions to reduce branch mis-prediction.^[1]

Example in Java

voidpre_inversion(){while(/* condition */){/* loop body */}}

is equivalent to:

voidpost_inversion(){if(/* condition */){do{/* loop body */}while(/* condition */);}}

No change in performance occurs for initial and non-final iterations through the loop, including when the loop is not entered because the condition is false. However, the final iteration has 2 fewer jump instructions when the loop is entered because the do-while loop does not need to jump to the start of the while loop to evaluate the loop condition.^[1]

Example in C

inti,a[100];i=0;while(i<100){a[i]=0;i++;}

is equivalent to:

inti,a[100];i=0;if(i<100){do{a[i]=0;i++;}while(i<100);}

Despite the seemingly greater complexity of the second example, it may actually run faster on modern CPUs because they use an instruction pipeline. By nature, any jump in the code causes a pipeline stall, which is a detriment to performance. ^{[ citation needed ]}

Additionally, loop inversion allows safe loop-invariant code motion.^{[ citation needed ]}^{[ clarification needed ]}

Example in three-address code

^{[ clarification needed ]}

      i := 0  L1:  if i >= 100 goto L2       a[i] := 0       i := i + 1       goto L1  L2:

If i had been initialized at 100, the instructions executed at runtime would have been:

  if i >= 100   goto L2

Let us assume that i had been initialized to some value less than 100. Now let us look at the instructions executed at the moment after i has been incremented to 99 in the loop:

  goto L1   if i < 100   a[i] := 0   i := i + 1   goto L1   if i >= 100   goto L2   <<at L2>>

Now, let's look at the optimized version:

      i := 0       if i >= 100 goto L2  L1:  a[i] := 0       i := i + 1       if i < 100 goto L1  L2:

Again, let's look at the instructions executed if i is initialized to 100:

  if i >= 100   goto L2

We didn't waste any cycles compared to the original version. Now consider the case where i has been incremented to 99:

  if i < 100   goto L1   a[i] := 0   i := i + 1   if i < 100   <<at L2>>

As you can see, two gotos (and thus, two pipeline stalls) have been eliminated in the execution.

References

1 2 3 Jubb, Chae, Loop Optimizations in Modern C Compilers (PDF)

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[x86_loop_optimizations-1] 1 2 3 Jubb, Chae, Loop Optimizations in Modern C Compilers (PDF)

[1]

v t e Compiler optimizations
Basic block	Peephole optimization Local value numbering Loop nest optimization Loop splitting Loop unrolling Loop unswitching Software pipelining Strength reduction
Data-flow analysis	Available expression Common subexpression elimination Constant folding Dead store elimination Induction variable recognition and elimination Live-variable analysis Upwards exposed uses Use-define chain Reaching definitions
SSA-based	Global value numbering Sparse conditional constant propagation
Code generation	Instruction scheduling Instruction selection Register allocation Rematerialization
Functional	Deforestation Tail-call elimination
Global	Interprocedural optimization
Other	Bounds-checking elimination Compile-time function execution Dead-code elimination Expression templates Inline expansion Jump threading Partial evaluation Profile-guided optimization
Static analysis	Alias analysis Array-access analysis Control-flow analysis Data-flow analysis Dependence analysis Escape analysis Pointer analysis Shape analysis Value range analysis

Loop inversion

Contents

Example in Java

Example in C

Example in three-address code

References