Ciphertext stealing

Last updated January 14, 2024

In cryptography, ciphertext stealing (CTS) is a general method of using a block cipher mode of operation that allows for processing of messages that are not evenly divisible into blocks without resulting in any expansion of the ciphertext, at the cost of slightly increased complexity.

General characteristics

Ciphertext stealing is a technique for encrypting plaintext using a block cipher, without padding the message to a multiple of the block size, so the ciphertext is the same size as the plaintext.

It does this by altering processing of the last two blocks of the message. The processing of all but the last two blocks is unchanged, but a portion of the second-to-last block's ciphertext is "stolen" to pad the last plaintext block. The padded final block is then encrypted as usual.

The final ciphertext, for the last two blocks, consists of the partial penultimate block (with the "stolen" portion omitted) plus the full final block, which are the same size as the original plaintext.

Decryption requires decrypting the final block first, then restoring the stolen ciphertext to the penultimate block, which can then be decrypted as usual.

In principle any block-oriented block cipher mode of operation can be used, but stream-cipher-like modes can already be applied to messages of arbitrary length without padding, so they do not benefit from this technique. The common modes of operation that are coupled with ciphertext stealing are Electronic Codebook (ECB) and Cipher Block Chaining (CBC).

Ciphertext stealing for ECB mode requires the plaintext to be longer than one block. A possible workaround is to use a stream-cipher-like block cipher mode of operation, such as CTR, CFB or OFB, when the plaintext length is one block or less.

Ciphertext stealing for CBC mode does not necessarily require the plaintext to be longer than one block. When the plaintext is one block long or less, the initialization vector (IV) can act as the prior block of ciphertext, in which case a modified IV must be sent to the receiver. This may not be possible when the IV cannot be freely chosen by the sender at the time the ciphertext is sent (e.g., when the IV is a derived or pre-established value); in that case ciphertext stealing for CBC mode can only be applied to plaintexts longer than one block.

To implement CTS encryption or decryption for data of unknown length, the implementation must delay processing (and buffer) the two most recent blocks of data, so that they can be properly processed at the end of the data stream.
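The buffering requirement can be sketched as a small generator. This is only an illustration under stated assumptions: the name hold_back_tail, the "body"/"tail" labels and the byte-oriented 16-byte block size are hypothetical and not part of any standard. Blocks are released to the ordinary mode only once more than two blocks of data are buffered, so the final one to two blocks always remain available for the special CTS handling.

```python
from typing import Iterable, Iterator, Tuple

def hold_back_tail(chunks: Iterable[bytes], block: int = 16) -> Iterator[Tuple[str, bytes]]:
    """Yield ("body", block) for data that can use the normal mode,
    then a single ("tail", data) holding the last (up to) two blocks."""
    buf = b""
    for chunk in chunks:
        buf += chunk
        # Release a block only while more than two blocks are buffered,
        # so the final full block and the (possibly partial) last block stay behind.
        while len(buf) > 2 * block:
            yield "body", buf[:block]
            buf = buf[block:]
    yield "tail", buf

# 45 bytes arriving in irregular chunks: one body block, then a 29-byte tail
# (a full 16-byte block followed by a 13-byte partial block).
for kind, data in hold_back_tail([b"a" * 10, b"b" * 30, b"c" * 5]):
    print(kind, len(data))
```

If the whole stream is longer than one block, the tail is always between one block plus one byte and two full blocks long, which matches the inputs expected by the CTS steps.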

Ciphertext format

There are several different ways to arrange the ciphertext for transmission. The ciphertext bits are the same in all cases, just transmitted in a different order, so the choice has no security implications; it is purely one of implementation convenience.

The numbering here is taken from Dworkin, who describes all three. The third is the most popular and is the one described by Daemen and Schneier; Meyer describes a related but incompatible scheme (with respect to bit ordering and key use).

CS1

Arguably the most obvious way to arrange the ciphertext is to transmit the truncated penultimate block, followed by the full final block. This is not convenient for the receiver for two reasons:

The receiver must decrypt the final block first in any case, and
This results in the final block not being aligned on a natural boundary, complicating hardware implementations.

This does have the advantage that, if the plaintext length happens to be a multiple of the block size, the ciphertext is identical to that of the original mode of operation without ciphertext stealing.

CS2

It is often more convenient to swap the final two ciphertext blocks, so the ciphertext ends with the full final block, followed by the truncated penultimate block. This results in naturally aligned ciphertext blocks.

In order to maintain compatibility with the non-stealing modes, option CS2 performs this swap only if the amount of stolen ciphertext is non-zero, i.e. the original message was not a multiple of the block size.

This maintains natural alignment, and compatibility with the non-stealing modes, but requires treating the cases of aligned and unaligned message size differently.

CS3

The most popular alternative swaps the final two ciphertext blocks unconditionally. This is the ordering used in the descriptions below.
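The three arrangements differ only in the order of the last two ciphertext pieces, which the following sketch makes explicit. The function name arrange_tail and its parameter names are hypothetical, and lengths are in bytes with an assumed 16-byte block. Here "truncated" is the shortened penultimate ciphertext block and "full" is the complete final ciphertext block, in the sense used in the CS1 description.

```python
BLOCK = 16  # assumed block size in bytes

def arrange_tail(truncated: bytes, full: bytes, variant: str) -> bytes:
    """Order the last two ciphertext pieces according to CS1, CS2 or CS3."""
    if len(full) != BLOCK or not 1 <= len(truncated) <= BLOCK:
        raise ValueError("unexpected block lengths")
    if variant == "CS1":
        # Natural order: truncated penultimate block, then the full final block.
        return truncated + full
    if variant == "CS2":
        # Swap only when something was actually stolen (message not block-aligned).
        return truncated + full if len(truncated) == BLOCK else full + truncated
    if variant == "CS3":
        # Swap unconditionally.
        return full + truncated
    raise ValueError("unknown variant")

print(arrange_tail(b"T" * 5, b"F" * 16, "CS1"))
print(arrange_tail(b"T" * 5, b"F" * 16, "CS3"))
```

For a block-aligned message, CS1 and CS2 give the same byte string as the underlying mode, while CS3 still swaps the last two blocks.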

Ciphertext stealing mode description

In order to encrypt or decrypt data, use the standard block cipher mode of operation on all but the last two blocks of data.

The following steps describe how to handle the last two blocks of the plaintext, called P_n−1 and P_n, where the length of P_n−1 equals the block size of the cipher in bits, B; the length of the last block, P_n, is M bits; and K is the key that is in use. M can range from 1 to B, inclusive, so P_n could possibly be a complete block. The CBC mode description also makes use of the ciphertext block just previous to the blocks concerned, C_n−2, which may in fact be the IV if the plaintext fits within two blocks.

For this description, the following functions and operators are used:

Head (data, a): returns the first a bits of the 'data' string.
Tail (data, a): returns the last a bits of the 'data' string.
Encrypt (K, data): use the underlying block cipher in encrypt mode on the 'data' string using the key K.
Decrypt (K, data): use the underlying block cipher in decrypt mode on the 'data' string using the key K.
XOR: Bitwise Exclusive-OR. Equivalent to bitwise addition without use of a carry bit.
||: Concatenation operator. Combine the strings on either side of the operator.
0^a: a string of a 0 bits.
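For readers who prefer code, the string operators above map onto Python byte strings roughly as follows. This is a sketch that assumes lengths are counted in bytes rather than bits; the function names are illustrative. Encrypt and Decrypt are omitted because they depend on the chosen block cipher, and concatenation (||) is simply the + operator on bytes.

```python
def head(data: bytes, a: int) -> bytes:
    """First a bytes of data."""
    return data[:a]

def tail(data: bytes, a: int) -> bytes:
    """Last a bytes of data (written so that a == 0 gives an empty string)."""
    return data[len(data) - a:]

def xor(x: bytes, y: bytes) -> bytes:
    """Bytewise exclusive-OR of two equal-length strings."""
    if len(x) != len(y):
        raise ValueError("length mismatch")
    return bytes(p ^ q for p, q in zip(x, y))

def zeros(a: int) -> bytes:
    """A string of a zero bytes (the 0^a of the text)."""
    return bytes(a)

print(head(b"abcdef", 2), tail(b"abcdef", 2), zeros(3))
```

Note that data[-a:] would be wrong for a == 0, because it returns the whole string; that case arises when the last block is complete (M = B).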

ECB ciphertext stealing

Ciphertext stealing in ECB mode introduces an inter-block dependency within the last two blocks, resulting in altered error propagation behavior for the last two blocks.

ECB encryption steps (see figure)

Figure: ECB encryption steps for CTS.

1. E_n−1 = Encrypt (K, P_n−1). Encrypt P_n−1 to create E_n−1. This is equivalent to the behavior of standard ECB mode.
2. C_n = Head (E_n−1, M). Select the first M bits of E_n−1 to create C_n. The final ciphertext block, C_n, is composed of the leading M bits of the second-to-last ciphertext block. In all cases, the last two blocks are sent in a different order than the corresponding plaintext blocks.
3. D_n = P_n || Tail (E_n−1, B−M). Pad P_n with the low order bits from E_n−1.
4. C_n−1 = Encrypt (K, D_n). Encrypt D_n to create C_n−1. For the first M bits, this is equivalent to what would happen in ECB mode (other than the ciphertext ordering). For the last B−M bits, this is the second time that these data have been encrypted under this key (they were already encrypted in the production of E_n−1 in step 1).
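The four ECB encryption steps can be sketched in Python as shown here. Everything in this block is an assumption for illustration: lengths are in bytes rather than bits, and encrypt_block is a toy 4-round Feistel construction built on SHA-256 that stands in for a real block cipher. It is not secure and is used only so the example runs with the standard library.

```python
import hashlib

BLOCK = 16  # B in bytes; the toy cipher below is fixed to 16-byte blocks

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy Feistel block cipher (illustration only, NOT secure)."""
    left, right = block[:8], block[8:]
    for i in range(4):
        f = hashlib.sha256(key + bytes([i]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right

def ecb_cts_encrypt_tail(key: bytes, p_prev: bytes, p_last: bytes):
    """Encrypt the last two plaintext blocks; returns (C_{n-1}, C_n)."""
    m = len(p_last)                          # M, here counted in bytes
    if len(p_prev) != BLOCK or not 1 <= m <= BLOCK:
        raise ValueError("P_{n-1} must be a full block and P_n 1..BLOCK bytes")
    e_prev = encrypt_block(key, p_prev)      # step 1: E_{n-1}
    c_last = e_prev[:m]                      # step 2: Head(E_{n-1}, M)
    d_last = p_last + e_prev[m:]             # step 3: P_n || Tail(E_{n-1}, B-M)
    c_prev = encrypt_block(key, d_last)      # step 4: C_{n-1}
    return c_prev, c_last

c_prev, c_last = ecb_cts_encrypt_tail(b"demo key", b"sixteen byte blk", b"tail!")
print(len(c_prev), len(c_last))  # 16 5
```

The two outputs together are exactly as long as the two plaintext blocks, which is the point of the technique.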

ECB decryption steps

1. D_n = Decrypt (K, C_n−1). Decrypt C_n−1 to create D_n. This undoes step 4 of the encryption process.
2. E_n−1 = C_n || Tail (D_n, B−M). Pad C_n with the extracted ciphertext in the tail end of D_n (placed there in step 3 of the ECB encryption process).
3. P_n = Head (D_n, M). Select the first M bits of D_n to create P_n. As described in step 3 of the ECB encryption process, the first M bits of D_n contain P_n. We queue this last (possibly partial) block for eventual output.
4. P_n−1 = Decrypt (K, E_n−1). Decrypt E_n−1 to create P_n−1. This reverses encryption step 1.
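The ECB decryption steps can be sketched the same way. As before, this is a sketch under assumptions: lengths are in bytes, and encrypt_block and decrypt_block are a toy SHA-256-based Feistel cipher (not secure) standing in for a real block cipher. The test ciphertext is built by following the encryption steps by hand.

```python
import hashlib

BLOCK = 16  # B in bytes; the toy cipher below is fixed to 16-byte blocks

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy Feistel block cipher (illustration only, NOT secure)."""
    left, right = block[:8], block[8:]
    for i in range(4):
        f = hashlib.sha256(key + bytes([i]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right

def decrypt_block(key: bytes, block: bytes) -> bytes:
    """Inverse of encrypt_block: the same rounds in reverse order."""
    left, right = block[:8], block[8:]
    for i in reversed(range(4)):
        f = hashlib.sha256(key + bytes([i]) + left).digest()[:8]
        left, right = xor(right, f), left
    return left + right

def ecb_cts_decrypt_tail(key: bytes, c_prev: bytes, c_last: bytes):
    """Decrypt (C_{n-1}, C_n); returns (P_{n-1}, P_n)."""
    m = len(c_last)
    d_last = decrypt_block(key, c_prev)   # step 1: D_n
    e_prev = c_last + d_last[m:]          # step 2: C_n || Tail(D_n, B-M)
    p_last = d_last[:m]                   # step 3: Head(D_n, M)
    p_prev = decrypt_block(key, e_prev)   # step 4: P_{n-1}
    return p_prev, p_last

# Build a ciphertext by hand from the encryption steps, then decrypt it.
key, p_prev, p_last = b"demo key", b"sixteen byte blk", b"tail!"
e_prev = encrypt_block(key, p_prev)
c_last = e_prev[:5]
c_prev = encrypt_block(key, p_last + e_prev[5:])
print(ecb_cts_decrypt_tail(key, c_prev, c_last) == (p_prev, p_last))  # True
```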

ECB ciphertext stealing error propagation

A bit error in the transmission of C_n−1 would result in the block-wide corruption of both P_n−1 and P_n. A bit error in the transmission of C_n would result in the block-wide corruption of P_n−1. This is a significant change from ECB's error propagation behavior.
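The error propagation described above can be observed with a small experiment. This is a sketch only: the cipher is a toy SHA-256-based Feistel construction (not secure), lengths are in bytes, and the helper names encrypt_tail, decrypt_tail, flip_bit and bit_diff are hypothetical. The exact number of wrong bits depends on the cipher, so no specific counts are claimed.

```python
import hashlib

BLOCK = 16  # B in bytes

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy Feistel block cipher (illustration only, NOT secure)."""
    left, right = block[:8], block[8:]
    for i in range(4):
        f = hashlib.sha256(key + bytes([i]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right

def decrypt_block(key: bytes, block: bytes) -> bytes:
    left, right = block[:8], block[8:]
    for i in reversed(range(4)):
        f = hashlib.sha256(key + bytes([i]) + left).digest()[:8]
        left, right = xor(right, f), left
    return left + right

def encrypt_tail(key, p_prev, p_last):
    """ECB-CTS encryption of the last two blocks: returns (C_{n-1}, C_n)."""
    m = len(p_last)
    e_prev = encrypt_block(key, p_prev)
    return encrypt_block(key, p_last + e_prev[m:]), e_prev[:m]

def decrypt_tail(key, c_prev, c_last):
    """ECB-CTS decryption of the last two blocks: returns (P_{n-1}, P_n)."""
    m = len(c_last)
    d_last = decrypt_block(key, c_prev)
    return decrypt_block(key, c_last + d_last[m:]), d_last[:m]

def flip_bit(data: bytes, bit: int) -> bytes:
    out = bytearray(data)
    out[bit // 8] ^= 0x80 >> (bit % 8)
    return bytes(out)

def bit_diff(a: bytes, b: bytes) -> int:
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))

key, p_prev, p_last = b"demo key", b"sixteen byte blk", b"tail!"
c_prev, c_last = encrypt_tail(key, p_prev, p_last)

# Case 1: one flipped bit in C_n. P_n survives; P_{n-1} is corrupted.
prev_1, last_1 = decrypt_tail(key, c_prev, flip_bit(c_last, 3))
# Case 2: one flipped bit in C_{n-1}. D_n changes, which feeds both outputs.
prev_2, last_2 = decrypt_tail(key, flip_bit(c_prev, 3), c_last)

print("error in C_n:   wrong bits in P_n-1 / P_n:",
      bit_diff(prev_1, p_prev), bit_diff(last_1, p_last))
print("error in C_n-1: wrong bits in P_n-1 / P_n:",
      bit_diff(prev_2, p_prev), bit_diff(last_2, p_last))
```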

CBC ciphertext stealing

In CBC, there is already interaction between processing of different adjacent blocks, so CTS has less conceptual impact in this mode. Error propagation is affected.

CBC encryption steps

1. X_n−1 = P_n−1 XOR C_n−2. Exclusive-OR P_n−1 with the previous ciphertext block, C_n−2, to create X_n−1. This is equivalent to the behavior of standard CBC mode.
2. E_n−1 = Encrypt (K, X_n−1). Encrypt X_n−1 to create E_n−1. This is equivalent to the behavior of standard CBC mode.
3. C_n = Head (E_n−1, M). Select the first M bits of E_n−1 to create C_n. The final ciphertext block, C_n, is composed of the leading M bits of the second-to-last ciphertext block. In all cases, the last two blocks are sent in a different order than the corresponding plaintext blocks.
4. P = P_n || 0^(B−M). Pad P_n with zeros at the end to create P of length B. The zero padding in this step is important for step 5.
5. D_n = E_n−1 XOR P. Exclusive-OR E_n−1 with P to create D_n. For the first M bits of the block, this is equivalent to CBC mode; the first M bits of the previous block's ciphertext, E_n−1, are XORed with the M bits of plaintext of the last plaintext block. The zero padding of P in step 4 was important, because it makes the XOR operation's effect on the last B−M bits equivalent to copying the last B−M bits of E_n−1 to the end of D_n. These are the same bits that were stripped off of E_n−1 in step 3 when C_n was created.
6. C_n−1 = Encrypt (K, D_n). Encrypt D_n to create C_n−1. For the first M bits, this is equivalent to what would happen in CBC mode (other than the ciphertext ordering). For the last B−M bits, this is the second time that these data have been encrypted under this key (they were already encrypted in the production of E_n−1 in step 2).
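The six CBC encryption steps translate into Python as in the following sketch. The assumptions are the same as for any runnable illustration of this kind: lengths are in bytes rather than bits, and encrypt_block is a toy SHA-256-based Feistel cipher (not secure) used in place of a real block cipher. The parameter c_prev2 is C_n−2, or the IV when the message fits within two blocks.

```python
import hashlib

BLOCK = 16  # B in bytes; the toy cipher below is fixed to 16-byte blocks

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy Feistel block cipher (illustration only, NOT secure)."""
    left, right = block[:8], block[8:]
    for i in range(4):
        f = hashlib.sha256(key + bytes([i]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right

def cbc_cts_encrypt_tail(key: bytes, c_prev2: bytes, p_prev: bytes, p_last: bytes):
    """Encrypt the last two plaintext blocks; returns (C_{n-1}, C_n)."""
    m = len(p_last)                         # M, here counted in bytes
    if len(c_prev2) != BLOCK or len(p_prev) != BLOCK or not 1 <= m <= BLOCK:
        raise ValueError("unexpected block lengths")
    x_prev = xor(p_prev, c_prev2)           # step 1: X_{n-1}
    e_prev = encrypt_block(key, x_prev)     # step 2: E_{n-1}
    c_last = e_prev[:m]                     # step 3: Head(E_{n-1}, M)
    p = p_last + bytes(BLOCK - m)           # step 4: P_n || 0^(B-M)
    d_last = xor(e_prev, p)                 # step 5: D_n
    c_prev = encrypt_block(key, d_last)     # step 6: C_{n-1}
    return c_prev, c_last

iv = bytes(range(BLOCK))  # fixed IV purely for the demonstration
c_prev, c_last = cbc_cts_encrypt_tail(b"demo key", iv, b"sixteen byte blk", b"tail!")
print(len(c_prev), len(c_last))  # 16 5
```

When P_n is a complete block, the result is the ordinary CBC ciphertext of the two blocks with their order swapped.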

CBC decryption steps

1. D_n = Decrypt (K, C_n−1). Decrypt C_n−1 to create D_n. This undoes step 6 of the encryption process.
2. C = C_n || 0^(B−M). Pad C_n with zeros at the end to create a block C of length B. We are padding C_n with zeros to help in step 3.
3. X_n = D_n XOR C. Exclusive-OR D_n with C to create X_n. Looking at the first M bits, this step has the result of XORing C_n (the first M bits of the encryption process' E_n−1) with the (now decrypted) P_n XOR Head (E_n−1, M) (see steps 4-5 of the encryption process). In other words, we have CBC decrypted the first M bits of P_n. Looking at the last B−M bits, this recovers the last B−M bits of E_n−1.
4. P_n = Head (X_n, M). Select the first M bits of X_n to create P_n. As described in step 3, the first M bits of X_n contain P_n. We queue this last (possibly partial) block for eventual output.
5. E_n−1 = C_n || Tail (X_n, B−M). Append the tail (B−M) bits of X_n to C_n to create E_n−1. As described in step 3, E_n−1 is composed of all of C_n (which is M bits long) appended with the last B−M bits of X_n. We reassemble E_n−1 (which is the same E_n−1 seen in the encryption process) for processing in step 6.
6. X_n−1 = Decrypt (K, E_n−1). Decrypt E_n−1 to create X_n−1. This reverses encryption step 2. X_n−1 is the same as in the encryption process.
7. P_n−1 = X_n−1 XOR C_n−2. Exclusive-OR X_n−1 with the previous ciphertext block, C_n−2, to create P_n−1. Finally, we reverse the XOR step from step 1 of the encryption process.
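The seven CBC decryption steps can be sketched as follows. This is an illustration under assumptions, not a reference implementation: lengths are in bytes, and encrypt_block and decrypt_block are a toy SHA-256-based Feistel cipher (not secure). The demonstration ciphertext is produced by following the CBC encryption steps by hand.

```python
import hashlib

BLOCK = 16  # B in bytes; the toy cipher below is fixed to 16-byte blocks

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy Feistel block cipher (illustration only, NOT secure)."""
    left, right = block[:8], block[8:]
    for i in range(4):
        f = hashlib.sha256(key + bytes([i]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right

def decrypt_block(key: bytes, block: bytes) -> bytes:
    left, right = block[:8], block[8:]
    for i in reversed(range(4)):
        f = hashlib.sha256(key + bytes([i]) + left).digest()[:8]
        left, right = xor(right, f), left
    return left + right

def cbc_cts_decrypt_tail(key: bytes, c_prev2: bytes, c_prev: bytes, c_last: bytes):
    """Decrypt (C_{n-1}, C_n) given C_{n-2} (or the IV); returns (P_{n-1}, P_n)."""
    m = len(c_last)
    d_last = decrypt_block(key, c_prev)     # step 1: D_n
    c = c_last + bytes(BLOCK - m)           # step 2: C_n || 0^(B-M)
    x_last = xor(d_last, c)                 # step 3: X_n
    p_last = x_last[:m]                     # step 4: Head(X_n, M)
    e_prev = c_last + x_last[m:]            # step 5: C_n || Tail(X_n, B-M)
    x_prev = decrypt_block(key, e_prev)     # step 6: X_{n-1}
    p_prev = xor(x_prev, c_prev2)           # step 7: P_{n-1}
    return p_prev, p_last

# Build a ciphertext by hand from the encryption steps, then decrypt it.
key, iv = b"demo key", bytes(range(BLOCK))
p_prev, p_last = b"sixteen byte blk", b"tail!"
e_prev = encrypt_block(key, xor(p_prev, iv))
c_last = e_prev[:5]
c_prev = encrypt_block(key, xor(e_prev, p_last + bytes(BLOCK - 5)))
print(cbc_cts_decrypt_tail(key, iv, c_prev, c_last) == (p_prev, p_last))  # True
```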

CBC implementation notes

For CBC ciphertext stealing, there is a clever (but opaque) method of implementing the described ciphertext stealing process using a standard CBC interface. Using this method imposes a performance penalty in the decryption stage of one extra block decryption operation over what would be necessary using a dedicated implementation.

CBC ciphertext stealing encryption using a standard CBC interface

1. Pad the last partial plaintext block with zero bits to a full block.
2. Encrypt the whole padded plaintext using the standard CBC mode.
3. Swap the last two ciphertext blocks.
4. Truncate the ciphertext to the length of the original plaintext.
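The pad, encrypt, swap and truncate procedure can be sketched on top of a plain CBC routine. All names here are hypothetical: cbc_encrypt stands for whatever standard CBC interface is available, and encrypt_block is a toy SHA-256-based Feistel cipher (not secure) so that the example is self-contained. Lengths are in bytes, and this sketch handles only messages longer than one block.

```python
import hashlib

BLOCK = 16  # B in bytes; the toy cipher below is fixed to 16-byte blocks

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy Feistel block cipher (illustration only, NOT secure)."""
    left, right = block[:8], block[8:]
    for i in range(4):
        f = hashlib.sha256(key + bytes([i]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right

def cbc_encrypt(key: bytes, iv: bytes, data: bytes) -> bytes:
    """Standard CBC encryption; data must be a multiple of BLOCK bytes."""
    out, prev = [], iv
    for i in range(0, len(data), BLOCK):
        prev = encrypt_block(key, xor(data[i:i + BLOCK], prev))
        out.append(prev)
    return b"".join(out)

def cbc_cts_encrypt(key: bytes, iv: bytes, plaintext: bytes) -> bytes:
    n = len(plaintext)
    if n <= BLOCK:
        raise ValueError("this sketch needs more than one block of plaintext")
    padded = plaintext + bytes(-n % BLOCK)                 # step 1: zero padding
    ct = cbc_encrypt(key, iv, padded)                      # step 2: standard CBC
    body, c_a, c_b = ct[:-2 * BLOCK], ct[-2 * BLOCK:-BLOCK], ct[-BLOCK:]
    swapped = body + c_b + c_a                             # step 3: swap last two
    return swapped[:n]                                     # step 4: truncate

ct = cbc_cts_encrypt(b"demo key", bytes(BLOCK), bytes(range(37)))
print(len(ct))  # 37
```

The truncation discards exactly the bits of the second-to-last CBC block that the zero padding carried through into the final block, which is why no information is lost.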

CBC ciphertext stealing decryption using a standard CBC interface

1. D_n = Decrypt (K, C_n−1). Decrypt the second-to-last ciphertext block using ECB mode.
2. C_n = C_n || Tail (D_n, B−M). Pad the ciphertext to the nearest multiple of the block size using the last B−M bits of block cipher decryption of the second-to-last ciphertext block.
3. Swap the last two ciphertext blocks.
4. Decrypt the (modified) ciphertext using the standard CBC mode.
5. Truncate the plaintext to the length of the original ciphertext.
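The matching decryption procedure can be sketched in the same style. Again the names are hypothetical: cbc_decrypt stands for a standard CBC interface, and the block cipher is a toy SHA-256-based Feistel construction (not secure). Lengths are in bytes. The single decrypt_block call in step 1 is the extra block decryption mentioned above, since the same block is decrypted again inside cbc_decrypt.

```python
import hashlib

BLOCK = 16  # B in bytes; the toy cipher below is fixed to 16-byte blocks

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy Feistel block cipher (illustration only, NOT secure)."""
    left, right = block[:8], block[8:]
    for i in range(4):
        f = hashlib.sha256(key + bytes([i]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right

def decrypt_block(key: bytes, block: bytes) -> bytes:
    left, right = block[:8], block[8:]
    for i in reversed(range(4)):
        f = hashlib.sha256(key + bytes([i]) + left).digest()[:8]
        left, right = xor(right, f), left
    return left + right

def cbc_decrypt(key: bytes, iv: bytes, data: bytes) -> bytes:
    """Standard CBC decryption; data must be a multiple of BLOCK bytes."""
    out, prev = [], iv
    for i in range(0, len(data), BLOCK):
        block = data[i:i + BLOCK]
        out.append(xor(decrypt_block(key, block), prev))
        prev = block
    return b"".join(out)

def cbc_cts_decrypt(key: bytes, iv: bytes, ciphertext: bytes) -> bytes:
    n = len(ciphertext)
    if n <= BLOCK:
        raise ValueError("this sketch needs more than one block of ciphertext")
    m = (n - 1) % BLOCK + 1                       # length of C_n in bytes, 1..BLOCK
    body = ciphertext[:n - m - BLOCK]
    c_prev = ciphertext[n - m - BLOCK:n - m]      # C_{n-1}, a full block
    c_last = ciphertext[n - m:]                   # C_n, m bytes
    d_last = decrypt_block(key, c_prev)           # step 1: single-block decryption
    c_last_full = c_last + d_last[m:]             # step 2: restore stolen bytes
    reordered = body + c_last_full + c_prev       # step 3: swap last two blocks
    return cbc_decrypt(key, iv, reordered)[:n]    # steps 4-5: CBC, then truncate

# Build a 21-byte ciphertext by hand from the CTS encryption steps.
key, iv = b"demo key", bytes(BLOCK)
p_prev, p_last = b"sixteen byte blk", b"tail!"
e_prev = encrypt_block(key, xor(p_prev, iv))
ct = encrypt_block(key, xor(e_prev, p_last + bytes(11))) + e_prev[:5]
print(cbc_cts_decrypt(key, iv, ct) == p_prev + p_last)  # True
```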

CBC ciphertext stealing error propagation

A bit error in the transmission of C_n−1 would result in the block-wide corruption of both P_n−1 and P_n. A bit error in the transmission of C_n would result in a corresponding bit error in P_n, and in the block-wide corruption of P_n−1.
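The CBC error propagation behavior can be checked with a small experiment. This is a sketch under assumptions: the cipher is a toy SHA-256-based Feistel construction (not secure), lengths are in bytes, an all-zero IV is used purely for the demonstration, and the helper names are hypothetical. Only the single-bit error in P_n is exactly predictable; the other counts depend on the cipher.

```python
import hashlib

BLOCK = 16  # B in bytes

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def encrypt_block(key: bytes, block: bytes) -> bytes:
    """Toy Feistel block cipher (illustration only, NOT secure)."""
    left, right = block[:8], block[8:]
    for i in range(4):
        f = hashlib.sha256(key + bytes([i]) + right).digest()[:8]
        left, right = right, xor(left, f)
    return left + right

def decrypt_block(key: bytes, block: bytes) -> bytes:
    left, right = block[:8], block[8:]
    for i in reversed(range(4)):
        f = hashlib.sha256(key + bytes([i]) + left).digest()[:8]
        left, right = xor(right, f), left
    return left + right

def encrypt_tail(key, c_prev2, p_prev, p_last):
    """CBC-CTS encryption of the last two blocks: returns (C_{n-1}, C_n)."""
    m = len(p_last)
    e_prev = encrypt_block(key, xor(p_prev, c_prev2))
    c_prev = encrypt_block(key, xor(e_prev, p_last + bytes(BLOCK - m)))
    return c_prev, e_prev[:m]

def decrypt_tail(key, c_prev2, c_prev, c_last):
    """CBC-CTS decryption of the last two blocks: returns (P_{n-1}, P_n)."""
    m = len(c_last)
    x_last = xor(decrypt_block(key, c_prev), c_last + bytes(BLOCK - m))
    p_prev = xor(decrypt_block(key, c_last + x_last[m:]), c_prev2)
    return p_prev, x_last[:m]

def flip_bit(data: bytes, bit: int) -> bytes:
    out = bytearray(data)
    out[bit // 8] ^= 0x80 >> (bit % 8)
    return bytes(out)

def bit_diff(a: bytes, b: bytes) -> int:
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))

key, iv = b"demo key", bytes(BLOCK)
p_prev, p_last = b"sixteen byte blk", b"tail!"
c_prev, c_last = encrypt_tail(key, iv, p_prev, p_last)

# Case 1: one flipped bit in C_n gives the same single bit error in P_n,
# and corrupts P_{n-1} because the reassembled E_{n-1} is wrong.
prev_1, last_1 = decrypt_tail(key, iv, c_prev, flip_bit(c_last, 3))
# Case 2: one flipped bit in C_{n-1} changes D_n, which feeds both outputs.
prev_2, last_2 = decrypt_tail(key, iv, flip_bit(c_prev, 3), c_last)

print("error in C_n:   wrong bits in P_n-1 / P_n:",
      bit_diff(prev_1, p_prev), bit_diff(last_1, p_last))
print("error in C_n-1: wrong bits in P_n-1 / P_n:",
      bit_diff(prev_2, p_prev), bit_diff(last_2, p_last))
```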

References

Daemen, Joan (1995). "2.5.1 and 2.5.2". Cipher and Hash Function Design, Strategies Based on Linear and Differential Cryptanalysis (PDF) (Ph.D. thesis). Katholieke Universiteit Leuven.
Schneier, Bruce (1995). Applied Cryptography (2nd ed.). John Wiley & Sons, Inc. pp. 191, 195. ISBN 978-0-471-12845-8.
Meyer, Carl H.; Matyas, Stephen M. (1982). Cryptography: A New Dimension in Computer Data Security. John Wiley & Sons, Inc. pp. 77–85. ISBN 978-0-471-04892-3.
Baldwin, R.; Rivest, R. (October 1996). The RC5, RC5-CBC, RC5-CBC-Pad, and RC5-CTS Algorithms. doi:10.17487/RFC2040. RFC 2040.
Dworkin, Morris (October 2011). Recommendation for Block Cipher Modes of Operation: Three Variants of Ciphertext Stealing for CBC Mode (PDF). US National Institute of Standards and Technology (NIST). Addendum to NIST Special Pub 800-38A.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.