Lucky 13 and other padding oracle attacks on CBC ciphers

Lucky 13 is a padding oracle timing attack on CBC ciphers, which required multiple patches to solve. Does this mean that this vulnerability is now solved for good, or that it is the vulnerability that keeps on giving?

Padding

Block ciphers encrypt data in blocks of 16 bytes. The message is separated into blocks of 16 bytes, and each block is fed through the encryption function. But sometimes you want to encrypt a message that is not a multiple of 16. If your message is 10 bytes, you need to fill up the last six bytes with something before you can feed it into the block cipher. This is called padding.

A naive approach would be to just pad the message with some special character, such as nul-bytes or xs. However, that will interfere with messages ending in such a character. Since we want to be able to send any character through AES, we need to encode the length of the message or the length of the padding somehow in the message. The standard way to do this (PKCS #7) is to use the length of the padding as the padding byte. If the padding is 7 bytes long, it is padded with seven 0x07 bytes.

Padding oracle

The padding algorithm specifies that a certain number of bytes should have a certain value. This also makes it possible for the padding to be incorrect. If the decrypted message ends with 7, it must end with 7777777. Anything else is invalid padding. If an attacker can notice whether the padding is valid or invalid during decryption, then this provides him with a little bit information. A server that leaks this information is called a padding oracle.

The attacker can send manipulated messages to the server. Most of these will have invalid padding. If a message has valid padding, it probably ends with 1. A one repeating once is the easiest padding to get right. The attacker learns one byte of plaintext, and can continue to try to get the padding 22. In the end, they can decrypt the whole message this way, just by asking the server whether the padding is valid.

Solution

The structural solution is to avoid CBC ciphers and PKCS #7 padding, and these are indeed no longer supported with TLS 1.3.

However, many attempts have been made to fix padded CBC ciphers. To avoid becoming a padding oracle, the apparent behaviour must be the same whether the padding is correct or incorrect. The TLS implementation must check the padding and reject the message if invalid, but it must not behave differently to the client when it has done so. This turns out to be pretty hard to implement.

In the original attack, the TLS implementation simply returned a different error message when the padding was incorrect. That was quickly fixed, but that did not entirely solve the problem. Even though the error message was the same, the time it took to return a result varied. When the padding is incorrect, the message is quickly discarded instead of decrypted, which naturally takes a shorter time. This time difference reveals information to the client about whether the padding was correct. This new timing attack on padding validity was called Lucky 13.

Side-channel threat model

The difference between a correct and incorrect padding results in microseconds of difference. This is difficult to measure over the internet. However, even when attacking a remote target, it’s often possible to get on the local network or even the same computer. If the target site is hosted on a Platform-as-a-Service (PAAS) provider, the adversary can start their own VM inside the same datacenter as the victim. In some cases, the adversary’s VM can be running on the same physical host. This makes it much more feasible to perform timing attacks, and may even enable other side channels such as cache timing attacks.

Timing solutions

To avoid timing differences and thus being vulnerable to Lucky 13, operations should take the same time, whether the padding is valid or not. There are two possible solutions for this:

pseudo constant time implementations: if the padding is invalid, perform some computations similar to what an actual decryption would do. Also, sleep for a random time, so that attackers have more difficulty to measure the time of operations.
constant-time, constant memory-access: the code performs the exact same operations, whether the padding is valid or not.

Both solutions are hard to get correct.

Pseudo constant time implementations

In pseudo constant time implementations, the TLS library attempts to perform a similar amount of operations when the padding is incorrect, as would be done by an actual encryption when the padding is correct. How much operations to perform depends on the decryption algorithm, and getting these numbers right for every decryption algorithm is not that easy. Several bugs were found in multiple TLS libraries, where the amount of fake work did not match the work of an actual decryption.

Another countermeasure is to randomize the time a decryption takes, so it is harder for an attacker to measure the time. This has also been proven to contain bugs, at least in Amazon’s s2n. That TLS library passed a random number to usleep, to introduce a random time delay. The bug was that this sleeps for an integer amount of microseconds. If the attacker detects that something took 8.7 microseconds, the 8 part was random, but the .7 part still conveyed sufficient information.

Constant-time, constant memory-access

OpenSSL rewrote their CBC implementation to be completely constant-time. This is hard to do and makes the code large and messy.

its complexity is such that around 500 lines of new code were required to implement it, and it is arguable whether the code would be understandable by all but a few crypto-expert developers.

Of course, writing so many complex code has the potential to introduce new bugs. And it did:

The padding oracle vulnerability we discovered in OpenSSL (CVE-2016-2107) was introduced by writing a constant-time patch that should have mitigated the Lucky 13 attack

The patch that should have solved Lucky 13 introduced an even worse security vulnerability. A similar bug was identified in MatrixSSL, showing that it is not easy to solve Lucky 13 using constant time code.

Conclusion

All TLS libraries have been patched against Lucky 13, multiple times. Patches have been implemented by various TLS libraries for attack variants released in 1998, 2002, 2013, 2014, 2015, 2016 and 2018. Whether CBC ciphers are now secure depends on your view of whether the seventh time is the charm, or that this is apparently an unsolvable problem.