shadowsocks / shadowsocks-org

www.shadowsocks.org


Defend against replay attack

riobard opened this issue · comments

commented

https://en.wikipedia.org/wiki/Replay_attack

A replay attack (also known as playback attack) is a form of network attack in which a valid data transmission is maliciously or fraudulently repeated or delayed. This is carried out either by the originator or by an adversary who intercepts the data and re-transmits it, possibly as part of a masquerade attack by IP packet substitution.

Replay Attack potentially allows adversaries to identify a Shadowsocks server by observing and replaying the data streams between a valid client and the server. Ideally we want to recognize that a data stream is replayed, and react accordingly.

From my comments in #36 :

Just put a timestamp into the request header, then the server verifies that the timestamp should not exceed the server's local time +/- 60s. This timestamp is authenticated by the request's tag. (It is OK to use the timestamp as AD but then we cannot tell timestamp errors from authentication errors...)

The server keeps recording valid incoming requests (by its IV or the seed in the preshared-key protocol or anything that can identify a request) for more than the last 2 minutes (2 * 60s), and rejects duplicated requests. We can use any reasonable data structures such as binary trees, hash tables or bloom filters.

Drawback: sometimes the times on clients and servers are not synchronized...
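The scheme quoted above (a ±60s timestamp check plus a short-lived request cache) can be sketched as follows. The function name and the plain-dict store are illustrative assumptions for this sketch, not part of any implementation:

```python
import time

WINDOW = 60  # allowed clock skew in seconds, per the proposal above

# Illustrative in-memory store: request ID (e.g. the IV) -> time first seen.
seen = {}

def accept_request(request_id, claimed_ts, now=None):
    """Return True if the request passes the timestamp and replay checks."""
    now = time.time() if now is None else now
    # Reject timestamps outside the server's local time +/- WINDOW.
    if abs(now - claimed_ts) > WINDOW:
        return False
    # Purge entries older than 2 * WINDOW so the store stays bounded.
    for rid in [r for r, t in seen.items() if now - t > 2 * WINDOW]:
        del seen[rid]
    # Reject any request ID already recorded within the retention window.
    if request_id in seen:
        return False
    seen[request_id] = now
    return True
```

A production version would replace the dict with a more compact structure (binary tree, hash table, or Bloom filter, as suggested above).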

Also note that the server must handle replayed connections the same way as other errors.

There are two kinds of replay attacks against shadowsocks: short-term and long-term.

To defend against short-term replays, I think our current implementation is already good enough. In shadowsocks, we maintain a nonce cache on the server to detect any nonce reuse. The default size of that cache is 1024, which should be large enough for personal usage. Any replay attack detected here causes the source IP to be blocked.

For long-term replays, the easiest defense is to include a timestamp in our header. However, is it really necessary? If adversaries are performing this kind of attack against a specific port of your server, your shadowsocks service is already exposed.

@wongsyrone A random chunk within a TCP stream? I don't think that's possible, because we currently assume the nonce of each chunk is incremented by 2. If one chunk is sent twice, an authentication error would be detected.

commented

I prefer not to use storage to defend against replay attacks, as that would introduce additional problems.

I agree that defending against replay attacks should be the plugins' responsibility.

commented

Does this mean an adversary would have to reconstruct the whole TCP stream, or only the request header in the first packet?

Since our current design is based on a session key and session-unique nonces, to perform a replay attack the adversary would have to replay the whole TCP stream.

commented

@madeye Currently the nonce cache in the libev port is suboptimal in space usage. Since we're not actually interested in retrieving anything from that cache, it only needs to provide a test of set membership.

We could instead use a Bloom filter to test if the given salt/nonce has been seen before (with a low rate of false positives). 1MB of space can store millions of entries in a standard Bloom filter.

This is a recommendation for Shadowsocks implementations to defend against bad CSPRNG. It does not change the protocol.

@riobard I did some research on Bloom filters before. There are two concerns:

  1. False positives.
  2. The false-positive rate increases with more inserts.

To overcome these limitations, we could clear the bloom filter from time to time, but I think that may introduce new problems.

commented

@madeye With proper handling I think false positives are not a major concern. The server should react in the following way:

  1. Receive salt from client
  2. Perform AEAD decryption
  3. If successful, look up the Bloom filter and check if the salt has been seen before
  4. If yes, disconnect to force the client to initiate a new connection (with a new randomly generated salt), possibly logging a warning to alert the server operator of a potential replay attack.
  5. Otherwise, add the salt to the filter.

The Bloom filter can be tuned for a very low false-positive rate. We can use two Bloom filters: one in active use and one on standby. When the active one fills past a predefined threshold, future insertions are directed to the standby filter and the active filter is reset.
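As a sketch of this two-filter rotation, here is a minimal Bloom filter plus a ping-pong wrapper in Python. The class names, the SHA-256 double-hashing scheme, and the half-capacity rotation threshold are illustrative assumptions, not the actual libev implementation:

```python
import hashlib
import math

class BloomFilter:
    """Minimal Bloom filter sized from capacity n and false-positive rate e."""
    def __init__(self, n, e):
        # m = -n*ln(e)/ln(2)^2 bits, k = (m/n)*ln(2) hash functions.
        self.m = max(8, int(-n * math.log(e) / math.log(2) ** 2))
        self.k = max(1, round(self.m / n * math.log(2)))
        self.bits = bytearray((self.m + 7) // 8)
        self.count = 0

    def _positions(self, item):
        # Kirsch-Mitzenmacher double hashing: derive k indexes from one digest.
        d = hashlib.sha256(item).digest()
        h1 = int.from_bytes(d[:8], "big")
        h2 = int.from_bytes(d[8:16], "big") | 1
        return [(h1 + i * h2) % self.m for i in range(self.k)]

    def add(self, item):
        for p in self._positions(item):
            self.bits[p // 8] |= 1 << (p % 8)
        self.count += 1

    def __contains__(self, item):
        return all(self.bits[p // 8] >> (p % 8) & 1
                   for p in self._positions(item))

class PingPongFilter:
    """Two alternating Bloom filters; each rotation forgets the older half."""
    def __init__(self, n, e):
        self.n, self.e = n, e
        self.active = BloomFilter(n, e)
        self.standby = BloomFilter(n, e)

    def check_and_add(self, salt):
        """Return True if the salt was (probably) seen before, else record it."""
        if salt in self.active or salt in self.standby:
            return True
        if self.active.count >= self.n // 2:
            self.standby = self.active                  # keep the recent half
            self.active = BloomFilter(self.n, self.e)   # drop the older half
        self.active.add(salt)
        return False
```

Note that the membership test only runs on salts that already passed AEAD authentication, so false positives, not false negatives, are the operational concern.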

commented

@riobard
Please supply more data to support your claim. How low does the false-positive rate need to be? In that case, how often should a server flush the filter? How does it compare to a simple table in space usage, and what is the minimum time for a feasible replay attack given a steady stream of UDP packets to the server? And so on.

By the way, I don't think this probabilistic data structure is designed for our needs. If we are going for a probabilistic algorithm, there may be something more suitable.

@madeye
First of all, I don't think anyone has actually performed replay attacks, because replaying the same request can lead to unexpected results such as submitting a form twice. But if there were any evidence of them, we had better defend against them. Implementing this feature would also let us know whether such attacks are happening.

A Bloom filter is the best structure if we design it carefully. We can design it with a false-positive rate so low that it hardly ever triggers in practice. And if we introduce timestamps and only allow requests within +/- 1 min, then we only need to clear and switch to the other filter every 2 minutes; that way we record the requests of the last 2-4 minutes.

@riobard Don't just disconnect; we must keep the behavior identical to handling authentication errors. Also, I am not too concerned about false positives, because I think the rate of ordinary network errors is much higher.

I once implemented the protocol with a timing check: I required the connection to finish the request header within 30 seconds, or I would drop the connection. So when someone replays, sends fake data, or even connects and then does nothing, the server disconnects at exactly 30s. Although this could serve as a fingerprint, it is the only observable behavior for any kind of active probing, and I think the attacker cannot identify the kind of service from the timeout alone.

@Mygod There is a formula to calculate the false positive rate. You can find it on Wikipedia.

commented

The math is pretty simple. A classic Bloom filter requires 1.44 * log2(1/e) bits per item of storage, where e is the false-positive rate. Assuming we aim for e = 1e-6 (which means very rare) and 1 million salts/nonces to track (which is a LOT even for a very busy server), the Bloom filter needs less than 4 MB of memory (which is doable even on low-end routers).
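Plugging the numbers from the paragraph above into the formula (the function name is just for illustration):

```python
import math

def bloom_megabytes(n, e):
    """Memory for a classic Bloom filter: 1.44 * log2(1/e) bits per item."""
    bits_per_item = 1.44 * math.log2(1 / e)
    return bits_per_item * n / 8 / 2**20  # bits -> MiB

# 1 million tracked salts at a 1e-6 false-positive rate:
mb = bloom_megabytes(1_000_000, 1e-6)  # about 3.4 MiB, i.e. under 4 MB
```

The same formula with e = 1e-5 gives about 2.85 MiB, matching the memory usage reported below for the test branch.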

Implemented a Ping-Pong bloom filter in the branch https://github.com/shadowsocks/shadowsocks-libev/tree/bloomfilter.

Current parameters are: 1000000 entries and 0.00001 error rate. The additional memory usage is 2.85 MBytes.

I'm running tests in a production environment. Let's see if this error rate (0.00001) works.

Since it's a Ping-Pong bloom filter, it forgets half of history (500,000 entries) after each resetting.

commented

@madeye A 0.00001 error rate is too high. Consider the aftermath of a false positive: a legitimate user gets blocked by the server. I think that's pretty catastrophic. I'd prefer an error rate comparable to the non-recoverable read error rate of a hard disk, i.e. 1e-15 to 1e-18.

@Mygod 1e-15 is a really small number; 8 MBytes would be required. I'm wondering, is there a non-recoverable error rate for an internet connection?


@wongsyrone The endian issue here shouldn't be a problem as we never try to pass this hash between machines.

@wongsyrone AFAIK, all the modern CPUs (x86, ARM, MIPS) support misaligned accesses for scalar instructions. You may have alignment fault on ARM for vector instruction, but it's not our case here.

@wongsyrone Eight consecutive replay errors cause the IP to be blocked. Maybe that threshold is already large enough?

commented

The purpose of the Bloom filter is to detect salt/nonce reuse. As for the action to take after a replay is detected, we should leave it up to server operators. Would it be better to take a shell script as an argument, invoked each time a replay is detected along with the necessary information such as the IP address? Server operators can write their own policy in that shell script, either null-routing the connection or using tools like fail2ban.
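A hedged sketch of such a hook mechanism; the function name, argument order, and the use of `/bin/echo` as a stand-in script are illustrative assumptions, not an agreed interface:

```python
import subprocess

def on_replay_detected(hook_path, client_ip, salt_hex):
    """Invoke an operator-supplied script with details of the detected replay.

    The script can implement any policy: null-route the address, feed
    fail2ban, or just log. Hook failures are deliberately ignored
    (check=False) so a broken script cannot take down the server.
    """
    return subprocess.run([hook_path, client_ip, salt_hex],
                          capture_output=True, check=False)

# Example: /bin/echo as a stand-in hook that just prints its arguments.
result = on_replay_detected("/bin/echo", "203.0.113.7", "deadbeefcafe")
```

In a real server the hook would be invoked asynchronously so a slow script cannot stall the event loop.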

@riobard Yes, the behavior after a replay is detected should not be part of this proposal.

@wongsyrone I think it's safe to block scanners.

commented

Hmm, I think things are getting a little too complicated here (again). Mind you, a Bloom filter doesn't solve replay attacks completely. It's just a slightly better (or not) approach than simply storing IVs.

If one is running Shadowsocks on 80/443, he/she might as well use plugins like simple-obfs. I think plugins can decide what action should be taken when a malicious request is received.

Yes, please go ahead with replay attack related topics here. The bloom filter is just an implementation enhancement, which is not part of shadowsocks protocol.

commented

@wongsyrone It won't. The salt/nonce is only tested against the Bloom filter after successful decryption and authentication. Non-Shadowsocks traffic will not trigger the defensive mechanism against replay attack.

Yep those requests have nothing to do with our protocol and are simply dropped.

@Mygod And what will you do if the banned IP sends more requests? This could be a fingerprint.

commented

@wongsyrone I think you misunderstood the intention of the Bloom filter.

@wongsyrone No, it should be checked after authenticating the request header. Authentication must be done first of all, which is the common practice. It makes no sense for someone to attack with spam data that cannot pass authentication.

commented

@wongsyrone Please educate.

commented

I'm afraid you missed an important keyword in the definition: in a replay attack, the adversary sends a valid data stream. If decryption/authentication fails, the data stream is invalid, and it is not the concern of this proposal.

And you still misunderstand the intention of the Bloom filter. It solves exactly the problem of checking a large number of used salts while using little space. There's no need for a supercomputer.

Here are some updates on the new ping-pong bloom filter:

  1. The test parameters are: 1,000,000 entries, 1e-5 error rate.
  2. The test was performed on a server shared by a small startup (10~20 people) with the chacha20-ietf-poly1305 cipher.
  3. In the three-day test, no replay attack was observed according to the server log.
  4. No abnormal connection reset was observed locally.
  5. To verify that the bloom filter really works, I constructed several valid replay requests, and all of them were identified as expected.

In summary, the bloom filter works much better than I expected. I think short-term replay attacks, at least, won't be a problem for us now.

commented

@madeye Awesome!

There's one thing I'm curious about: have you measured the rate of connection establishment in this setting? It's a measure of how busy the server is, i.e. how often we switch between the ping and pong filters. This matters because every time we switch, we lose half of the history. I'm interested in the time window of effective defense.

Additionally, we should probably also use the filter to protect UDP packets.

@riobard Nope, I didn't measure the connection rate. But I have not observed a single bloom filter reset so far, so we may assume the connection-establishment rate is below 1,000,000 / (3600 * 24 * 3) ≈ 3.8 connections per second.

Yes, we should also add this to UDP.

commented

@madeye 3-day protection for a dozen or so users is more than sufficient. Probing techniques based on nonce/IV reuse should be infeasible now.

commented

Murphy's law:

Whatever can happen will happen.

Are you sure 1e-5 is small enough?

Maybe we can consider the error rate here following the concept of SLA.

Both Google and Amazon guarantee 99.95% uptime for their cloud services. Since almost all shadowsocks services are installed on this kind of server, we can achieve no better than a 99.95% SLA anyway. In other words, neither a 1e-5 nor a 1e-15 error rate makes any difference to the SLA, does it?

commented

@madeye Hmm... I think the 0.05% downtime usually consists of service maintenance that lasts a while. That's different from an occasional dropped connection (which I suppose is the current behavior on a false positive).

Hmm, let's keep this issue open and do more tests. The good news is that the 8 MBytes of memory for a 1e-15 bloom filter is negligible on a modern Linux box ($5, 512 MB).

commented

The number of entries to remember and the false-positive rate should be parameters that server operators choose according to their environment and resource constraints. We don't need to make that hard decision here.

@riobard Good idea. So let's just suggest 1e6 entries and a 1e-6 error rate here.

commented

Looks good to me. If the number of entries is 0, we should disable the filter; people operating in low-memory environments should be free to run without the additional protection.

Glad to see the bloom filter implemented.
I hope it can provide evidence of whether replay attacks are happening now.

@shinku721
“In the week since V2Ray 2.19 went live (including beta testing), the official servers have blocked a total of 165 replay attacks from 163 IPs, covering all three major Chinese carriers, with all IPs originating from their backbone networks. The behavioral pattern is that each IP appears only once and is used only for an attack. If you have upgraded to 2.19, you can search for "Duplicated" in access.log to analyze your own situation.”
https://mobile.twitter.com/projectv2ray/status/833959357423448064

This is the commit of V2Ray which added the anti replay function:

v2ray/v2ray-core@3e10f3a

commented

Use the nonce as a sequence number and check it for replays. When the nonce wraps around, would it be OK to resend the salt to start a new session subkey?

commented

@xlshiz Why do you want to do that?

commented

@riobard Sorry, I thought the nonce (the nonce inside each data packet) had to be cached to check for reuse. After reading the code ( https://github.com/shadowsocks/shadowsocks-libev/tree/12c4344a420b230be8c5c286176081d4628c0eed ), I see the current implementation caches the salt, which prevents the complete captured packets of one connection from being replayed as a new connection. Do we also need to check for replays against an existing connection? There is currently no nonce-wraparound check.

commented

64-bit nonce wraparound is not gonna happen in practice.