tcp: avoid premature drops in tcp_add_backlog()
author    Eric Dumazet <edumazet@google.com>
          Tue, 23 Apr 2024 12:56:20 +0000 (12:56 +0000)
committer Greg Kroah-Hartman <gregkh@linuxfoundation.org>
          Thu, 30 May 2024 07:49:16 +0000 (09:49 +0200)
commit    9b91e7cfb83a974c792f9e62da6704b133c71672
tree      e66c23af6375e4828698b0a96332896c41f11823
parent    e1a636b29b6859a917b7b34444a76861a6b3af7b
tcp: avoid premature drops in tcp_add_backlog()

[ Upstream commit ec00ed472bdb7d0af840da68c8c11bff9f4d9caa ]

While testing TCP performance with the latest trees,
I saw suspicious SOCKET_BACKLOG drops.

tcp_add_backlog() computes its limit with:

    limit = (u32)READ_ONCE(sk->sk_rcvbuf) +
            (u32)(READ_ONCE(sk->sk_sndbuf) >> 1);
    limit += 64 * 1024;

This does not take into account that sk->sk_backlog.len
is reset only at the very end of __release_sock().

Both sk->sk_backlog.len and sk->sk_rmem_alloc could reach
sk_rcvbuf under normal conditions.
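
For example, with hypothetical values sk_rcvbuf = sk_sndbuf = 1 MB,
the current limit is about 1 MB + 512 KB + 64 KB ~= 1.5 MB, while
sk_backlog.len and sk_rmem_alloc can each legitimately approach 1 MB,
so a fast flow can exceed the limit and see drops without any
misbehavior.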

We should double the sk->sk_rcvbuf contribution in the formula
to absorb bubbles in the backlog, which happen more often
for very fast flows.

This change maintains decent protection against abuses.
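
As a rough sketch (not the exact patch hunk), the adjusted computation
could look like the following, with the sk_rcvbuf term doubled; the
explicit u64 clamp is illustrative, to keep the widened sum from
overflowing a u32:

    /* Sketch: double the sk_rcvbuf contribution so that sk_backlog.len
     * and sk_rmem_alloc each reaching sk_rcvbuf no longer trips the
     * limit; clamp the widened sum back into u32 range.
     */
    u64 limit = ((u64)READ_ONCE(sk->sk_rcvbuf)) << 1;

    limit += (u32)(READ_ONCE(sk->sk_sndbuf) >> 1);
    limit += 64 * 1024;
    limit = min_t(u64, limit, UINT_MAX);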

Fixes: c377411f2494 ("net: sk_add_backlog() take rmem_alloc into account")
Signed-off-by: Eric Dumazet <edumazet@google.com>
Link: https://lore.kernel.org/r/20240423125620.3309458-1-edumazet@google.com
Signed-off-by: Jakub Kicinski <kuba@kernel.org>
Signed-off-by: Sasha Levin <sashal@kernel.org>
net/ipv4/tcp_ipv4.c