"I would absolutely love to discover the original code review for this and why this was chosen as a default. If the PRs from 2011 are any indication, it was probably to get unit tests to pass faster."

This was a pretty interesting read.

withinboredom.info/blog/2022/1

@nyquildotorg Here you go, from the code's author: news.ycombinator.com/item?id=3

And John Nagle commenting on the tradeoffs and problems: news.ycombinator.com/item?id=3

And the rest of the comments go, in a roundabout way, into wondering why git-lfs is making write syscalls 50 bytes at a time in the first place, because that's going to suck regardless of what the network stack does beyond that.

@danderson @nyquildotorg Gotta admit, I'm team TCP_NODELAY. Nagle has surprised people for generations, and "But I want to take 20k syscalls to transfer 1MB and have it look efficient" doesn't make me very sympathetic!

@sgf @danderson @nyquildotorg Nagle gets used to "fix up" a bunch of problems (e.g. silly window syndrome etc).

In general, there are two types of flow: elephants (bandwidth heavy) and mice (latency sensitive). You want Nagle for the first class (keep overheads as low as possible for maximum throughput), and not for the second (keep latency as low as possible).

@isomer @sgf @danderson @nyquildotorg

I would expect that elephants will buffer (and when they temporarily become mice, they will either reshuffle things so that the buffered writer is out of the picture, or simply flush the writer at appropriate times). If that's the case, then by disabling Nagle's algorithm we waste at most one packet each time the buffer is emptied (in the worst case we emit one nearly-empty packet per flush). So Nagle should be superfluous if we buffer with a buffer that's much larger than a packet, or one sized as a multiple of the packet size. Am I missing some reason Nagle is useful?
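A toy model of that worst case (my construction, not from the thread; the MSS value is an assumption): with a buffer sized as a multiple of the segment size, turning Nagle off wastes at most one short packet per flush.

```python
# Toy model of the claim above: segment one flush of buffered bytes into
# MSS-sized packets and count how many short packets escape with TCP_NODELAY.
MSS = 1460       # assumed TCP maximum segment size
BUF = 4 * MSS    # buffer deliberately sized as a multiple of the MSS

def packets_per_flush(pending):
    """Packets emitted when `pending` buffered bytes hit a TCP_NODELAY socket."""
    full, rest = divmod(pending, MSS)
    return full + (1 if rest else 0)

assert packets_per_flush(BUF) == 4          # full buffer: no short packet
assert packets_per_flush(3 * MSS + 1) == 4  # early flush: one short packet
print("worst case: one short packet per flush")
```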

@robryk @sgf @danderson @nyquildotorg That's all true, but you can't always keep your buffer full, e.g. when reading data from disk, especially on long fast networks.

Connections can often flip between mice and elephants. It's common to say "do you want this data?" then wait for a reply, then send the entire data. The first part is latency sensitive, the second part is bandwidth heavy.
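A hypothetical sketch of that flip (names, buffer size, and loopback setup are my illustration, not from the thread): TCP_NODELAY keeps the tiny request from sitting in the kernel, while a userspace buffer with explicit flushes at message boundaries keeps the bulk phase efficient.

```python
import socket
import threading

# Sketch of the mice-then-elephant pattern: a tiny "do you want this data?"
# exchange is latency sensitive; the payload that follows is bandwidth heavy.
PAYLOAD = b"x" * 65536

def server(listener):
    conn, _ = listener.accept()
    with conn, conn.makefile("rwb") as f:
        assert f.readline() == b"WANT?\n"       # the mouse: tiny request
        f.write(b"YES\n")
        f.flush()
        assert f.read(len(PAYLOAD)) == PAYLOAD  # the elephant: bulk data

listener = socket.create_server(("127.0.0.1", 0))
t = threading.Thread(target=server, args=(listener,))
t.start()

client = socket.create_connection(listener.getsockname())
client.setsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY, 1)
with client, client.makefile("rwb", buffering=8192) as f:
    f.write(b"WANT?\n")
    f.flush()                    # flush at the message boundary (mouse phase)
    assert f.readline() == b"YES\n"
    f.write(PAYLOAD)             # elephant phase: written 8 KiB at a time
    f.flush()
t.join(timeout=5)
listener.close()
print("transfer complete")
```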

@isomer @sgf @danderson @nyquildotorg

If the buffer is not full, the buffered writer will not write until it gets full. People who deal with buffered writers are used to flushing them at appropriate times (and there are all those funny affordances like stdio's "flush out when someone's reading in").

@robryk @isomer @danderson @nyquildotorg I've now got the dev part of my brain going "You could argue that Nagle is just a defense against badly-written programs that can't buffer properly", and the SRE part of my brain going "Yes! And we need defenses against badly-written programs!".

Can we rename TCP_NODELAY to TCP_TRUSTME?

@sgf @robryk @danderson @nyquildotorg if the network is slow then the end users (like the upstream author) eventually notice and fix the problem.

If the network is inefficient, then generally the people in the middle of the network notice, but cannot even figure out what is creating the inefficient traffic (it's all encrypted, right?) in order to complain.


@isomer @sgf @danderson @nyquildotorg

I loathe making something worse just so that we move the pain somewhere it's more likely to get fixed. I don't know whether it's reasonable to loathe that.

That said, if we want to use pain to cause things to get fixed and don't care much about the debuggability of that pain, shouldn't we instead (a) apply Nagle only when we notice many small packets with no reads in between on a socket that doesn't have a TCP_NODELAY equivalent set, and (b) warn about that happening, e.g. in kernel logs (rate limited)?

@robryk @sgf @danderson @nyquildotorg with poll() style event loops, there are always reads in between.

Logging to syslog isn't a particularly useful way to get people to notice problems.

@isomer @sgf @danderson @nyquildotorg

That's a very weird way of doing that IMO. Once you learn that there's data to be read from the socket, but the protocol handler doesn't want more data now:

- you can't read it, because it's a remotely-triggered out-of-memory DoS,
- you need to unset the "wake on read available" flag in the subsequent polls so they don't immediately return and keep track of "this socket has some data available".

So why set the "wake on read available" flag until the protocol handler is in a state in which it's waiting for data?
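The pattern above can be sketched with Python's `selectors` module (a `socketpair` stands in for a real TCP connection; the flow is my illustration): read interest is armed only while the handler wants bytes, so pending data stays in the kernel's bounded socket buffer instead of an unbounded userspace one.

```python
import selectors
import socket

# Sketch: arm "wake on read available" only while the protocol handler
# actually wants input, instead of reading everything eagerly.
a, b = socket.socketpair()
sel = selectors.DefaultSelector()

# Phase 1: handler is mid-computation and doesn't want input yet, so the
# socket simply isn't registered for EVENT_READ. Data queued by the peer
# stays in the kernel, bounded by the socket buffer.
b.sendall(b"hello")

# Phase 2: handler is now waiting for data -- arm read interest.
sel.register(a, selectors.EVENT_READ)
events = sel.select(timeout=1)
assert len(events) == 1          # wakes exactly when data is wanted
assert a.recv(1024) == b"hello"
sel.unregister(a)                # disarm again until the next request
a.close(); b.close(); sel.close()
print("read interest armed only on demand")
```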
