
Gzip response bodies #221

Open
david-crespo opened this issue Dec 20, 2021 · 8 comments

Comments

@david-crespo
Contributor

david-crespo commented Dec 20, 2021

Substantially reduce response sizes by gzipping response body and adding content-encoding: gzip to headers. Probably simplest to make this configurable server-wide. This issue is inspired by me imagining a multi-MB response containing serial console contents.

Potential extra features

  • Configurable minimum size below which we don't bother compressing (see below; a few KB seems like a reasonable threshold)
  • Respect an accept-encoding: identity request header by not compressing even if the compression feature is turned on server-wide (a minimal sketch of both ideas follows this list)
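
A minimal sketch of both, assuming the flate2 crate; maybe_gzip and MIN_GZIP_SIZE are hypothetical names, not anything Dropshot exposes today:

```rust
// Hypothetical sketch (not Dropshot's actual API): compress a response body
// only when the client allows gzip and the body clears a minimum-size threshold.
use std::io::Write;

use flate2::write::GzEncoder;
use flate2::Compression;

const MIN_GZIP_SIZE: usize = 2048; // ~2 KB; tunable server-wide

/// Returns the (possibly compressed) body and the content-encoding value
/// to set on the response, if any.
fn maybe_gzip(body: Vec<u8>, accept_encoding: Option<&str>) -> (Vec<u8>, Option<&'static str>) {
    // `accept-encoding: identity` (or no gzip token at all) means: don't compress.
    let client_allows_gzip = accept_encoding
        .map(|v| v.split(',').any(|tok| tok.trim().starts_with("gzip")))
        .unwrap_or(false);

    if !client_allows_gzip || body.len() < MIN_GZIP_SIZE {
        return (body, None);
    }

    let mut encoder = GzEncoder::new(Vec::new(), Compression::default());
    encoder.write_all(&body).expect("write to in-memory encoder");
    (encoder.finish().expect("finish in-memory encoder"), Some("gzip"))
}
```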
@ahl
Collaborator

ahl commented Dec 20, 2021

This is an interesting idea. In what situations would we expect the additional latency introduced by compress/decompress to be less than the transmission latency saved? I could imagine this would be most valuable on high-latency and/or low-bandwidth connections.

Would you expect dropshot to skip compression for pre-compressed data such as jpg, png, or pre-compressed js/css objects?

@david-crespo
Contributor Author

Good point about images. It looks like the recommendation is not to gzip images because it doesn't make them smaller and can in fact make them bigger. The same is true for pre-compressed (not merely minified) static text assets. In that case the server would need to know whether the asset being served is already compressed and pass compressed files through unmodified. Putting the right content-encoding header on the response would also require knowing the compression algorithm used (browser-supported options here).
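
Sketching that mapping; the helper name and extension list are my own illustration, not an existing API:

```rust
// Hypothetical helper: map a precompressed file's extension to the
// content-encoding token browsers understand.
fn encoding_for_extension(ext: &str) -> Option<&'static str> {
    match ext {
        "gz" => Some("gzip"),
        "br" => Some("br"),
        "zst" => Some("zstd"),
        _ => None, // unknown: serve as-is, with no content-encoding header
    }
}
```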

I see people discussing thresholds for whether compression is worth it primarily in terms of response size. I thought of this issue because of potentially large responses like the serial console. Even on a fast connection, I think you're usually going to see a latency benefit from fast server-side compression. Bandwidth savings aside, off the top of my head I'd guess your download speed would have to be consistently faster than the server's compression throughput to come out worse off in terms of latency. I see Google using gzip in GCP with dynamic JSON responses as small as 2 KB, though they may be concerned about bandwidth too. In short, the conventional wisdom seems to be that the latency trade pays off nearly all the time, or at least that even if your very fastest clients are slightly hurt by it, that's far outweighed by the gains for slower clients.
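
To make that guess concrete, here is my own back-of-envelope with purely illustrative numbers. If gzip runs at throughput $C$ and achieves compression ratio $r$ on a body of size $S$, compressing costs $S/C$, while the transfer time at bandwidth $B$ drops from $S/B$ to $S/(rB)$, saving $(1 - 1/r)\,S/B$. Compression wins on latency whenever

$$\frac{S}{C} < \left(1 - \frac{1}{r}\right)\frac{S}{B} \quad\Longleftrightarrow\quad B < \left(1 - \frac{1}{r}\right) C.$$

With, say, $C \approx 50$ MB/s and $r \approx 6$, the client would need sustained bandwidth above roughly 40 MB/s (~330 Mb/s) before compression starts hurting.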

This (old) article argues that it doesn't make sense to compress things smaller than a TCP packet, presumably because you're still waiting for the entire packet? (Out of my wheelhouse.) From what I can tell, people seem to like minimum size thresholds of around a few KB. It probably doesn't matter that much unless we expect to have a lot of tiny responses.

Some useful links I found while looking around:

https://stackoverflow.com/a/32454901/604986
https://webmasters.stackexchange.com/questions/31750/what-is-recommended-minimum-object-size-for-gzip-performance-benefits
https://developers.google.com/web/fundamentals/performance/optimizing-content-efficiency/optimize-encoding-and-transfer#text_compression_with_gzip

@david-crespo
Contributor Author

david-crespo commented Dec 20, 2021

Found an example from Tower of a nice way to handle precompressed assets by putting the compressed and uncompressed ones side by side.

“a client with an Accept-Encoding header that allows the gzip encoding will receive the file dir/foo.txt.gz instead of dir/foo.txt. If the precompressed file is not available, or the client doesn’t support it, the uncompressed version will be served instead.”

https://docs.rs/tower-http/latest/tower_http/services/fs/struct.ServeDir.html#method.precompressed_gzip
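
A usage sketch of that tower-http API (the directory name is illustrative):

```rust
use tower_http::services::ServeDir;

fn main() {
    // Serves ./assets/foo.txt.gz for /foo.txt when the request carries
    // Accept-Encoding: gzip, falling back to ./assets/foo.txt otherwise.
    let service = ServeDir::new("assets").precompressed_gzip();
    // `service` is a tower Service; mount it in any tower-compatible
    // server to actually handle requests.
    let _ = service;
}
```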

Presumably for images you would have to use some kind of is_image file-extension matcher. One easy way would be to use the MIME type produced by mime_guess and check whether it starts with image/. Another way would be to flip it and use an allowlist of compressible extensions, so, e.g., we might only compress JSON and HTML responses and .json, .js, and .css files (sketch below).
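
A rough sketch of both options together, assuming the mime_guess crate; is_compressible and the exact extension list are illustrative:

```rust
use std::path::Path;

// Hypothetical helper: deny images via the guessed MIME type, then
// allowlist known-compressible text extensions.
fn is_compressible(path: &Path) -> bool {
    let mime = mime_guess::from_path(path).first_or_octet_stream();
    if mime.type_() == mime_guess::mime::IMAGE {
        return false;
    }
    matches!(
        path.extension().and_then(|e| e.to_str()),
        Some("json" | "js" | "css" | "html")
    )
}
```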

@david-crespo
Contributor Author

While thinking about oxidecomputer/console#2029, it occurred to me that gzip should be very effective at reducing the size of big lists due to all the repetition of keys. And boy is it. This is a real response from dogfood containing 127 disks.

 55k disks.json
8.8k disks.json.gz

A counterpoint here is that 55k is already so small that there is little point in compressing. However, if we want to be able to scale nicely to fetching 1000 things, we're talking 433k vs. 69k (assuming gzip's savings scale linearly, which may even be conservative given the repeated keys), so it gets more plausible.

@seddonm1

seddonm1 commented Nov 7, 2024

I was thinking about this, and the new ServerBuilder (#1122) might make it possible to layer in tower-http middleware. I did a basic implementation here that works, but it would need something like the ServerBuilder to let users add configurable tower middleware.

seddonm1@d627821

@davepacheco
Collaborator

Yeah, the middleware pattern can be a nice fit for something like gzip, but we've explicitly avoided that pattern in Dropshot for the reasons mentioned here:
https://github.com/oxidecomputer/dropshot?tab=readme-ov-file#why-is-there-no-way-to-add-an-api-handler-function-that-runs-on-every-request

For something like this I'd be tempted to just bake that functionality into Dropshot itself.

@seddonm1

seddonm1 commented Nov 8, 2024

No worries, and I actually very much agree with your rationale, but maybe there is a difference between a custom logic flow (auth) and a generic HTTP operation like compression or CORS.

I will leave that branch sitting there for anyone who needs to solve this problem via vendoring plus a quick copy-paste.

@benjaminleonard

Just testing with some data from the oxql query endpoint.

 148k bytes_read.json
   6k bytes_read.json.gz

Individually not too bad, and we can probably optimise by selecting larger mean_within durations, but we're going to have multiple queries per page and they auto-refresh, so I think we'd see some real benefit from the compression.
