Double the speed of zfec #114

itamarst · 2024-11-15T16:01:32Z

Fixes #102

Twice as fast.
No longer supports 3.8 (possibly not actually necessary, I was thinking there was manylinux issues but could be wrong. In practice 3.8 is EOL anyway for security updates, so meh).
Won't install on Python 3.9.0-3.9.5 unless they update pip in a virtualenv... in which case pip will just install the previous release, so not actually a problem for users.
No longer supports Ubuntu 18.04 (but it's EOL too).

meejah

🎉

hacklschorsch · 2024-12-04T18:41:31Z

I have machines older than this ; what's the best way to run tahoe-lafs on them?

meejah · 2024-12-04T19:10:25Z

Can you be more specific about the machines in question?

(The generic answer would be "use a ZFEC library before version 1.6.0.0").

hacklschorsch · 2024-12-05T08:50:54Z

My home server is an Intel Atom second generation or so, it is x86-64-v1. This also broke two Tahoe-LAFS CI runners for Redhat-Based distros - @meejah says because their CPUs are too old - are you sure about your 2008/2009 estimation? It would be great if there was a documented way of running zfec on older hardware. What are the steps to build this myself without this optimization? Can I do it without editing the source?

hacklschorsch · 2024-12-05T09:26:14Z

I guess something like hwcaps checking is too much hassle? https://www.theregister.com/2022/12/16/tumbleweed_reverses_x864v2_plan/

The hwcaps feature in glibc allows detection and manipulations of the hardware capabilities of chips in various CPU families

hacklschorsch · 2024-12-05T09:33:58Z

https://pypi.org/project/mwa-hyperbeam/ say they offer multiple wheels targeting different microarchitecture levels:

What are these different x86-64 versions?

They are microarchitecture levels. By default, Rust compiles for all x86-64 CPUs; this allows maximum compatibility, but potentially limits the runtime performance because many modern CPU features can't be used. Compiling at different levels allows the code to be optimised for different classes of CPUs so users can get something that works best for them.

Looking at the download files for their latest version I don't see how they do it though

itamarst · 2024-12-05T16:42:05Z

There are multiple ways to handle different CPU microarchitectures:

Use oldest possible CPU target; this is the default.
Use a commonly-used CPU target, what we tried to do in this PR, which I guess is a problem.
Runtime dispatch inside the function, which is supported by C/C++/Rust and probably other languages, where you (a) compile multiple versions of a function and (b) choose one at runtime. I imagine this is annoying in C.
Import time! This is a fun Python-specific one, where you ship multiple copies of the extension compiled against different targets, and then at import time check in Python what CPU you have, and then import the appropriate one for the current CPU. I have prototyped this.

For zfec... maybe easier just to revert this change for now?

hacklschorsch · 2024-12-08T08:14:12Z

I am all for execution efficiancy and would be sad to see this reverted. "Import time" seems like a great solution to this - can I help with realizing it? Maybe with code review and/or testing?

pythonspeed added 5 commits November 15, 2024 10:56

Run faster!

e043f27

Drop 3.8, it's end of life

8397393

Mention gcc requirement

6c6e61c

Modern manylinux

27f43c4

Drop 3.8, update to released 3.13

2ccec24

itamarst had a problem deploying to release November 15, 2024 16:13 — with GitHub Actions Failure

meejah approved these changes Nov 15, 2024

View reviewed changes

meejah merged commit dedbae7 into tahoe-lafs:master Nov 15, 2024
46 of 47 checks passed

itamarst deleted the 102-speed-up branch November 15, 2024 17:22

itamarst mentioned this pull request Dec 5, 2024

Switch to newer x86-64 microarchitecture by default scientific-python/faster-scientific-python-ideas#11

Open

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Double the speed of zfec #114

Double the speed of zfec #114

itamarst commented Nov 15, 2024 •

edited

Loading

meejah left a comment

hacklschorsch commented Dec 4, 2024 •

edited

Loading

meejah commented Dec 4, 2024

hacklschorsch commented Dec 5, 2024 via email

hacklschorsch commented Dec 5, 2024 •

edited

Loading

hacklschorsch commented Dec 5, 2024

What are these different x86-64 versions?

itamarst commented Dec 5, 2024

hacklschorsch commented Dec 8, 2024 via email

Double the speed of zfec #114

Double the speed of zfec #114

Conversation

itamarst commented Nov 15, 2024 • edited Loading

meejah left a comment

Choose a reason for hiding this comment

hacklschorsch commented Dec 4, 2024 • edited Loading

meejah commented Dec 4, 2024

hacklschorsch commented Dec 5, 2024 via email

hacklschorsch commented Dec 5, 2024 • edited Loading

hacklschorsch commented Dec 5, 2024

What are these different x86-64 versions?

itamarst commented Dec 5, 2024

hacklschorsch commented Dec 8, 2024 via email

itamarst commented Nov 15, 2024 •

edited

Loading

hacklschorsch commented Dec 4, 2024 •

edited

Loading

hacklschorsch commented Dec 5, 2024 •

edited

Loading