Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parse GAF files in parallel #200

Merged
merged 6 commits into from
Dec 29, 2024
Merged

Parse GAF files in parallel #200

merged 6 commits into from
Dec 29, 2024

Conversation

sampsyo
Copy link
Collaborator

@sampsyo sampsyo commented Dec 29, 2024

This is truly of dubious utility, but it was sorta fun to experiment with. Presumably, a similar parallel treatment could be applied to the GFA parser someday?

Parsing the GAF in parallel scales pretty well in my experiments. On my M1 Max laptop (10 cores total: 8 performance, 2 efficiency), the parallel version goes from 5.131 seconds to 608.0 ms, so a speedup of 8.4×. Or in terms of lines per second, that's from about 9.6 million to 80.6 million. Nice!

I can't pretend this is actually all that insightful or surprising, but it might be useful.

@sampsyo sampsyo merged commit 49594a6 into main Dec 29, 2024
11 checks passed
@sampsyo sampsyo deleted the gaf-parallel branch December 29, 2024 03:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant