Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DNADIFF is killed by out of memory signal on SLURM #141

Closed
GallVp opened this issue Sep 24, 2024 · 1 comment · Fixed by #158
Closed

DNADIFF is killed by out of memory signal on SLURM #141

GallVp opened this issue Sep 24, 2024 · 1 comment · Fixed by #158
Assignees
Labels
bug Something isn't working fatal Crashes the pipeline
Milestone

Comments

@GallVp
Copy link
Member

GallVp commented Sep 24, 2024

Description of the bug

DNADIFF is killed by out of memory signal on SLURM

Command used and terminal output

No response

Relevant files

No response

System information

plant-food-research-open/assemblyqc Version 2.1.1

@GallVp GallVp added the bug Something isn't working label Sep 24, 2024
@GallVp GallVp self-assigned this Sep 24, 2024
@GallVp GallVp added this to the 2.2 milestone Sep 24, 2024
@GallVp GallVp added the fatal Crashes the pipeline label Sep 24, 2024
@GallVp
Copy link
Member Author

GallVp commented Oct 14, 2024

The OOM kill does not always result in exit code 12 or 137. It might be process or context specific. For example, the DNADIFF process from Plant-Food-Research-Open/assemblyqc/2.1.1 has an OOM exit code of 2. Whereas, MUMMER's OOM exit code is 137.
The pipeline uses the nf-core template's default retry on error strategy which covers error code 137 but does not cover error code 2. Error code 2 is for No such file or directory so perhaps it is not a good idea to retry on 2. For now, I am going to bump up the memory requirement for DNADIFF from 6.GB to 12.GB.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working fatal Crashes the pipeline
Projects
None yet
1 participant