Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Expected Run Time for RVS function multipleVariantPValue #5

Open
klmartinez opened this issue Jul 9, 2019 · 3 comments
Open

Expected Run Time for RVS function multipleVariantPValue #5

klmartinez opened this issue Jul 9, 2019 · 3 comments

Comments

@klmartinez
Copy link

When I the RVS function multipleVariantPValue on 2 exomes, it would take approximately 30 minutes. However, when I try to run the same function on 5 genome families, its been running for at least 10 days.

What is an expected run time for 5 genome families with this function?

I am trying to run this on my University's HPC but we have maximum wall times of 10 days. When the job has been terminated there are never any errors, just saying that it needed more time. I just want to make sure that this length of time is somewhat expected and doesn't instead point to another issue.

Thank you!

@sherman5
Copy link
Owner

sherman5 commented Jul 9, 2019

That's not expected at all, the running time should be on the scale of minutes not days. What is the size of the SnpMatrix that you are passing to the function?

@klmartinez
Copy link
Author

The dimensions of my SnpMatrix is [13, 3496630].

@sherman5
Copy link
Owner

We haven't previously tested RVS on data that big, so we were unaware of this bottleneck - I'm currently working on a solution (see PR #6)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants