New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

-max-reads 0 #21

Open

rotheconrad opened this issue Jul 8, 2021 · 0 comments

rotheconrad commented Jul 8, 2021 •

edited

Loading

Hello! I hope you are well. simka looks like a great tool!

How are the samples normalized with the -max-reads 0 flag? I did not see a description of this in the paper.

Have you considered normalization options such as suggested here:
https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003531 ?

Or transformation options suggested here:
https://www.frontiersin.org/articles/10.3389/fmicb.2017.02224/full ?

Since the default is not to normalize, is the intended workflow to subset all samples to the same number of reads prior to running simka?

Have you tested how much the size discrepancies actually affect the various distance metrics?

Thanks for the clarification.

best,
Roth

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment