You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When running Souporcell using the v3.0.0 of the Demuxafy pipeline and the snATAC-seq, I obtained results that differ significantly from those generated by using Souporcell as a standalone tool. This seems strange to me, cause I assumed that Demuxafy serves as a wrapper for various demultiplexing tools, so I expected the results to match. Upon inspecting the output files, I noticed that the following seven files were not generated when using the Demuxafy pipeline:
fastqs.done
minimap.err
remapping.done
retag.err
retagging.done
souporcell_minimap_tagged_sorted.bam
souporcell_minimap_tagged_sorted.bam.bai
Despite this, no errors were reported, and the pipeline appeared to complete all processes successfully. However, the droplet assignment results were drastically different between the two approaches.
For reference, here is the code using the Demuxafy pipeline:
And this is to run Souporcell without Demuxafy (except for the code to summarize the results, for that I used ‘souporcell_summary.sh’ script provided by Demuxafy):
With these codes, I got the following results when I demultiplexed three samples:
As you can tell, I used exactly the same code. The input files, parameters, and Souporcell version (v2.5) were also identical in both cases. The attached code examples do not use reference SNP genotypes, but the behavior was consistent when using reference SNP genotypes as well.
Questions:
What might be the reason for the difference in results between the two approaches? Are there undocumented preprocessing steps or any modifications to default options within the Demuxafy pipeline that could affect demultiplexing outputs? I also reviewed the publication but could not find relevant details about such discrepancies. Could similar differences arise when using other demultiplexing tools within Demuxafy?
When using the Demuxafy pipeline, I also found that Souporcell fails to run without providing SNP genotype information (i.e., you must use either --common_variants or --known_genotypes option). Is it intended behavior that Demuxafy requires SNP genotype information for Souporcell while the standalone tool does not?
Thank you in advance!
The text was updated successfully, but these errors were encountered:
I apologise for my delayed response. I'm pretty sure this is an issue with my wrapping script which is meant to help check for the same chr encoding for vcfs and bams but it doesn't appear to be parsing the arguments correctly. I'll update this in the new year but in the meantime, you can run the typical souporcell_pipeline.py from Demuxafy as well to double check that you get the same results. If you do run that, I would be interested to hear if you get the same results as running directly from the souporcell image.
I apologise for my delayed response. I'm pretty sure this is an issue with my wrapping script which is meant to help check for the same chr encoding for vcfs and bams but it doesn't appear to be parsing the arguments correctly. I'll update this in the new year but in the meantime, you can run the typical souporcell_pipeline.py from Demuxafy as well to double check that you get the same results. If you do run that, I would be interested to hear if you get the same results as running directly from the souporcell image.
Thanks for the reply.
I ran the souporcell_pipeline.py from Demuxafy using the same inputs, and indeed I've got the same results as running directly from the souporcell image (without Demuxafy), and also could run without providing SNP genotype information.
Could you also check other demultiplexing tools within Demuxafy to see if they might have similar problems when you update the script? Thank you so much for your time and effort.
When running Souporcell using the v3.0.0 of the Demuxafy pipeline and the snATAC-seq, I obtained results that differ significantly from those generated by using Souporcell as a standalone tool. This seems strange to me, cause I assumed that Demuxafy serves as a wrapper for various demultiplexing tools, so I expected the results to match. Upon inspecting the output files, I noticed that the following seven files were not generated when using the Demuxafy pipeline:
Despite this, no errors were reported, and the pipeline appeared to complete all processes successfully. However, the droplet assignment results were drastically different between the two approaches.
For reference, here is the code using the Demuxafy pipeline:
And this is to run Souporcell without Demuxafy (except for the code to summarize the results, for that I used ‘souporcell_summary.sh’ script provided by Demuxafy):
With these codes, I got the following results when I demultiplexed three samples:
As you can tell, I used exactly the same code. The input files, parameters, and Souporcell version (v2.5) were also identical in both cases. The attached code examples do not use reference SNP genotypes, but the behavior was consistent when using reference SNP genotypes as well.
Questions:
What might be the reason for the difference in results between the two approaches? Are there undocumented preprocessing steps or any modifications to default options within the Demuxafy pipeline that could affect demultiplexing outputs? I also reviewed the publication but could not find relevant details about such discrepancies. Could similar differences arise when using other demultiplexing tools within Demuxafy?
When using the Demuxafy pipeline, I also found that Souporcell fails to run without providing SNP genotype information (i.e., you must use either --common_variants or --known_genotypes option). Is it intended behavior that Demuxafy requires SNP genotype information for Souporcell while the standalone tool does not?
Thank you in advance!
The text was updated successfully, but these errors were encountered: