-
Notifications
You must be signed in to change notification settings - Fork 125
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why Pairwise alignment (--allpairs_global) only support positive strand? #576
Comments
Perhaps the If all the sequences are fairly similar, I don't think you'll need many sequences in the database for the |
Thank you. I will try |
We'll consider adding |
hello @billzt here is a possible workaround, using a tiny dataset for demonstration purpose:
FASTA_FILE=$(mktemp)
printf ">s1\nAAAA\n>s2\nAAAT\n>s3\nTTTT\n" > "${FASTA_FILE}"
(
cat "${FASTA_FILE}"
vsearch \
--fastx_revcomp "${FASTA_FILE}" \
--quiet \
--label_suffix "_rv" \
--fastaout -
) | \
vsearch \
--allpairs_global - \
--id 0.75 \
--iddef 1 \
--quiet \
--blast6out -
rm "${FASTA_FILE}" We obtain the expected results, equivalent to a search on both strands:
Warning: doubling the size of a dataset quadruples computation time. |
I have a set of sequences that are definitely homologous. However some of them are in negative strand and I don't know which of them are. Therefore I cannot use
--allpairs_global
directly since sequences are aligned on their plus strand only. Any other good suggestions?The text was updated successfully, but these errors were encountered: