Fasta dereplication
Yoann Dufresne edited this page Nov 29, 2017 · 1 revision
This module uses the dereplication feature of the Vsearch tool to merge duplicate (identical) sequences in a FASTA file. The dereplicated sequences are written to a FASTA file, with the number of merged reads recorded in each sequence header.
- FASTA file: The FASTA file containing all the sequences to dereplicate.
- Dereplicated FASTA file: The file containing all the merged sequences. For each sequence, the header contains a size annotation of the form ;size=234;.
- Vsearch GitHub: https://github.com/torognes/vsearch
- Original publication: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3150044/
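
To make the output format concrete, here is a minimal Python sketch of what dereplication does conceptually. It is not Vsearch's implementation and not the code this module runs: the function names `parse_fasta` and `dereplicate` are hypothetical, and the sketch assumes that sequences are merged only when they are identical over their full length (ignoring case), that the first header seen is kept, and that records are sorted by decreasing abundance. In Vsearch itself, comparable behaviour is available through options such as `--derep_fulllength` together with `--sizeout`; whether this module calls exactly those options is an assumption.

```python
def parse_fasta(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def dereplicate(text):
    """Merge identical sequences and annotate each header with ;size=N;.

    Assumptions of this sketch: comparison is case-insensitive and
    full-length, and the header of the first occurrence is kept.
    """
    groups = {}  # uppercased sequence -> [first header, count]
    for header, seq in parse_fasta(text):
        key = seq.upper()
        if key in groups:
            groups[key][1] += 1
        else:
            groups[key] = [header, 1]
    # Most abundant first; sorted() is stable, so ties keep input order.
    ordered = sorted(groups.items(), key=lambda item: -item[1][1])
    return "".join(f">{h};size={n};\n{s}\n" for s, (h, n) in ordered)


example = ">r1\nACGT\n>r2\nTTGA\n>r3\nacgt\n>r4\nACGT\n"
print(dereplicate(example), end="")
```

With the four example reads above, the three copies of `ACGT` collapse into one record with header `r1;size=3;`, and `TTGA` is kept as `r2;size=1;`.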