sdm

Simple demultiplexing This project started as a simple sequence demultiplexer, because I needed a fast and memory efficient program for this. Since then (5 years ago) it has grown in scope and complexity and is by now part of most of my pipelines, as the initial filtering step, that is able to handle a variety of standard sequence inputs (fasta / fastq all versions / gz of these), detecting inputs and even handling corrupt files (till the point where they are corrupted). Thus this tool is a basic bioinformatic program, written in C++11 for fast and efficient functionality. It's main purposes are:

parsing between DNA sequence formats (fasta / fastq)
demultiplexing multi sample sequence files (based on DNA sequence or header keywords)
removing sequencing / PCR primer from reads
filtering DNA sequences based on a multitude of quality parameters
fast extraction of a subset of DNA sequences, based on header names (also works for paired end reads)
combining several inputs into one (filtered) sequence file

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
include		include
.gitignore		.gitignore
Benchmark.h		Benchmark.h
DNAconsts.cpp		DNAconsts.cpp
DNAconsts.h		DNAconsts.h
Demultipl.cpp		Demultipl.cpp
FastxReader.cpp		FastxReader.cpp
FastxReader.h		FastxReader.h
IO.cpp		IO.cpp
IO.h		IO.h
IOMultithreaded.h		IOMultithreaded.h
InputStream.cpp		InputStream.cpp
InputStream.h		InputStream.h
LICENSE		LICENSE
LICENSE_gzstream		LICENSE_gzstream
LICENSE_zstr		LICENSE_zstr
Makefile		Makefile
MergeReads.h		MergeReads.h
README.md		README.md
ReadMerger.cpp		ReadMerger.cpp
ReadMerger.h		ReadMerger.h
Statistics.cpp		Statistics.cpp
Statistics.h		Statistics.h
StatisticsMultithreaded.h		StatisticsMultithreaded.h
ThreadPool.h		ThreadPool.h
ThreadPool_JF.h		ThreadPool_JF.h
containers.cpp		containers.cpp
containers.h		containers.h
gzipstream.cpp		gzipstream.cpp
gzipstream.h		gzipstream.h
gzstream.h		gzstream.h
khash.hh		khash.hh
strict_fstream.hpp		strict_fstream.hpp
targetver.h		targetver.h
zlib.h		zlib.h
zstr.cpp		zstr.cpp
zstr.h		zstr.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

sdm

About

Licenses found

Releases 23

Packages

Contributors 3

Languages

License

Licenses found

hildebra/sdm

Folders and files

Latest commit

History

Repository files navigation

sdm

About

Resources

License

Licenses found

Stars

Watchers

Forks

Releases 23

Packages 0

Contributors 3

Languages

Packages