This repository focuses on UG sub-lineage mutations and drug resistance mutation discovery for Mycobacterium tuberculosis (MTB). It leverages tools like FastQC, MultiQC, and Snippy for rapid bacterial SNP genotype and drug resistance calling (Version 1.0.0).
Gerald Mboowa
Snippy identifies SNPs between a haploid reference genome and HTS datasets. It detects both substitutions (SNPs) and insertions/deletions (indels). Snippy is designed for speed and efficiency, supporting multi-threading on a single machine. It produces consistent output files in a single folder. (Tested with Snippy 4.3.2 - Torsten Seemann's Snippy).
-
FastQC
Provides quality control checks on raw sequence data from high-throughput sequencing pipelines. -
MultiQC
Aggregates analysis logs into a single HTML report, summarizing outputs from multiple tools like FastQC. -
Snippy
Processes samples for SNP discovery and annotation. -
MTB Genotypes and Drug Resistance Annotations
Identifies MTB sub-lineages and drug resistance mutations, including MDR and XDR strains. -
Redundant File Cleanup
Deletes unnecessary files generated during the pipeline execution.
The pipeline generates the following output files:
- Text files detailing MTB genotypes
- Text files summarizing drug resistance mutations for both MDR and XDR strains
For suggestions or to report bugs, please use the Issue Tracker.
- FastQC
- MultiQC
- Perl (>= 5.12)
- Modules:
Time::Piece
(core with modern Perl),Bioperl
(>= 1.6)
- Modules:
- BWA MEM (>= 0.7.12)
- Minimap2 (>= 2.0)
- Samtools (>= 1.7)
- Bedtools (>= 2.0)
- BCFtools (>= 1.7)
- GNU Parallel (>= 2013xxxx)
- FreeBayes (>= 1.1)
- Includes:
freebayes
,freebayes-parallel
,fasta_generate_regions.py
- Includes:
- VCFtools (vcflib) (>= 1.0)
- Includes:
vcfstreamsort
,vcfuniq
,vcffirstheader
- Includes:
- VT (>= 0.5)
- ReadSeq (>= 2.0)
- SnpEff (>= 4.3)
- Samclip (>= 0.2)
- Seqtk (>= 1.2)
- Snp-Sites (>= 2.0)
- Wgsim (>= 1.8) (for testing only)
This pipeline has been tested on macOS Mojave using precompiled binaries.
Ensure the listed dependencies are properly installed and configured before running the pipeline. Refer to the respective tool documentation for detailed installation and usage guidelines.
For further inquiries or collaborations, please contact the author.