title | author | date | output |
---|---|---|---|
hCNVbundles project- ELIXIR UK |
Khaled Jumah, Krzysztof Poterlowicz |
11/04/2022 |
html_document |
Although a number of the CNV detectiong tools have been developed over the recent years (ref Khaled) only a few of them were intergated into the Galaxy and only couple of them are suported and functional (Khaled put tools table here).
Whole exome sequning CNV detection tools according to [reference paper]
Tool | Reference | Ural | Availabel on tool shed |
---|---|---|---|
Control-FREEC | 1 | 1 | Yes / owner |
CoNIFER | 2 | 2 | No |
XHMM | 3 | 3 | No |
ExomeCNV | 4 | 4 | No |
CONTRA | 5 | 5 | Yes / owner |
CONDEX | 6 | 6 | No |
SeqGene | 7 | 7 | No |
PropSeq | 8 | 8 | No |
VarScan2 | 9 | 9 | Yes / alternative / alternative 2 /owner1-2 / owner2 |
ExoCNVTest | 10 | 10 | No |
ExomeDepth | 11 | 11 | Yes / Owner |
XCAVATOR | NA | 12 | NO |
CNVkit | 13 | 13 | No |
Galaxy training network community [web reference] provide a copmprehensive tutotial [ web link to the tutorial / https://training.galaxyproject.org/training-material/topics/dev/tutorials/tool-from-scratch/tutorial.html] to instruct the reader in the full process of integrating a tool into Galaxy thorugh the process of
- the creation of a bioconda recipe for a new tool
- writing a Galaxy tool wrapper
- finally the testing and deployment of this tool into both a local and public Galaxy environment.
This document present a case study of intergrating CNVkit into the Galaxy using the above tutorial
Choosing a specific tool to carry on the analysis require information about CNVs detection accuracy, execution time and required infrastructure.
Our plan includes :
-
Do a benchmark test for the CNV detection workflow in Galaxy to measure the run time and detected CNV
-
Compare them with a reference CNV test.
The reference CNV workflow is available in the reference article below.
https://www.nist.gov/programs-projects/genome-bottle
The workflow and the tools/code used
https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/MHC/e2e_notebooks
Usually, The preprocessing steps for CNVs data (from the quality control to the mapping step) are the similar for all CNVs detecting workflows. The changes occur in the CNVs detection step.
Our plan is to create a tutorial that allows the user to understand the CNVs detection process and give guidance on how to choose between the available CNVs tools.
The process to create this tutorial is by:
-
View the input requirements for some of the available CNVs tools.
-
Create a shared prepossessing workflow for different tools data.
-
Make the tool that requires the longest prepossessing as the main tool in the tutorial and locate the sections where we can introduce the different CBV tools for the data.
-
add links for additional reading to provide the user with further knowledge on how to use a specific CNV tool
-
Locate most of the CNV detecting tools available in/outside galaxy
Our current progress can be found here tutorial that uses Contol-freec to detect CNVs which also can be the backbone tutorial for this cause.