Skip to content

poterlowicz-lab/hcnv-uk

Repository files navigation

title author date output
hCNVbundles project- ELIXIR UK
Khaled Jumah, Krzysztof Poterlowicz
11/04/2022
html_document

WP3 - Exploitation of the datasets by the Galaxy Community.

1. Intergrating a Copy number variant (CNV) detecting tools into Galaxy.

Although a number of the CNV detectiong tools have been developed over the recent years (ref Khaled) only a few of them were intergated into the Galaxy and only couple of them are suported and functional (Khaled put tools table here).

Whole exome sequning CNV detection tools according to [reference paper]

Tool Reference Ural Availabel on tool shed
Control-FREEC 1 1 Yes / owner
CoNIFER 2 2 No
XHMM 3 3 No
ExomeCNV 4 4 No
CONTRA 5 5 Yes / owner
CONDEX 6 6 No
SeqGene 7 7 No
PropSeq 8 8 No
VarScan2 9 9 Yes / alternative / alternative 2 /owner1-2 / owner2
ExoCNVTest 10 10 No
ExomeDepth 11 11 Yes / Owner
XCAVATOR NA 12 NO
CNVkit 13 13 No

Galaxy training network community [web reference] provide a copmprehensive tutotial [ web link to the tutorial / https://training.galaxyproject.org/training-material/topics/dev/tutorials/tool-from-scratch/tutorial.html] to instruct the reader in the full process of integrating a tool into Galaxy thorugh the process of

  • the creation of a bioconda recipe for a new tool
  • writing a Galaxy tool wrapper
  • finally the testing and deployment of this tool into both a local and public Galaxy environment.

This document present a case study of intergrating CNVkit into the Galaxy using the above tutorial

2. Benchmarking hCNV detection tools.

Choosing a specific tool to carry on the analysis require information about CNVs detection accuracy, execution time and required infrastructure.

Our plan includes :

  1. Do a benchmark test for the CNV detection workflow in Galaxy to measure the run time and detected CNV

  2. Compare them with a reference CNV test.

The reference CNV workflow is available in the reference article below.
https://www.nist.gov/programs-projects/genome-bottle

The workflow and the tools/code used
https://github.com/NCBI-Hackathons/TheHumanPangenome/tree/master/MHC/e2e_notebooks

WP4 - Training and dissemination

Usually, The preprocessing steps for CNVs data (from the quality control to the mapping step) are the similar for all CNVs detecting workflows. The changes occur in the CNVs detection step.

Our plan is to create a tutorial that allows the user to understand the CNVs detection process and give guidance on how to choose between the available CNVs tools.

The process to create this tutorial is by:

  1. View the input requirements for some of the available CNVs tools.

  2. Create a shared prepossessing workflow for different tools data.

  3. Make the tool that requires the longest prepossessing as the main tool in the tutorial and locate the sections where we can introduce the different CBV tools for the data.

  4. add links for additional reading to provide the user with further knowledge on how to use a specific CNV tool

  5. Locate most of the CNV detecting tools available in/outside galaxy

Our current progress can be found here tutorial that uses Contol-freec to detect CNVs which also can be the backbone tutorial for this cause.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages