Skip to content

Latest commit

 

History

History
61 lines (34 loc) · 3.48 KB

tutorial_dx_web.md

File metadata and controls

61 lines (34 loc) · 3.48 KB

Tutorial for DNAnexus Platform (web)

All test samples and genome data are shared on our public DNAnexus project. You don't have to download any data for testing our pipeline on DNAnexus platform.

There are two methods to run our pipeline on DNAnexus.

  1. Building your own DX workflow from chip.wdl with dxWDL (CLI)
  2. Using a pre-built DX workflow on our public DX project (Web UI)

This document describes instruction for the item 2).

  1. Sign up for a DNAnexus account.

  2. Create a new DX project by clicking on "+New Project" on the top left.

  3. Move to one of the following workflow directories according to the platform you have chosen for your project (AWS or Azure). These DX workflows are pre-built with all parameters defined.

  1. Copy it to your project by right-clicking on the DX workflow chip and choose "Copy".

  2. Choose your project and create a folder for the test run by clicking on the "Folder+" icon.

  3. Click on "Copy into this folder" on the bottom left.

  4. Move to the target folder and click on the DX workflow chip.

  5. Specify an output directory by clicking "Workflow Actions" on the top right. Click on "Set output folder" and choose an output folder.

  6. Click on "Run as Analysis..." and you will be automatically redirected to the "Monitor" tab.

  7. It will take about 6 hours. You will be able to find all outputs on your output folder. Final QC report (qc.html)/JSON (qc.json) will be found on it.

  8. See full specification for input JSON file.

Extras for advanced users

  1. DNAnexus allows only one copy of a workflow per project. The example workflow in the previous section is pre-built for the subsampled test sample ENCSR936XTK with all parameters defined already.

  2. Choose your main platform (AWS or Azure). Move to ENCODE ChIP-seq pipeline repository for AWS or ENCODE ChIP-seq pipeline repository for Azure.

  3. Choose a folder with the latest available version.

  4. Copy one of the following workflows according to the platform you have chosen for your project.

IMPORTANT: Make sure that you have chosen a correct platform (AWS or Azure) for your project.

  • general: General workflow without pre-defined reference genome.
  • hg38: Worfklow with pre-defined hg38 reference genome.
  • hg19: Worfklow with pre-defined hg19 reference genome.
  • mm10: Worfklow with pre-defined mm10 reference genome.
  • mm9: Worfklow with pre-defined mm9 reference genome.
  1. Click on the DX workflow chip.

  2. Specify your input files (FASTQs, BAMs, TAG-ALIGNs, ...) on the top left. For example, click on the item "fastqs_rep1_R1" and choose your R1 FASTQ file for replicate 1. See details here for other input types.

  3. Choose a reference genome. See details here.

  4. Click on "Run as Analysis..." and you will be automatically redirected to the "Monitor" tab.