-
Notifications
You must be signed in to change notification settings - Fork 102
Home
- Integrate latest (Apr. 2015) Gene Myers' daligner code
- Supporting read length up to 100kb from current 64kb
- New consensus code which process reads from diploid genomes better
- Logging for tracking job submission
-
it uses up-to-date Gene’s daligner code. It helps to support read > 64kb and helps for tracking read identifiers through the pipeline (which is important for diploid assembly in the future). The cost is that the new code use twice more memory if the daligner parameters is the same before. One needs to use the “-M” option to set limit on the memory usage. For human assembly, we tested with
-M 32
for using 32G RAM for each daligner. Other possibilities are under investigation. -
You will need a new option
--output_dformat
for the configuration variablefalcon_sense_option
. It is used inLA4Falcon
andfc_consensus.py
to generate p-read fasta files that are compatible to DAZZ_DB'sfasta2DB
for creating p-read database.
Falcon is not meant to be a "push-button" assembler (yet) except for very small genomes. To get best results, one might need some knowledge about various distributed computation system and some basic genome assembly theory. I will collect some question and answer in the FAQ page which might help you.