-
Notifications
You must be signed in to change notification settings - Fork 102
Assembled genome size under estimated #581
Comments
Hi, You have sufficient raw coverage (almost 90-fold) so there should be plenty of raw data to get a decent draft assembly. What I see from your pre-asesmbled statistics, however is that you only have 5-6fold coverage of pre-assembled reads You will never achieve a contiguous or complete assembly with only 5-fold preads, you need closer to 15-25-fold pread coverage above a certain length threshold if you want to achieve a highly contiguous assembly. You need to start by troubleshooting your pre-assembly. Also, we generally don't recommend using the -a option in DBsplit : Including the Hope this helps |
I will restart from your recommendation and I will update results. Thanks. |
Here is follow-up result. I removed -a option. preads_stat
p_ctg_stat
|
I agree pread correction doesn't appear to be proceeding as efficiently as it should be, which is still what's limiting the assembly. However, I also notice in your latest assembly that you're not using the same full dataset as the previous two assemblies, so it's not a fair apples to apples comparison. (look at the Raw read stats - the latest version appears to be starting with roughly 23-fold less coverage (~90X vs ~67X)) One thing I notice is that between your initial assembly, and your second version is that you raised the raw read overlapping parameter from |
Greg, I used same input but I don't know why did it happen. Thank you for your help. Won |
Hello folks @pb-cdunn @mseetin @pb-jchin
Our genome size is 800Mbp and looks like highly repeat genome.
I ran Falcon twice with different options, but it generated underestimated genome size.
The problems are almost no overlap between reads and underestimated p_ctg size.
Can you please help to tune the assembly config?
Thanks in advance.
First run
fc.cfg
raw_reads_statistics
preassembly_stat
overlap_histogram
OvlpHist_1.pdf
p_ctg_statistics
Second run
fc.cfg
raw_reads_stat
OvlpHist_2.pdf
preassemble_stat
p_ctg_stat
The text was updated successfully, but these errors were encountered: