You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for the updated, that looks great.
I already tried out haploRILs this morning.
The functions works for my data. (but I have to modify the path for the haploRILs_function.R)
I would like to ask for your opinion about the parameter setting {nSnp} {step} {K}.
I have 2.7M SNP in total, but I guess it would be better to use the subset ~50K SNP. And compare the stability of the results.
Thus, what values would you recommend to use with ~50K SNP for 10 chromosomes of maize dataset?
P.S. I got a lot of warning messages when running the code, maybe you can check for it.
"summarise() has grouped output by 'id', 'nSnp', 'K', 'blocksFiltered'. You
can override using the .groups argument."
Thus, what values would you recommend to use with ~50K SNP for 10 chromosomes of maize dataset?
Hard to say. It depends on the resolution you aim to obtain, the marker size of your data and the genotyping error rates you expect in your data. I run a simulation-based benchmarking analysis on haploRILs that suggested that combinations of small window sizes, using low nSnp, with higher filtering controlled by K produces the best performance, especially with genotyping errors.
The functions works for my data. (but I have to modify the path for the haploRILs_function.R)
Thanks for reporting the bug!
P.S. I got a lot of warning messages when running the code, maybe you can check for it. "summarise() has grouped output by 'id', 'nSnp', 'K', 'blocksFiltered'. You can override using the .groups argument."
Thanks, I'm aware. dplyr::sumarise prints that annoying warning. It's possible to deactivate it but I found out that it's less risky to let it happen. I will fix it at some point.
The text was updated successfully, but these errors were encountered:
Thanks for the information "small window sizes, using low nSnp, with higher filtering controlled by K produces the best performance", I will try out with this principle.
Thanks for the updated, that looks great.
I already tried out haploRILs this morning.
The functions works for my data. (but I have to modify the path for the haploRILs_function.R)
I would like to ask for your opinion about the parameter setting {nSnp} {step} {K}.
I have 2.7M SNP in total, but I guess it would be better to use the subset ~50K SNP. And compare the stability of the results.
Thus, what values would you recommend to use with ~50K SNP for 10 chromosomes of maize dataset?
P.S. I got a lot of warning messages when running the code, maybe you can check for it.
"
summarise()
has grouped output by 'id', 'nSnp', 'K', 'blocksFiltered'. Youcan override using the
.groups
argument."Best regards,
Yan-Cheng
Originally posted by @yan-cheng-lin in GoliczGenomeLab/haploMAGIC#2 (comment)
Hard to say. It depends on the resolution you aim to obtain, the marker size of your data and the genotyping error rates you expect in your data. I run a simulation-based benchmarking analysis on haploRILs that suggested that combinations of small window sizes, using low nSnp, with higher filtering controlled by K produces the best performance, especially with genotyping errors.
Thanks for reporting the bug!
Thanks, I'm aware. dplyr::sumarise prints that annoying warning. It's possible to deactivate it but I found out that it's less risky to let it happen. I will fix it at some point.
The text was updated successfully, but these errors were encountered: