This package only takes BND notation vcf ? #34
Comments
I have a similar question here. Probably also related to #33.
@yangyxt sorry for the late reply. Can you post which version of DELLY you're using, and a VCF with a few entries in it?
That is actually possible for |
Sorry for the long delay - I'm currently updating the documentation to better describe the design of StructuralVariantAnnotation.
StructuralVariantAnnotation already does this in breakpointRanges(). I have test cases for VCFs produced by crest, delly, gridss, manta, pindel, tigra, lumpy, and others.
Any spec-compliant VCF representation (plus a few caller-specific ones I have special-case code for). That is, sequence, symbolic, breakpoint, and breakend notations are all supported. For example, StructuralVariantAnnotation can correctly parse the following VCF:
|
What is not immediately clear from the docs is that SVA turns everything into breakpoint notation. In the delly example by the OP, SVA turns DUP000000000 into breakpoint notation, which is why the output includes DUP000000000_bp1 and DUP000000000_bp2, and why INV00026615_bp4 exists (since an inversion has 2 breakpoints = 4 breakends).
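To make the "2 breakpoints = 4 breakends" arithmetic concrete, here is a minimal Python sketch of how a symbolic DEL, DUP, or INV record can be expanded into breakend records using the ALT bracket syntax from the VCF specification. It is an illustration only, not the package's code: the function name `symbolic_to_breakends`, the `Breakend` tuple, and the placeholder reference base `N` are all hypothetical, and the `_bpN` suffixes simply mirror the IDs quoted above rather than reproducing how the package assigns them.

```python
from typing import List, NamedTuple


class Breakend(NamedTuple):
    chrom: str
    pos: int
    id: str
    alt: str


def symbolic_to_breakends(chrom: str, pos: int, end: int, svtype: str,
                          sv_id: str, ref: str = "N") -> List[Breakend]:
    """Expand one symbolic SV record (POS, END, SVTYPE) into breakends.

    `ref` is a placeholder base; a real converter would look up the
    reference base at each breakend position.
    """
    def loc(p: int) -> str:
        return f"{chrom}:{p}"

    if svtype == "DEL":
        # One breakpoint: the base before the deletion joins the base after it.
        records = [(pos, f"{ref}[{loc(end + 1)}["),
                   (end + 1, f"]{loc(pos)}]{ref}")]
    elif svtype == "DUP":
        # One breakpoint: the end of the duplicated segment joins its start.
        records = [(end, f"{ref}[{loc(pos + 1)}["),
                   (pos + 1, f"]{loc(end)}]{ref}")]
    elif svtype == "INV":
        # Two breakpoints, one at each edge of the inverted segment.
        records = [(pos, f"{ref}]{loc(end)}]"),
                   (end, f"{ref}]{loc(pos)}]"),
                   (pos + 1, f"[{loc(end + 1)}[{ref}"),
                   (end + 1, f"[{loc(pos + 1)}[{ref}")]
    else:
        raise ValueError(f"unsupported SVTYPE: {svtype}")

    return [Breakend(chrom, p, f"{sv_id}_bp{i}", alt)
            for i, (p, alt) in enumerate(records, start=1)]


# An inversion yields four breakend records, a deletion two.
inversion = symbolic_to_breakends("2", 321681, 421681, "INV", "INV1")
deletion = symbolic_to_breakends("1", 100, 200, "DEL", "DEL1")
```

The inversion coordinates are taken from the inversion example in the VCF specification, so the four ALT strings can be checked against the breakend records listed there.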
I tried to convert VCF records to GRanges objects and use the breakpointGRangesToVCF function to normalise symbolic records to BND VCF records.
However, I found that this does not work, because the symbolic records are stored in the GRanges object as ranges with width > 1, and there is an assertion in .toVcfBreakendNotationAlt, all(width(gr)==1), so records in a GRanges object derived from symbolic VCF records will always fail it. I tested this with a simple DELLY-generated SV VCF file. Here is a screenshot of the GRanges object returned by breakpointRanges(vcf):
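The failure mode described here can be reproduced in a few lines, independent of the package. The Python sketch below assumes, without confirmation from this thread, that the extra width comes from a caller-reported confidence interval (such as CIPOS) being folded into the range. All function names are hypothetical and intervals are 1-based and inclusive.

```python
from typing import Iterable, Tuple

Interval = Tuple[int, int]  # 1-based, inclusive (start, end)


def width(iv: Interval) -> int:
    """Number of positions covered by an inclusive interval."""
    return iv[1] - iv[0] + 1


def all_width_one(intervals: Iterable[Interval]) -> bool:
    """Analogue of the all(width(gr)==1) check quoted above."""
    return all(width(iv) == 1 for iv in intervals)


def with_confidence_interval(pos: int, cipos: Tuple[int, int]) -> Interval:
    """Widen a nominal position by a (negative, positive) offset pair."""
    return (pos + cipos[0], pos + cipos[1])


def collapse_to_nominal(iv: Interval, cipos: Tuple[int, int]) -> Interval:
    """Recover the single nominal position from a widened interval."""
    nominal = iv[0] - cipos[0]
    return (nominal, nominal)


cipos = (-5, 5)
widened = with_confidence_interval(1000, cipos)   # covers 995..1005
collapsed = collapse_to_nominal(widened, cipos)   # back to 1000..1000
```

Under that assumption, a width-1 check rejects the widened range and accepts the collapsed one, so any conversion back to breakend notation needs exact positions rather than intervals.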
So, generally speaking, StructuralVariantAnnotation cannot normalise the format of SV records in VCF files from different callers? Would it be better to do the normalisation myself, that is, convert all symbolic records to BND notation and then load the VCF into StructuralVariantAnnotation?