Add utilities to update Echo filters and reference genotypes #8867
base: EchoCallset
LGTM, though this should target the `EchoCallset` branch rather than `ah_var_store`.
site_filtering_data : list[str]
    Paths to site filtering files.
vets_filtering_data : list[str]
    Paths to VETS/VQSR filtering files.
`ploidy_data` is missing from these docs.
thanks for catching this!
hl.vds.VariantDataset(
    reference_data=reference_data, variant_data=variant_data
).write(output_path, overwrite=True)
- The `overwrite=True` makes me nervous, though my teammates may overrule me on that 😅. This could also be a (defaulted) function parameter.
- It might be useful to return the `VariantDataset` object so that callers can introspect after these transformations.
Suggested change:
- hl.vds.VariantDataset(
-     reference_data=reference_data, variant_data=variant_data
- ).write(output_path, overwrite=True)
+ vds = hl.vds.VariantDataset(reference_data=reference_data, variant_data=variant_data)
+ vds.write(output_path)
+ return vds
sure, i can do this.
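A minimal sketch of the two suggestions in this thread, assuming a hypothetical helper named `write_and_return`: `overwrite` becomes a parameter that defaults to `False`, and the written object is handed back to the caller. It is typed against any object exposing `write(path, overwrite=...)` so that it runs without Hail installed; in this PR the argument would presumably be the `hl.vds.VariantDataset(...)` built from `reference_data` and `variant_data`.

```python
from typing import Protocol, TypeVar


class Writable(Protocol):
    """Anything with a Hail-style write method (an assumption for this sketch)."""

    def write(self, path: str, overwrite: bool = False) -> None: ...


T = TypeVar("T", bound=Writable)


def write_and_return(dataset: T, output_path: str, overwrite: bool = False) -> T:
    """Write `dataset` to `output_path` and return it for later inspection.

    `overwrite` defaults to False so that replacing existing output is an
    explicit decision made by the caller, not a hard-coded behavior.
    """
    dataset.write(output_path, overwrite=overwrite)
    return dataset
```

Callers that really do want to replace an existing dataset can still pass `overwrite=True`, but the destructive path is no longer the default.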
avros : Sequence[str]
    List of paths for raw site filtering data
site_path : str
    Path to site filters table where, if a hail table exists, it will be read, unless ``force`` is true
This update is backwards. The `force` parameter will always recreate the table.
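For reference, the semantics under discussion can be sketched as below, assuming the intended behavior is: read the existing table unless `force` is set, and always rebuild when it is. `read_or_build` and its callable parameters are hypothetical names for illustration, not the PR's code; in Hail the `exists`, `read`, and `write` roles would presumably be filled by something like `hl.hadoop_exists`, `hl.read_table`, and `Table.write`.

```python
import os
from typing import Callable, TypeVar

T = TypeVar("T")


def read_or_build(
    path: str,
    build: Callable[[], T],
    read: Callable[[str], T],
    write: Callable[[T, str], None],
    exists: Callable[[str], bool] = os.path.exists,
    force: bool = False,
) -> T:
    """Return the table stored at `path`, rebuilding it when required.

    The table is rebuilt and rewritten when `force` is true or when
    nothing exists at `path`; otherwise the stored copy is read back.
    """
    if force or not exists(path):
        table = build()
        write(table, path)
        return table
    return read(path)
```

Whichever wording the docstring ends up with, it should match the branch condition: `force=True` never reads the existing table.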
fe960cf to f94a5ad
This should prevent us from needing to localize the data to run the pipeline.