The probability of overlap between the query and target, plus a TF-IDF calculation, was added to `sourmash scripts multisearch` here: sourmash-bio/sourmash_plugin_branchwater#458

However, `multisearch` is completely in-memory and may not be performant for larger databases. Thus, I want to benchmark whether we can compute our own probability of overlap and TF-IDF with polars, using the kmer parquet file created by the `kmerseek.sketch.sketch` method.