Lucene's dynamic numeric range faceting is a cool auto-ranging feature that looks at the distribution of values for a numeric field among all collected results and picks "good" ranges by roughly evenly distributing another field (relevance, counts) across the requested N ranges.
It has recently seen some exciting optimizations: apache/lucene#13914
Let's get some coverage in our benchmarks, and maybe nightly benchmarks?
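For readers unfamiliar with the feature, here is a rough sketch of the auto-ranging idea described above. It is not Lucene's implementation: the `Range` class and `dynamic_ranges` function are hypothetical names, and it assumes every hit carries a positive integer weight (for example 1 per hit to balance counts, or a popularity value).

```python
from dataclasses import dataclass


@dataclass
class Range:
    lo: int      # smallest value in the range (inclusive)
    hi: int      # largest value in the range (inclusive)
    count: int   # number of hits that fell into the range
    weight: int  # total weight of those hits


def dynamic_ranges(hits, n):
    """Split (value, weight) hits into at most n ranges of roughly equal weight.

    Illustrative only: assumes positive weights, and may split hits that
    share the same value across two adjacent ranges.
    """
    if n <= 0 or not hits:
        return []
    ordered = sorted(hits)                  # order hits by numeric value
    total = sum(w for _, w in ordered)
    ranges = []
    acc = 0                                 # weight seen so far, over all ranges
    start = 0                               # index of first hit in the open range
    for i, (_, weight) in enumerate(ordered):
        acc += weight
        is_last = i == len(ordered) - 1
        # Close the open range once the cumulative weight reaches the next
        # multiple of total / n (compared in integers to avoid float error).
        if acc * n >= total * (len(ranges) + 1) or is_last:
            chunk = ordered[start:i + 1]
            ranges.append(Range(
                lo=chunk[0][0],
                hi=chunk[-1][0],
                count=len(chunk),
                weight=sum(w for _, w in chunk),
            ))
            start = i + 1
    return ranges


# Four light hits and one heavy hit, split into two ranges by weight.
example = dynamic_ranges([(1, 1), (2, 1), (3, 1), (4, 1), (100, 4)], 2)
```

In this example the four light hits end up in one range and the single heavy hit in the other, since each group carries half of the total weight. A benchmark would mostly be exercising the sort/selection step over the collected values, which is why a corpus with realistic numeric distributions matters.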
To add comprehensive benchmarks for dynamic numeric faceting, we would also need a corpus that has "many numbers." Options include the Wikipedia line files (day_of_year, etc.), the NYC taxis corpus, or even the OpenStreetMap corpus (which may have suitable numeric fields). Random/synthetic datasets are discouraged because they are more likely to lead to random/synthetic conclusions.
Related work: GH#325 and GH#160 both add datasets for benchmarks.