-
Notifications
You must be signed in to change notification settings - Fork 3.7k
Setting search parameters for one query
Faiss indexes have their search-time parameters as object fields.
However, it can be useful to set these parameters separately per query.
For example, for an IndexIVF
, one query vector may be run with nprobe=10
and another with nprobe=20
.
This is problematic when the searches are called from different threads.
To support this use case, a SearchParameter
object can be provided as the last argument of the search()
function.
Index types that accept search parameters have a corresponding SearchParameter
child object.
For example, IndexIVFPQ
has a SearchParameterIVFPQ
object.
The search will look like:
D, I = index.search(xq, 10, params=faiss.SearchParametersIVFPQ(nprobe=10))
Note that the params=
is mandatory, not to confuse the search parameters with possible I
and D
output buffers that can also be provided.
In C++:
idx_t *I = ...
float *D = ...
faiss::SearchParametersIVFPQ params;
params.nprobe = 10;
index.search(nq, xq, 10, D, I, ¶ms);
SearchParameters can be nested, for example
params=faiss.SearchParametersIVFPQ(
nprobe=10,
quantizer_params=faiss.SearchParameterHNSW(efSearch=123)
)
is valid for an IVF1234_HNSW,...
index.
These functionalities are tested in test_search_params.py.
It is possible to select a subset of vectors, based on their ids, to search from.
Similar to the remove_ids
method, the subset is defined by an IDSelector
object.
The IDSelector::is_member(i)
returns true
if vector i
should be included in the search.
If the sel
field of SearchParameters
is non-null, the IDSelector
will be used to filter vectors at search time.
In C++, it is possible to define an IDSelector
subclass at will.
In Python this is also possible (see below) but inefficient.
By default, vectors are filtered using the is_member
method but some combinations of index types and selector types are optimized specifically.
This selects ids that are in range [imin, imax)
.
Specific optimizations:
-
for
IndexFlat
, ignores the data matrix outside of the[imin, imax)
range -
for
IndexIVF
with sorted ids, only the subsection of the inverted lists that contains the range is handled. The ids are sorted either if vectors are added withadd
or ifadd_with_ids
was called with increasing ids (does not reqyure to be strictly). This can be checked withindex_ivf.check_sorted_ids()
. -
TODO: support for all flat indexes.
These three IDSelectors
encapsulate a set of elements to process.
They differ by their properties, see below (n = nb of ids in index, k = nb of selected ids)
class | storage | lookup cost |
---|---|---|
IDSelectorArray | O(k) | O(k) |
IDSelectorBatch | O(k) | O(1) |
IDSelectorBitmap | O(n) | O(1) |
For IDSelectorArray
, the ids are just provided and stored as an array.
Therefore, merely calling is_member
can be slow.
However, for some indexes (currently IndexFlat
), it simply picks up the database elements to compare with, which is fast.
IDselectorBatch
combines a hashtable (unordered_set
) and a Bloom filter.
Thus the lookup is in constant time, but there is some memory overhead.
IDSelectorBitmap
takes an array with 1 bit per vector id.
It is particularly interesting when a significant subset of vectors needs to be handled and ids are sequential.
Then it is more compact and faster than IDselectorBatch
.
IDSelectorBatch
and IDSelectorArray
are constructed from an array of ids to store.
IDSelectorBitmap
takes the binary mask as an uint8
array as argument.
This reverts another IDSelector
sel = faiss.IDSelectorNot(faiss.IDSelectorBatch(list_of_ids))
ignores the ids from the list.
This wraps a Python function that implements is_member
.
This is inefficient but useful for debugging.
Faiss building blocks: clustering, PCA, quantization
Index IO, cloning and hyper parameter tuning
Threads and asynchronous calls
Inverted list objects and scanners
Indexes that do not fit in RAM
Brute force search without an index
Fast accumulation of PQ and AQ codes (FastScan)
Setting search parameters for one query
Binary hashing index benchmark