About training, history data and daily updated data. #1858
Replies: 3 comments
-
If the distribution of your data does not change, there is no need to re-train the index. |
Beta Was this translation helpful? Give feedback.
-
If there is some change of the distribution everday, and I need train the index everyday, should I sample the training data from {all history data and the daily updated data} instead of training only from the sample of {the daily updated data}? |
Beta Was this translation helpful? Give feedback.
-
We can't give a generic answer to this question: it depends by how much the data distribution changes! |
Beta Was this translation helpful? Give feedback.
-
Running on:
Interface:
First we add history data (about 150kk) into the index, and randomly select some to train the index.
And then we have newly updated data everyday (about 400k/day).
How should I train the index everyday? Should I only randomly select {the daily updated data} for training, or randomly select {the daily updated data and all history data}for training?
Beta Was this translation helpful? Give feedback.
All reactions