You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fixed \n replacement with spaces -> will improve some top keywords quality
the chunker system will fix "tensor size" issues, and therefore increase the data output (instead of losing some batches once in a while)
improved pre_install procedure to have 2 more models in the docker base image
added tiktoken (OpenAI gpt3 tokenizer) library to count (& print) the number of tokens for each item, to help decide if the client has to split an item in several pieces (paragraphs)