The following are change log of each versions.
- Add docker image build
- Support list input of text on embeddings API
- Support downloading model from huggingface
- Support auto-batching on chat API
- Support
top_p
,temperature
andseed
parameters in chat API - Update OpenVINO to 2024.3.0
- Update IPEX-LLM to 2.1.0
- Add IPEX-LLM backend
- Support InternVL2 on IPEX-LLM backend with OpenAI chat completion image input
- Support Qwen2 tool calling on IPEX-LLM and OpenVINO backend with OpenAI chat completion tools input
- Support embedding models on IPEX-LLM and OpenVINO backend with OpenAI embedding API
- Support parallel completion requests: concurrent completion requests can be submit on both OpenVINO and IPEX-LLM backends (not batching)
- Add README and changelog
Initial release with OpenVINO support and basic OpenAI chat completion API.