
How to use vLLM? #780

Open
yikchunnnn opened this issue Oct 11, 2024 · 3 comments
Labels: documentation (Improvements or additions to documentation)

Comments

@yikchunnnn

The README only mentions 'pip install vllm...'.

May I ask how I should proceed to enable vLLM?

Do I need to do anything after installing vLLM, or will ChatTTS automatically use vLLM when it detects that it is available?

Thank you in advance.

fumiama added the documentation label on Oct 15, 2024
@fumiama (Member) commented Oct 15, 2024

vLLM support is still under test and development, and ONLY BASIC INFERENCE is available for now. It will not be enabled by default; if you are a developer, you can easily find out how to enable it by looking at the parameters of Chat.load.
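
For reference, enabling it might look roughly like the sketch below. This is an assumption-laden example: the keyword name `use_vllm` is a guess based on this comment, so check the actual `Chat.load` signature in your installed ChatTTS version before relying on it.

```python
import ChatTTS

chat = ChatTTS.Chat()

# Hypothetical sketch: `use_vllm` is assumed here; the real flag (if any) is
# whatever Chat.load exposes in your ChatTTS version. vLLM must already be
# installed (`pip install vllm`) and a CUDA-capable GPU must be available.
chat.load(compile=False, use_vllm=True)

# Only basic inference is supported, per the comment above.
wavs = chat.infer(["Hello from the vLLM backend."])
```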

@the-nine-nation

Using vLLM leads to an error: ImportError: cannot import name 'LogicalTokenBlock' from 'vllm.block' (/root/miniconda3/envs/py39/lib/python3.9/site-packages/vllm/block.py)
How can I fix it?
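
For context, this kind of ImportError usually points to a version mismatch: `LogicalTokenBlock` no longer exists in `vllm.block` in newer vLLM releases, so the installed vLLM is probably newer than the version the integration was written against. A small diagnostic sketch (not a fix) to check what your environment exposes:

```python
# Diagnostic sketch: report the installed vLLM version and whether the symbol
# the integration imports is still present. If it is missing, pinning vLLM to
# an older release (one matching the project's requirements) is the usual
# workaround.
import vllm

print("vLLM version:", vllm.__version__)

try:
    from vllm.block import LogicalTokenBlock  # noqa: F401
    print("LogicalTokenBlock is available in this vLLM release")
except ImportError:
    print("LogicalTokenBlock has been removed from this vLLM release")
```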

@superstring

vLLM support is still under test and development, and ONLY BASIC INFERENCE is available for now. It will not be enabled by default; if you are a developer, you can easily find out how to enable it by looking at the parameters of Chat.load.

Does this mean that zero-shot infer is not supported yet?
