New visual captioning model
Whisper Turbo model
other updates, if any
speed test
Speed for processing 1000 images:
Blip: 68 seconds (0.068 seconds per image)
qwen2-2b: 501 seconds (0.501 seconds per image)
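For reference, the per-image figures and the relative slowdown follow directly from the totals above. This is a minimal sketch of that arithmetic only; the helper name `per_image_seconds` is hypothetical and not part of the project.

```python
def per_image_seconds(total_seconds: float, num_images: int) -> float:
    """Average wall-clock time per image, in seconds."""
    return total_seconds / num_images


NUM_IMAGES = 1000  # from the comment above

# Totals reported in the comment above.
blip_per_image = per_image_seconds(68, NUM_IMAGES)
qwen_per_image = per_image_seconds(501, NUM_IMAGES)

# How many times slower qwen2-2b is than Blip on this run.
slowdown = qwen_per_image / blip_per_image

print(f"Blip:     {blip_per_image:.3f} s/image")
print(f"qwen2-2b: {qwen_per_image:.3f} s/image")
print(f"slowdown: {slowdown:.2f}x")
```

On these numbers qwen2-2b is roughly 7.4 times slower per image than Blip.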
We need to investigate the speed. The current qwen speed seems to be too low.
With max_ongoing_requests=1000 and batch_size=2: 286.45805220138624 tokens/sec (0.1256728506088257 per image)
Using the built-in batching of vllm requests: 514.784160764785 tokens/sec (0.21983970880508422 per image)
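One thing worth checking when comparing the two runs: if the "per image" value is seconds per image (an assumption, since the unit is not stated), then multiplying it by the tokens/sec figure gives the implied average number of generated tokens per image. A rough sketch of that check, using only the numbers reported above:

```python
# (tokens per second, assumed seconds per image) as reported above.
runs = {
    "max_ongoing_requests=1000, batch_size=2": (286.45805220138624, 0.1256728506088257),
    "vllm built-in batching": (514.784160764785, 0.21983970880508422),
}

implied_tokens = {}
for name, (tokens_per_sec, sec_per_image) in runs.items():
    # tokens/sec * sec/image = tokens/image (only valid under the unit assumption)
    implied_tokens[name] = tokens_per_sec * sec_per_image
    print(f"{name}: ~{implied_tokens[name]:.1f} tokens per image")
```

Under that assumption the two runs generated different caption lengths (about 36 versus about 113 tokens per image), so the per-image times are not directly comparable; tokens/sec is the fairer metric between them.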
HRashidi