feat: add initial qwen2.5-vl model and test #2971

drbh · 2025-01-30T17:50:03Z

This PR adds support for qwen2.5-vl models. It currently loads the weights and returns reasonable responses. Opening early for exposure and feedback.

These changes depend on #2943 and must be rebased and merged after it is merged.

items

small reproducible example:

text-generation-launcher --model-id Qwen/Qwen2.5-VL-3B-Instruct

script

import requests
import json

url = "http://127.0.0.1:3000/generate"
headers = {"Content-Type": "application/json"}
image_urls = [
    "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg",
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/tgi/rabbit.png",
]

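for image in image_urls:
    query = "Describe the image"
    # The image is passed inline as a markdown image link inside the user turn.
    payload = {
        "inputs": f"<|im_start|>system\nYou are a helpful assistant.<|im_end|>\n<|im_start|>user\n![]({image}){query}<|im_end|>\n<|im_start|>assistant\n",
        "parameters": {"max_new_tokens": 50},
    }
    response = requests.post(url, headers=headers, json=payload)
    # Fail loudly on HTTP errors instead of printing an error body as if it were a result.
    response.raise_for_status()
    print(json.dumps(response.json(), indent=4))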

output

{
    "generated_text": "The image showcases the iconic Statue of Liberty in New York City, with the New York City skyline in the background. The statue is a large, green-colored sculpture on a stone pedestal, with the American flag on a flagpole in the foreground. The"
}
{
    "generated_text": "The image depicts a character in a space suit, set in a rocky, desert-like environment with a warm, orange hue. The character is a large, brown, and white rabbit with long, pointed ears and a small, beak-like nose."
}
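
The prompt in the script is written by hand, which makes the turn markers easy to mistype. A minimal sketch of the same prompt built by a helper follows. `build_prompt` and `build_payload` are hypothetical names used only for illustration and are not part of this PR. The template is copied from the script above, not taken from the model's tokenizer config.

```python
def build_prompt(
    image_url: str,
    query: str,
    system: str = "You are a helpful assistant.",
) -> str:
    """Wrap one image and one question in the turn markers used in the script above."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        # The image is embedded as a markdown image link directly before the question.
        f"<|im_start|>user\n![]({image_url}){query}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )


def build_payload(image_url: str, query: str, max_new_tokens: int = 50) -> dict:
    """Build the JSON body for the /generate request shown in the script above."""
    return {
        "inputs": build_prompt(image_url, query),
        "parameters": {"max_new_tokens": max_new_tokens},
    }
```

With these helpers, the loop body in the script reduces to `requests.post(url, headers=headers, json=build_payload(image, query))`.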

HuggingFaceDocBuilderDev · 2025-01-31T17:48:17Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

feat: support qwen2.5 vl model

e9b5806

drbh force-pushed the add-qwen25vl-support branch from 1adfee4 to e9b5806 on January 31, 2025 17:36

fix: bump support models doc

17c93ff
