# Qwen2.5 VL AWQ 7B

Qwen2.5 VL AWQ 7B is a 7-billion-parameter vision-language model, quantized with AWQ, offering advanced capabilities such as visual analysis, agentic reasoning, long video comprehension, visual localization, and structured output generation.

## Model details

| Category | Details |
|----------|---------|
| Model Name | Qwen2.5 VL |
| Version | AWQ 7B |
| Model Category | VLM |
| Size | 7B params |
| HuggingFace Model | Qwen/Qwen2.5-VL-7B-Instruct-AWQ |
| OpenAI Compatible endpoint | Chat Completions |
| License | Apache 2.0 |

## Capabilities

| Feature | Details |
|---------|---------|
| Tool Calling | |
| Azion Long-term Support (LTS) | |
| Context Length | 32k tokens |
| Supports LoRA | |
| Input data | Text + Image |

## Usage

### Basic chat completion

This is a basic chat completion example using this model:

```js
const modelResponse = await Azion.AI.run("qwen-qwen25-vl-7b-instruct-awq", {
  "stream": true,
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant."
    },
    {
      "role": "user",
      "content": "Name the European capitals"
    }
  ]
})
```
| Property | Type | Description |
|----------|------|-------------|
| `stream` | boolean | Indicates whether to stream the response. |
| `messages[]` | array | Array of message objects forming the conversation history. |
| `messages[].role` | string | The role of the message sender. |
| `messages[].content` | string | The content of the message. |

Response example:

```json
{
  "id": "chatcmpl-e27716424abf4b3f891ff4850470cb09",
  "object": "chat.completion",
  "created": 1746821581,
  "model": "qwen-qwen25-vl-7b-instruct-awq",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "reasoning_content": null,
        "content": "Sure! Here is a list of some European capitals...",
        "tool_calls": []
      },
      "logprobs": null,
      "finish_reason": "stop",
      "stop_reason": null
    }
  ],
  "usage": {
    "prompt_tokens": 9,
    "total_tokens": 527,
    "completion_tokens": 518,
    "prompt_tokens_details": null
  },
  "prompt_logprobs": null
}
```
| Property | Type | Description |
|----------|------|-------------|
| `id` | string | Unique identifier for the chat completion. |
| `object` | string | Type of the object. |
| `created` | number | Unix timestamp of when the response was created. |
| `model` | string | Identifier of the model that generated the response. |
| `choices` | array | Array containing possible completions. |
| `choices[]` | object | A single choice returned by the model. |
| `choices[].index` | number | Index of this choice in the list. |
| `choices[].message` | object | The message object returned as a response. |
| `choices[].message.role` | string | Role of the message sender. |
| `choices[].message.reasoning_content` | string | Field for reasoning metadata. |
| `choices[].message.content` | string | The textual content of the assistant's response. |
| `choices[].message.tool_calls` | array | Any tool calls made by the assistant. |
| `choices[].logprobs` | string | Log probability details. |
| `choices[].finish_reason` | string | Reason why the response was completed. |
| `choices[].stop_reason` | string | Reason why the response was stopped. |
| `usage` | object | Token usage statistics for the request. |
| `usage.prompt_tokens` | number | Number of tokens in the prompt. |
| `usage.total_tokens` | number | Total number of tokens. |
| `usage.completion_tokens` | number | Number of tokens in the model's response. |
| `usage.prompt_tokens_details` | string | Optional breakdown of prompt tokens. |
| `prompt_logprobs` | number | Overall log probabilities for the prompt. |
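Once the full (non-streamed) response arrives, the assistant's text lives at `choices[0].message.content`. A minimal sketch of pulling it out safely; `extractAssistantText` is a hypothetical helper, not part of the Azion SDK:

```javascript
// Hypothetical helper (not part of the Azion SDK): extract the assistant's
// text from a chat.completion response object, or null if it is missing.
function extractAssistantText(response) {
  const choice = response.choices && response.choices[0];
  if (!choice || !choice.message) return null;
  return choice.message.content;
}

// Shape mirrors the response example above (most fields omitted).
const sampleResponse = {
  choices: [
    {
      index: 0,
      message: {
        role: "assistant",
        content: "Sure! Here is a list of some European capitals..."
      },
      finish_reason: "stop"
    }
  ]
};

console.log(extractAssistantText(sampleResponse));
// → Sure! Here is a list of some European capitals...
```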

### Tool calling example

This is a tool calling example using this model:

```js
const modelResponse = await Azion.AI.run("qwen-qwen25-vl-7b-instruct-awq", {
  "stream": true,
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant with access to tools."
    },
    {
      "role": "user",
      "content": "What is the weather in London?"
    }
  ],
  "tools": [
    {
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get the current weather for a location",
        "parameters": {
          "type": "object",
          "properties": {
            "location": {
              "type": "string",
              "description": "The city and state"
            }
          },
          "required": ["location"]
        }
      }
    }
  ]
})
```
| Property | Type | Description |
|----------|------|-------------|
| `stream` | boolean | Whether to stream the response. |
| `messages[]` | array | List of messages in the conversation. |
| `messages[].role` | string | Role of the message sender. |
| `messages[].content` | string | The user's prompt. |
| `tools[]` | array | List of tools the assistant can use. |
| `tools[].type` | string | Type of tool being registered. |
| `tools[].function.name` | string | Name of the function. |
| `tools[].function.description` | string | Description of the function. |
| `tools[].function.parameters` | object | Schema for the function's parameters. |

Response example:

```json
{
  "id": "chatcmpl-88affc4730cf4219a06d2b15aad9ad44",
  "object": "chat.completion",
  "created": 1746821866,
  "model": "qwen-qwen25-vl-7b-instruct-awq",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "reasoning_content": null,
        "content": null,
        "tool_calls": [
          {
            "id": "chatcmpl-tool-fd3311e75aed4cbfbeb7244ced77379f",
            "type": "function",
            "function": {
              "name": "get_weather",
              "arguments": "{\"location\": \"London\"}"
            }
          }
        ]
      },
      "logprobs": null,
      "finish_reason": "tool_calls",
      "stop_reason": null
    }
  ],
  "usage": {
    "prompt_tokens": 293,
    "total_tokens": 313,
    "completion_tokens": 20,
    "prompt_tokens_details": null
  },
  "prompt_logprobs": null
}
```
| Property | Type | Description |
|----------|------|-------------|
| `id` | string | Unique identifier for the completion. |
| `object` | string | Type of object returned. |
| `created` | number | Unix timestamp of when the completion was created. |
| `model` | string | Model name used to generate the response. |
| `choices` | array | List of response choices returned by the model. |
| `choices[].index` | number | Index of this choice in the returned list. |
| `choices[].message.role` | string | Role of the message sender. |
| `choices[].message.reasoning_content` | string | Placeholder for reasoning content. |
| `choices[].message.content` | string | Main textual content of the assistant's response. |
| `choices[].message.tool_calls[]` | array | List of tools/functions the assistant intends to call. |
| `choices[].message.tool_calls[].id` | string | Unique identifier for this tool call. |
| `choices[].message.tool_calls[].type` | string | Type of the tool. |
| `choices[].message.tool_calls[].function.name` | string | Name of the function to be called. |
| `choices[].message.tool_calls[].function.arguments` | string | JSON string with arguments passed to the function. |
| `choices[].logprobs` | string | Log probability data. |
| `choices[].finish_reason` | string | Reason why the response was finished. |
| `choices[].stop_reason` | string | Reason why the response was stopped. |
| `usage.prompt_tokens` | number | Number of tokens used in the prompt. |
| `usage.total_tokens` | number | Total tokens used. |
| `usage.completion_tokens` | number | Tokens used in the model's response. |
| `usage.prompt_tokens_details` | string | Detailed prompt token usage. |
| `prompt_logprobs` | number | Log probabilities for prompt tokens. |
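When `finish_reason` is `"tool_calls"`, the `function.arguments` field is a JSON *string* that your code must parse before invoking the matching function, and each result goes back to the model as a `role: "tool"` message carrying the original `tool_call_id` (the `ToolMessage` shape in the JSON schema below). A sketch of that dispatch step; the `handlers` map and its weather stub are hypothetical application code, not part of the Azion SDK:

```javascript
// Sketch: turn a tool_calls response into role:"tool" follow-up messages.
// `handlers` maps function names to local implementations (hypothetical).
const handlers = {
  get_weather: ({ location }) => `Sunny in ${location}` // stand-in result
};

function runToolCalls(response) {
  const calls = response.choices[0].message.tool_calls || [];
  return calls.map((call) => ({
    role: "tool",                    // feed each result back as a tool message
    tool_call_id: call.id,           // echo the id so the model can match it
    content: String(handlers[call.function.name](JSON.parse(call.function.arguments)))
  }));
}

// Shape mirrors the tool-calling response example above.
const toolResponse = {
  choices: [{
    message: {
      role: "assistant",
      tool_calls: [{
        id: "chatcmpl-tool-fd3311e75aed4cbfbeb7244ced77379f",
        type: "function",
        function: { name: "get_weather", arguments: "{\"location\": \"London\"}" }
      }]
    },
    finish_reason: "tool_calls"
  }]
};

const toolMessages = runToolCalls(toolResponse);
// toolMessages[0].content === "Sunny in London"
```

These tool messages would be appended to the original `messages` array for a second `Azion.AI.run` call so the model can produce its final answer.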

### Multimodal (text + image) example

This is a multimodal example using this model:

```js
const modelResponse = await Azion.AI.run("qwen-qwen25-vl-7b-instruct-awq", {
  "stream": true,
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant."
    },
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What is in this image?"
        },
        {
          "type": "image_url",
          "image_url": {
            "url": "https://example.com/image.jpg"
          }
        }
      ]
    }
  ]
})
```
| Property | Type | Description |
|----------|------|-------------|
| `stream` | boolean | Indicates whether the response should be streamed. |
| `messages[]` | array | Array of messages in the conversation. |
| `messages[].role` | string | Role of the message sender. |
| `messages[].content[]` | array | Array of content blocks. |
| `messages[].content[].type` | string | Type of content block. |
| `messages[].content[].text` | string | The actual text content. |
| `messages[].content[].image_url` | object | Object containing the image URL. |
| `messages[].content[].image_url.url` | string | URL of the image to be analyzed. |

The response will be similar to the one in the Basic Chat Completion example.
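A multimodal user message is just an array of `{ type: "text" }` and `{ type: "image_url" }` content blocks, as shown above. A small hypothetical builder (not part of the Azion SDK) that assembles one from a prompt and a list of image URLs:

```javascript
// Hypothetical convenience builder (not part of the Azion SDK): create a
// multimodal user message from a text prompt plus zero or more image URLs.
function userMessage(text, imageUrls = []) {
  return {
    role: "user",
    content: [
      { type: "text", text },
      ...imageUrls.map((url) => ({ type: "image_url", image_url: { url } }))
    ]
  };
}

const multimodalMsg = userMessage("What is in this image?", [
  "https://example.com/image.jpg"
]);
// multimodalMsg.content has one text block followed by one image_url block
```

The returned object can be placed directly into the `messages` array of the request.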

## JSON schema

```json
{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "type": "object",
  "required": ["messages"],
  "properties": {
    "messages": {
      "type": "array",
      "items": { "$ref": "#/components/schemas/Message" }
    },
    "temperature": { "type": "number", "minimum": 0, "maximum": 2 },
    "top_p": { "type": "number", "minimum": 0, "maximum": 1, "default": 1 },
    "n": { "type": "integer", "minimum": 1, "default": 1 },
    "stream": { "type": "boolean", "default": false },
    "max_tokens": { "type": "integer", "minimum": 1 },
    "presence_penalty": { "type": "number", "minimum": -2, "maximum": 2, "default": 0 },
    "frequency_penalty": { "type": "number", "minimum": -2, "maximum": 2, "default": 0 },
    "tools": {
      "type": "array",
      "items": { "$ref": "#/components/schemas/ToolDefinition" }
    }
  },
  "components": {
    "schemas": {
      "Message": {
        "oneOf": [
          { "$ref": "#/components/schemas/SystemMessage" },
          { "$ref": "#/components/schemas/UserMessage" },
          { "$ref": "#/components/schemas/AssistantMessage" },
          { "$ref": "#/components/schemas/ToolMessage" }
        ]
      },
      "SystemMessage": {
        "type": "object",
        "required": ["role", "content"],
        "properties": {
          "role": { "type": "string", "enum": ["system"] },
          "content": { "$ref": "#/components/schemas/TextContent" }
        }
      },
      "UserMessage": {
        "type": "object",
        "required": ["role", "content"],
        "properties": {
          "role": { "type": "string", "enum": ["user"] },
          "content": {
            "oneOf": [
              { "type": "string" },
              {
                "type": "array",
                "items": {
                  "oneOf": [
                    { "$ref": "#/components/schemas/TextContentItem" },
                    { "$ref": "#/components/schemas/ImageContentItem" }
                  ]
                }
              }
            ]
          }
        }
      },
      "AssistantMessage": {
        "oneOf": [
          { "$ref": "#/components/schemas/AssistantMessageWithoutToolCalls" },
          { "$ref": "#/components/schemas/AssistantMessageWithToolCalls" }
        ]
      },
      "ToolMessage": {
        "type": "object",
        "required": ["role", "content", "tool_call_id"],
        "properties": {
          "role": { "enum": ["tool"] },
          "content": { "type": "string" },
          "tool_call_id": { "type": "string" }
        }
      },
      "AssistantMessageWithoutToolCalls": {
        "type": "object",
        "required": ["role", "content"],
        "properties": {
          "role": { "type": "string", "enum": ["assistant"] },
          "content": { "$ref": "#/components/schemas/TextContent" }
        },
        "not": {
          "required": ["tool_calls"]
        }
      },
      "AssistantMessageWithToolCalls": {
        "type": "object",
        "required": ["role", "tool_calls"],
        "properties": {
          "role": { "type": "string", "enum": ["assistant"] },
          "tool_calls": {
            "type": "array",
            "items": { "$ref": "#/components/schemas/ToolCalls" }
          }
        }
      },
      "TextContent": {
        "oneOf": [
          { "type": "string" },
          {
            "type": "array",
            "items": { "$ref": "#/components/schemas/TextContentItem" }
          }
        ],
        "description": "Text content that can be provided either as a simple string or as an array of TextContentItem objects"
      },
      "ImageContent": {
        "type": "array",
        "items": { "$ref": "#/components/schemas/ImageContentItem" }
      },
      "TextContentItem": {
        "type": "object",
        "required": ["type", "text"],
        "properties": {
          "type": { "type": "string", "enum": ["text"] },
          "text": { "type": "string" }
        }
      },
      "ImageContentItem": {
        "type": "object",
        "required": ["type", "image_url"],
        "properties": {
          "type": { "type": "string", "enum": ["image_url"] },
          "image_url": {
            "type": "object",
            "required": ["url"],
            "properties": {
              "url": { "type": "string", "format": "uri" }
            }
          }
        }
      },
      "ToolCalls": {
        "type": "object",
        "required": ["function", "id", "type"],
        "properties": {
          "function": {
            "type": "object",
            "required": ["name", "arguments"],
            "properties": {
              "name": { "type": "string" },
              "arguments": { "type": "string" }
            }
          },
          "id": { "type": "string" },
          "type": { "enum": ["function"] }
        },
        "description": "The name and arguments of a function that should be called, as generated by the model."
      },
      "ToolDefinition": {
        "type": "object",
        "required": ["type", "function"],
        "properties": {
          "type": { "type": "string", "enum": ["function"] },
          "function": {
            "type": "object",
            "required": ["name"],
            "properties": {
              "name": { "type": "string" },
              "description": { "type": "string" },
              "parameters": { "type": "object", "additionalProperties": true },
              "strict": { "type": "boolean", "default": false }
            }
          }
        },
        "description": "Definition of a tool that can be used by the model"
      }
    }
  }
}
```
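The schema's top-level constraints (required `messages`, `temperature` in [0, 2], `top_p` in [0, 1], `n` ≥ 1) can be checked client-side before sending a request. A minimal hand-rolled sketch of just those checks; a real implementation would use a full JSON Schema validator library rather than this hypothetical helper:

```javascript
// Minimal sketch (hypothetical, not part of the Azion SDK): validate a
// request body against the top-level constraints of the schema above.
// Returns an array of human-readable errors; empty means the checks passed.
function validateRequest(body) {
  const errors = [];
  if (!Array.isArray(body.messages)) {
    errors.push("messages: required and must be an array");
  }
  if (body.temperature !== undefined &&
      (typeof body.temperature !== "number" || body.temperature < 0 || body.temperature > 2)) {
    errors.push("temperature: must be a number between 0 and 2");
  }
  if (body.top_p !== undefined &&
      (typeof body.top_p !== "number" || body.top_p < 0 || body.top_p > 1)) {
    errors.push("top_p: must be a number between 0 and 1");
  }
  if (body.n !== undefined && (!Number.isInteger(body.n) || body.n < 1)) {
    errors.push("n: must be an integer of at least 1");
  }
  return errors;
}

const okErrors = validateRequest({
  messages: [{ role: "user", content: "Name the European capitals" }],
  temperature: 0.7
});
// okErrors is [] — the request passes the top-level checks

const badErrors = validateRequest({ temperature: 3 });
// badErrors flags both the missing messages array and the out-of-range temperature
```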