# Mistral 3 Small (24B AWQ)

Mistral 3 Small is a language model that delivers capabilities comparable to larger models while being compact. It’s ideal for conversational agents, function calling, fine-tuning, and local inference with sensitive data.

## Model details

| Category | Details |
|---|---|
| Model Name | Mistral 3 Small |
| Version | 24B AWQ |
| Model Category | Large Language Model (LLM) |
| Size | 24B parameters |
| HuggingFace Model | `casperhansen/mistral-small-24b-instruct-2501-awq` |
| OpenAI Compatible Endpoint | Chat Completions |
| License | Apache 2.0 |

## Capabilities

| Feature | Details |
|---|---|
| Tool Calling | |
| Azion Long-term Support (LTS) | |
| Context Length | 32k tokens |
| Supports LoRA | |
| Input Data | Text |

## Usage

### Basic chat completion

This is an example of a basic chat completion request using this model:

```js
const modelResponse = await Azion.AI.run("casperhansen-mistral-small-24b-instruct-2501-awq", {
  "stream": true,
  "max_tokens": 1024,
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant."
    },
    {
      "role": "user",
      "content": "Name the European capitals"
    }
  ]
})
```
| Property | Type | Description |
|---|---|---|
| `stream` | boolean | Indicates whether the response should be streamed. |
| `max_tokens` | number | The maximum number of tokens in the response. |
| `messages` | array | An array of message objects containing the role and content of each message. |
| `messages[].role` | string | The role of the message sender. |
| `messages[].content` | string | The content of the message. |

Response example:

```json
{
  "id": "chatcmpl-e27716424abf4b3f891ff4850470cb09",
  "object": "chat.completion",
  "created": 1746821581,
  "model": "casperhansen-mistral-small-24b-instruct-2501-awq",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "reasoning_content": null,
        "content": "Sure! Here is a list of some European capitals...",
        "tool_calls": []
      },
      "logprobs": null,
      "finish_reason": "stop",
      "stop_reason": null
    }
  ],
  "usage": {
    "prompt_tokens": 9,
    "total_tokens": 527,
    "completion_tokens": 518,
    "prompt_tokens_details": null
  },
  "prompt_logprobs": null
}
```
| Property | Type | Description |
|---|---|---|
| `id` | string | Unique identifier for the response. |
| `object` | string | The type of object returned in the response. |
| `created` | number | Timestamp of when the response was created. |
| `model` | string | The name of the model used for the request. |
| `choices` | array | Array of objects containing the response choices. |
| `usage` | object | Object containing usage metrics for the request. |
| `prompt_logprobs` | number | Log probabilities of the prompt. |
| `choices[].index` | number | Index of the choice in the array. |
| `choices[].message` | object | Object containing the message details. |
| `choices[].message.reasoning_content` | string | The reasoning content of the message. |
| `choices[].message.role` | string | The role of the message sender. |
| `choices[].message.content` | string | The content of the message. |
| `choices[].message.tool_calls` | array | Array of tool call objects. |
| `choices[].logprobs` | number | Log probabilities of the choice. |
| `choices[].finish_reason` | string | The reason the choice finished. |
| `choices[].stop_reason` | string | The reason the choice was stopped. |
| `usage.prompt_tokens` | number | The number of tokens in the input prompt. |
| `usage.total_tokens` | number | The total number of tokens processed. |
| `usage.completion_tokens` | number | The number of tokens in the completion. |
| `usage.prompt_tokens_details` | string | Additional details about the prompt tokens. |
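As an illustrative sketch of consuming this response shape (not part of the Azion SDK), the helper below pulls the assistant text, finish reason, and token usage out of a non-streamed completion object like the one above:

```javascript
// Illustrative helper: extract the assistant reply and usage metrics
// from a non-streamed chat completion response object.
function summarizeCompletion(response) {
  const choice = response.choices?.[0];
  return {
    text: choice?.message?.content ?? "",
    finishReason: choice?.finish_reason ?? null,
    totalTokens: response.usage?.total_tokens ?? 0
  };
}

// Sample data, trimmed from the response example above.
const sampleResponse = {
  choices: [
    {
      index: 0,
      message: { role: "assistant", content: "Sure! Here is a list of some European capitals..." },
      finish_reason: "stop"
    }
  ],
  usage: { prompt_tokens: 9, total_tokens: 527, completion_tokens: 518 }
};
```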

### Tool Calling example

This is an example of a tool calling request using this model:

```js
const modelResponse = await Azion.AI.run("casperhansen-mistral-small-24b-instruct-2501-awq", {
  "stream": true,
  "max_tokens": 1024,
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful assistant with access to tools."
    },
    {
      "role": "user",
      "content": "What is the weather in London?"
    }
  ],
  "tools": [
    {
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get the current weather for a location",
        "parameters": {
          "type": "object",
          "properties": {
            "location": {
              "type": "string",
              "description": "The city and state"
            }
          },
          "required": [
            "location"
          ]
        }
      }
    }
  ]
})
```
| Property | Type | Description |
|---|---|---|
| `stream` | boolean | Indicates whether to stream the response. |
| `max_tokens` | number | The maximum number of tokens to generate in the response. |
| `messages` | array | List of messages in the conversation. |
| `messages[].role` | string | The role of the message sender. |
| `messages[].content` | string | The content of the message. |
| `tools` | array | A list of tools or functions the model can call. |
| `tools[].type` | string | The type of the tool. |
| `tools[].function` | object | Metadata about the function being defined. |
| `tools[].function.name` | string | Name of the function. |
| `tools[].function.description` | string | Description of what the function does. |
| `tools[].function.parameters` | object | JSON Schema describing the function's parameters. |

Response example:

```json
{
  "id": "chatcmpl-88affc4730cf4219a06d2b15aad9ad44",
  "object": "chat.completion",
  "created": 1746821866,
  "model": "casperhansen-mistral-small-24b-instruct-2501-awq",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "reasoning_content": null,
        "content": null,
        "tool_calls": [
          {
            "id": "chatcmpl-tool-fd3311e75aed4cbfbeb7244ced77379f",
            "type": "function",
            "function": {
              "name": "get_weather",
              "arguments": "{\"location\": \"London\"}"
            }
          }
        ]
      },
      "logprobs": null,
      "finish_reason": "tool_calls",
      "stop_reason": null
    }
  ],
  "usage": {
    "prompt_tokens": 293,
    "total_tokens": 313,
    "completion_tokens": 20,
    "prompt_tokens_details": null
  },
  "prompt_logprobs": null
}
```
| Property | Type | Description |
|---|---|---|
| `id` | string | Unique identifier for the response. |
| `object` | string | The type of object returned in the response. |
| `created` | number | Timestamp of when the response was created. |
| `model` | string | The name of the model used for the request. |
| `choices` | array | Array of objects containing the response choices. |
| `usage` | object | Object containing usage metrics for the request. |
| `choices[].index` | number | Index of the choice in the array. |
| `choices[].message` | object | Object containing the message details. |
| `choices[].message.role` | string | The role of the message sender. |
| `choices[].message.reasoning_content` | string | The reasoning content of the message. |
| `choices[].message.content` | string | The content of the message. |
| `choices[].message.tool_calls` | array | Array of tool call objects. |
| `choices[].message.tool_calls[].id` | string | Unique identifier for the tool call. |
| `choices[].message.tool_calls[].type` | string | The type of tool call. |
| `choices[].message.tool_calls[].function` | object | Object containing the function details. |
| `choices[].message.tool_calls[].function.name` | string | The name of the function. |
| `choices[].message.tool_calls[].function.arguments` | string | The arguments passed to the function. |
| `usage.prompt_tokens` | number | The number of tokens in the input prompt. |
| `usage.total_tokens` | number | The total number of tokens processed. |
| `usage.completion_tokens` | number | The number of tokens in the completion. |
| `usage.prompt_tokens_details` | string | Additional details about the prompt tokens. |
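When `finish_reason` is `tool_calls`, the caller executes the requested function locally and sends the result back as a `role: "tool"` message carrying the matching `tool_call_id`. The sketch below (illustrative; `localTools` and its `get_weather` implementation are assumptions, not part of any SDK) turns the `tool_calls` array from the response above into follow-up messages ready to append to `messages` for the next request:

```javascript
// Hypothetical local implementations keyed by tool name.
const localTools = {
  get_weather: ({ location }) => `Sunny in ${location}`
};

// Convert each tool_call from a response into a role "tool" message:
// parse the JSON-encoded arguments, run the local function, and pair
// the result with the originating tool_call_id.
function buildToolMessages(response) {
  const calls = response.choices[0].message.tool_calls ?? [];
  return calls.map((call) => ({
    role: "tool",
    tool_call_id: call.id,
    content: String(localTools[call.function.name](JSON.parse(call.function.arguments)))
  }));
}

// Sample data, trimmed from the tool-calling response example above.
const toolCallResponse = {
  choices: [{
    message: {
      role: "assistant",
      content: null,
      tool_calls: [{
        id: "chatcmpl-tool-fd3311e75aed4cbfbeb7244ced77379f",
        type: "function",
        function: { name: "get_weather", arguments: "{\"location\": \"London\"}" }
      }]
    },
    finish_reason: "tool_calls"
  }]
};
```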

## JSON schema

```json
{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "type": "object",
  "required": [
    "messages"
  ],
  "properties": {
    "messages": {
      "type": "array",
      "items": {
        "$ref": "#/components/schemas/Message"
      }
    },
    "temperature": {
      "type": "number",
      "minimum": 0,
      "maximum": 2
    },
    "top_p": {
      "type": "number",
      "minimum": 0,
      "maximum": 1,
      "default": 1
    },
    "n": {
      "type": "integer",
      "minimum": 1,
      "default": 1
    },
    "stream": {
      "type": "boolean",
      "default": false
    },
    "max_tokens": {
      "type": "integer",
      "minimum": 1
    },
    "presence_penalty": {
      "type": "number",
      "minimum": -2,
      "maximum": 2,
      "default": 0
    },
    "frequency_penalty": {
      "type": "number",
      "minimum": -2,
      "maximum": 2,
      "default": 0
    },
    "tools": {
      "type": "array",
      "items": {
        "$ref": "#/components/schemas/ToolDefinition"
      }
    }
  },
  "components": {
    "schemas": {
      "Message": {
        "oneOf": [
          { "$ref": "#/components/schemas/SystemMessage" },
          { "$ref": "#/components/schemas/UserMessage" },
          { "$ref": "#/components/schemas/AssistantMessage" },
          { "$ref": "#/components/schemas/ToolMessage" }
        ]
      },
      "SystemMessage": {
        "type": "object",
        "required": ["role", "content"],
        "properties": {
          "role": {
            "type": "string",
            "enum": ["system"]
          },
          "content": {
            "$ref": "#/components/schemas/TextContent"
          }
        }
      },
      "UserMessage": {
        "type": "object",
        "required": ["role", "content"],
        "properties": {
          "role": {
            "type": "string",
            "enum": ["user"]
          },
          "content": {
            "oneOf": [
              { "type": "string" },
              {
                "type": "array",
                "items": {
                  "oneOf": [
                    { "$ref": "#/components/schemas/TextContentItem" }
                  ]
                }
              }
            ]
          }
        }
      },
      "AssistantMessage": {
        "oneOf": [
          { "$ref": "#/components/schemas/AssistantMessageWithoutToolCalls" },
          { "$ref": "#/components/schemas/AssistantMessageWithToolCalls" }
        ]
      },
      "ToolMessage": {
        "type": "object",
        "required": ["role", "content", "tool_call_id"],
        "properties": {
          "role": {
            "enum": ["tool"]
          },
          "content": {
            "type": "string"
          },
          "tool_call_id": {
            "type": "string"
          }
        }
      },
      "AssistantMessageWithoutToolCalls": {
        "type": "object",
        "required": ["role", "content"],
        "properties": {
          "role": {
            "type": "string",
            "enum": ["assistant"]
          },
          "content": {
            "$ref": "#/components/schemas/TextContent"
          }
        },
        "not": {
          "required": ["tool_calls"]
        }
      },
      "AssistantMessageWithToolCalls": {
        "type": "object",
        "required": ["role", "tool_calls"],
        "properties": {
          "role": {
            "type": "string",
            "enum": ["assistant"]
          },
          "tool_calls": {
            "type": "array",
            "items": {
              "$ref": "#/components/schemas/ToolCalls"
            }
          }
        }
      },
      "TextContent": {
        "oneOf": [
          { "type": "string" },
          {
            "type": "array",
            "items": {
              "$ref": "#/components/schemas/TextContentItem"
            }
          }
        ],
        "description": "Text content that can be provided either as a simple string or as an array of TextContentItem objects"
      },
      "TextContentItem": {
        "type": "object",
        "required": ["type", "text"],
        "properties": {
          "type": {
            "type": "string",
            "enum": ["text"]
          },
          "text": {
            "type": "string"
          }
        }
      },
      "ToolCalls": {
        "type": "object",
        "required": ["function", "id", "type"],
        "properties": {
          "function": {
            "type": "object",
            "required": ["name", "arguments"],
            "properties": {
              "name": {
                "type": "string"
              },
              "arguments": {
                "type": "string"
              }
            }
          },
          "id": {
            "type": "string"
          },
          "type": {
            "enum": ["function"]
          }
        },
        "description": "The name and arguments of a function that should be called, as generated by the model."
      },
      "ToolDefinition": {
        "type": "object",
        "required": ["type", "function"],
        "properties": {
          "type": {
            "type": "string",
            "enum": ["function"]
          },
          "function": {
            "type": "object",
            "required": ["name"],
            "properties": {
              "name": {
                "type": "string"
              },
              "description": {
                "type": "string"
              },
              "parameters": {
                "type": "object",
                "additionalProperties": true
              },
              "strict": {
                "type": "boolean",
                "default": false
              }
            }
          }
        },
        "description": "Definition of a tool that can be used by the model"
      }
    }
  }
}
```
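As a quick sanity check before sending a request, the snippet below mirrors a few of the schema's constraints in plain JavaScript (required `messages`, the four role values, and the `temperature`/`top_p` ranges). It is an illustrative sketch, not a full JSON Schema validator:

```javascript
// Returns a list of constraint violations (empty when the request body
// passes these basic checks drawn from the schema above).
function checkRequest(body) {
  const errors = [];
  if (!Array.isArray(body.messages)) {
    errors.push("messages is required and must be an array");
  } else {
    const roles = new Set(["system", "user", "assistant", "tool"]);
    body.messages.forEach((m, i) => {
      if (!roles.has(m.role)) errors.push(`messages[${i}].role is invalid`);
    });
  }
  if (body.temperature !== undefined && (body.temperature < 0 || body.temperature > 2)) {
    errors.push("temperature must be between 0 and 2");
  }
  if (body.top_p !== undefined && (body.top_p < 0 || body.top_p > 1)) {
    errors.push("top_p must be between 0 and 1");
  }
  return errors;
}
```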