R1 Distill Qwen 32B

DeepSeek Generally Available

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, ach

Context
33K
tokens
Input
$0.29
per MTok
Output
$0.29
per MTok

About

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

Modalities

Input
Text
Output
Text

Code Examples

curl https://openrouter.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $OPENROUTER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-r1-distill-qwen-32b",
    "messages": [
      { "role": "user", "content": "Explain quantum entanglement in one sentence." }
    ]
  }'

API Parameters

Name Type Description
frequency_penalty number Penalize tokens by their frequency so far. Positive values reduce repetition.
include_reasoning boolean Include the model's internal reasoning trace in the response.
logprobs boolean Return log probabilities for each output token.
max_tokens deprecated integer Deprecated. Use max_completion_tokens.
presence_penalty number Penalize tokens that have appeared at all so far. Positive values encourage new topics.
reasoning object Configuration for extended-thinking / reasoning mode.
repetition_penalty number Penalize repeated tokens (>1.0 reduces repetition, <1.0 encourages it).
response_format one of Constrain output to a JSON schema or an enum (structured outputs).
seed integer Deterministic seed for sampling. Same seed + same prompt produces identical output.
stop array Up to 4 sequences where the API will stop generating tokens.
structured_outputs boolean Enable JSON-schema-constrained output.
temperature number Sampling temperature; higher values produce more random output. 0 is deterministic.
top_logprobs integer Return the top-N most likely tokens at each step (requires logprobs: true).
top_p number Nucleus sampling: consider only tokens whose cumulative probability ≥ top_p.

Standard OpenAI-compatible parameters. Consult the provider docs for model-specific behaviour.