Qwen QwQ-32B
oah/qwqDeploy Qwen QwQ-32B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
by Alibaba Cloud (Open Source)
Alibaba's Qwen family offers strong multilingual performance with a particular edge in Chinese and Asian languages. Compare Qwen API pricing and Qwen 2.5 cost across providers. Qwen 2.5 brings competitive performance on English benchmarks while maintaining multilingual excellence.
Every Qwen request is scanned for 28+ PII entity types — SSNs, credit cards, emails, API keys, and more — before it reaches any provider.
Qwen is available across 3 providers. Our Smart Router picks the cheapest one per-request. 25% managed markup / 0% on Pro BYOK.
Change two lines in your OpenAI SDK — base_url and api_key — and every request flows through ModelGate. Full backward compatibility.
Per-request logging of token counts, latency, DLP violations, and cost. Never wonder what your AI spend is again.
oah/qwqDeploy Qwen QwQ-32B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2-1.5bDeploy Qwen 2 (1.5B) with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2Deploy Qwen 2 (72B) with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2-vlDeploy Qwen2-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5-1.5bDeploy Qwen2.5 1.5B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5Deploy Qwen2.5 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5-coderDeploy Qwen 2.5 Coder 32B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5-vlDeploy Qwen2.5-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-0.6bDeploy Qwen3 0.6B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-0.6b-baseDeploy Qwen3 0.6B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-1.7bDeploy Qwen3 1.7B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-1.7b-baseDeploy Qwen3 1.7B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-14b-baseDeploy Qwen3 14B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3Deploy Qwen/Qwen3-235B-A22B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-235b-a22b-instruct-2507-tputDeploy Qwen3 235B A22B Instruct 2507 FP8 Throughput with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-235b-a22b-thinkingDeploy Qwen3 235B A22B Thinking 2507 FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-30b-a3b-baseDeploy Qwen3 30B A3b Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-30b-a3b-instruct-2507-loraDeploy Qwen3 30B A3B Instruct 2507 Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-4b-baseDeploy Qwen3 4B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-8b-baseDeploy Qwen3 8B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-8b-loraDeploy Qwen3 8B Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-coderDeploy Qwen3 Coder 30B A3b Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-coder-nextDeploy Qwen3 Coder Next Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-nextDeploy Qwen3 Next 80B A3b Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-next-80b-a3b-thinkingDeploy Qwen3 Next 80B A3b Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-vlDeploy Qwen3-VL-235B-A22B-Instruct-FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.5Deploy Qwen3.5 35B A3b with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-2-1.5bDeploy Arize AI Qwen 2 1.5B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/cogito-v1-preview-qwenDeploy Cogito V1 Preview Qwen 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-image-editDeploy Qwen/Qwen-Image-Edit with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-image-edit-maxDeploy Qwen/Qwen-Image-Edit-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-image-maxDeploy Qwen/Qwen-Image-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-maxDeploy Qwen/Qwen3-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-max-thinkingDeploy Qwen/Qwen3-Max-Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.5-0.8bDeploy Qwen/Qwen3.5-0.8B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
Input / Output pricing by provider. Managed Mode adds a 25% managed markup. Pro BYOK = 0% markup.
| Model | Params | Context | Vision | Together.ai | DeepInfra | Groq |
|---|---|---|---|---|---|---|
Qwen QwQ-32B oah/qwq | — | 131K | No | $1.20/$1.20 | — | — |
Qwen 2 (1.5B) oah/qwen2-1.5b | — | 33K | No | $0.02/$0.02 | — | — |
Qwen 2 (72B) oah/qwen2 | — | 33K | No | Free/Free | — | — |
Qwen2-VL (72B) Instruct oah/qwen2-vl | — | 33K | No | $1.20/$1.20 | — | — |
Qwen2.5 1.5B oah/qwen2.5-1.5b | — | 131K | No | Free/Free | — | — |
Qwen2.5 14B oah/qwen2.5 | — | 131K | No | $0.30/$0.30 | $0.12/$0.39 | — |
Qwen 2.5 Coder 32B Instruct oah/qwen2.5-coder | — | 16K | No | $0.80/$0.80 | — | — |
Qwen2.5-VL (72B) Instruct oah/qwen2.5-vl | — | 33K | No | $1.95/$8.00 | $0.20/$0.60 | — |
Qwen3 0.6B oah/qwen3-0.6b | — | 41K | No | Free/Free | — | — |
Qwen3 0.6B Base oah/qwen3-0.6b-base | — | 33K | No | Free/Free | — | — |
Qwen3 1.7B oah/qwen3-1.7b | — | 41K | No | Free/Free | — | — |
Qwen3 1.7B Base oah/qwen3-1.7b-base | — | 33K | No | Free/Free | — | — |
Qwen3 14B Base oah/qwen3-14b-base | — | 33K | No | Free/Free | — | — |
Qwen/Qwen3-235B-A22B oah/qwen3 | — | — | No | Free/Free | $0.10/$0.28 | $0.29/$0.39 |
Qwen3 235B A22B Instruct 2507 FP8 Throughput oah/qwen3-235b-a22b-instruct-2507-tput | — | 262K | No | $0.20/$0.60 | — | — |
Qwen3 235B A22B Thinking 2507 FP8 oah/qwen3-235b-a22b-thinking | — | 262K | No | $0.65/$3.00 | $0.30/$2.90 | — |
Qwen3 30B A3b Base oah/qwen3-30b-a3b-base | — | 33K | No | Free/Free | — | — |
Qwen3 30B A3B Instruct 2507 Lora oah/qwen3-30b-a3b-instruct-2507-lora | — | 262K | No | Free/Free | — | — |
Qwen3 4B Base oah/qwen3-4b-base | — | 33K | No | Free/Free | — | — |
Qwen3 8B Base oah/qwen3-8b-base | — | 33K | No | Free/Free | — | — |
Qwen3 8B Lora oah/qwen3-8b-lora | — | 41K | No | Free/Free | — | — |
Qwen3 Coder 30B A3b Instruct oah/qwen3-coder | — | 262K | No | $2.00/$2.00 | $0.29/$1.20 | — |
Qwen3 Coder Next Fp8 oah/qwen3-coder-next | — | 262K | No | $0.50/$1.20 | — | — |
Qwen3 Next 80B A3b Instruct oah/qwen3-next | — | 262K | No | Free/Free | $0.14/$1.40 | — |
Qwen3 Next 80B A3b Thinking oah/qwen3-next-80b-a3b-thinking | — | 262K | No | $0.15/$1.50 | — | — |
Qwen3-VL-235B-A22B-Instruct-FP8 oah/qwen3-vl | — | 262K | No | $0.18/$0.68 | — | — |
Qwen3.5 35B A3b oah/qwen3.5 | — | 262K | No | Free/Free | — | — |
Arize AI Qwen 2 1.5B Instruct oah/qwen-2-1.5b | — | 33K | No | $0.10/$0.10 | — | — |
Cogito V1 Preview Qwen 14B oah/cogito-v1-preview-qwen | — | 131K | No | Free/Free | — | — |
Qwen/Qwen-Image-Edit oah/qwen-image-edit | — | — | No | — | — | — |
Qwen/Qwen-Image-Edit-Max oah/qwen-image-edit-max | — | — | No | — | — | — |
Qwen/Qwen-Image-Max oah/qwen-image-max | — | — | No | — | — | — |
Qwen/Qwen3-Max oah/qwen3-max | — | — | No | — | — | — |
Qwen/Qwen3-Max-Thinking oah/qwen3-max-thinking | — | — | No | — | — | — |
Qwen/Qwen3.5-0.8B oah/qwen3.5-0.8b | — | — | No | — | — | — |
What you get at each pricing tier. Hub adds security, governance, and multi-provider routing on top of raw API access.
| Mode | What You Pay | PII Redaction | Budget Caps | Routing | Audit Trail |
|---|---|---|---|---|---|
| Direct to Alibaba Cloud | Provider pricing only | None | None | Manual | None |
| Hub — Managed Mode | Provider + 25% markup | 28+ PII types | Per-key hard caps | Smart Router | Full compliance log |
| Hub — Pro BYOK ($29/mo) | Direct to provider (0% markup) | 28+ PII types | Per-key hard caps | Smart Router | Full compliance log |
Chinese/Asian language applications
Multilingual content generation and translation
Budget-friendly open-source deployments
Fine-tuning base models for domain-specific tasks
from openai import OpenAI
client = OpenAI(
base_url="https://api.aimodelgate.ai/v1",
api_key="your_hub_api_key"
)
# Use any virtual model name from the pricing table above
response = client.chat.completions.create(
model="oah/qwq",
messages=[{"role": "user", "content": "Hello!"}]
)Use any virtual model name from the pricing table above (prefixed with oah/). Works with the standard OpenAI SDK. Every request is PII-scanned before reaching Alibaba Cloud (Open Source).
Get started with 1,000,000 free credits. Every Qwen request is PII-scanned, cost-optimized, and fully logged — zero configuration.
Not ready yet? Get notified about Qwen updates:
Meta's open-weights Llama family is the most widely deployed open-source LLM series. Compare Llama API pricing across Gr…
OpenAI's GPT family powers the majority of commercial AI applications. Compare GPT-4 API cost and OpenAI API pricing acr…
Google's Gemini family offers powerful multimodal capabilities with large context windows. Compare Gemini API pricing an…
Anthropic's Claude family is built with safety and reliability at its core. Compare Claude API pricing and Claude Sonnet…
DeepSeek has rapidly risen as a leading open-source model family, known for exceptional coding performance and cost effi…
Model registry last updated: . Pricing shown is the lowest available rate across providers (per 1M tokens, USD). Actual pricing depends on provider and plan.