35 Models · 3 Providers · PII Redacted

🏮Qwen Models

by Alibaba Cloud (Open Source)

Alibaba's Qwen family offers strong multilingual performance, with a particular edge in Chinese and other Asian languages. The Qwen 2.5 series adds competitive results on English benchmarks while keeping that multilingual strength. The listings below compare Qwen API pricing across providers.

From $0.02/M tokens
3 providers
28+ PII entities redacted

Why deploy Qwen through AI ModelGate?

Automatic PII Redaction

Every Qwen request is scanned for 28+ PII entity types — SSNs, credit cards, emails, API keys, and more — before it reaches any provider.
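As a rough illustration of what pre-request scanning can look like, here is a minimal regex-based sketch covering three of the entity types named above. The patterns, the `redact` helper, and the `[TYPE]` placeholder format are assumptions made for illustration; this is not ModelGate's actual detector, which this page does not describe.

```python
import re

# Hypothetical patterns for three entity types; a production detector
# would be far more thorough (checksums, context, many more types).
PII_PATTERNS = {
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "CREDIT_CARD": re.compile(r"\b(?:\d{4}[- ]?){3}\d{4}\b"),
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
}

def redact(text: str) -> tuple[str, list[str]]:
    """Replace each match with a [TYPE] placeholder and report the types found."""
    found = []
    for label, pattern in PII_PATTERNS.items():
        text, count = pattern.subn(f"[{label}]", text)
        if count:
            found.append(label)
    return text, found

clean, hits = redact("Mail jane.doe@example.com, SSN 123-45-6789.")
print(clean)  # Mail [EMAIL], SSN [SSN].
print(hits)   # ['SSN', 'EMAIL']
```

The redacted text, not the original, is what would be forwarded to the provider in this sketch.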

Smart Cost Routing

Qwen is available across 3 providers. Our Smart Router picks the cheapest one for each request. Managed Mode adds a 25% markup; Pro BYOK adds 0%.
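To make "cheapest per request" concrete, the sketch below picks a provider by estimated cost for a given token mix. The `Quote` class and `cheapest` function are hypothetical, the selection rule is an assumption about how such a router could work, and the rates are the two listed for oah/qwen2.5 in the pricing table, assumed to belong to Together.ai and DeepInfra in that order.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Quote:
    provider: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        """Provider cost in USD for one request of the given size."""
        return (input_tokens * self.input_per_m
                + output_tokens * self.output_per_m) / 1_000_000

def cheapest(quotes: list[Quote], input_tokens: int, output_tokens: int) -> Quote:
    """Pick the quote with the lowest estimated cost for this request."""
    return min(quotes, key=lambda q: q.cost(input_tokens, output_tokens))

# Rates for oah/qwen2.5 as read from the pricing table (provider assignment assumed).
quotes = [
    Quote("Together.ai", 0.30, 0.30),
    Quote("DeepInfra", 0.12, 0.39),
]

print(cheapest(quotes, input_tokens=10_000, output_tokens=500).provider)
print(cheapest(quotes, input_tokens=100, output_tokens=5_000).provider)
```

With these rates the prompt-heavy request goes to DeepInfra and the output-heavy one to Together.ai, which is why the choice is worth making per request rather than once per model.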

Zero Code Changes

Change two lines in your OpenAI SDK — base_url and api_key — and every request flows through ModelGate. Full backward compatibility.

Full Observability

Per-request logging of token counts, latency, DLP violations, and cost. Never wonder what your AI spend is again.
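One way to picture these logs is as one record per request that can be summed into spend per model. The sketch below assumes a record shape; the `RequestLog` field names and the sample values are made up for illustration and do not reflect ModelGate's actual log schema.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass
class RequestLog:
    model: str           # virtual model name, e.g. "oah/qwq"
    input_tokens: int
    output_tokens: int
    latency_ms: float
    dlp_violations: int  # count of DLP findings (assumed meaning)
    cost_usd: float

def spend_by_model(logs: list[RequestLog]) -> dict[str, float]:
    """Sum the logged cost per model."""
    totals: dict[str, float] = defaultdict(float)
    for entry in logs:
        totals[entry.model] += entry.cost_usd
    return dict(totals)

# Illustrative sample records, not real traffic.
logs = [
    RequestLog("oah/qwq", 1_000, 500, 820.0, 0, 0.0018),
    RequestLog("oah/qwq", 2_000, 1_000, 1430.0, 2, 0.0036),
    RequestLog("oah/qwen2-1.5b", 5_000, 500, 210.0, 0, 0.00011),
]
for model, total in spend_by_model(logs).items():
    print(f"{model}: ${total:.5f}")
```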

Qwen Strengths

  • Best-in-class Chinese and Asian language support
  • Competitive English performance in Qwen 2.5 series
  • Open-weights for transparency and fine-tuning
  • Strong coding variants (Qwen-Coder)
  • Multiple sizes, from 0.6B to 235B in the listing below, for flexible deployment

Available Qwen Models (35)

Qwen QwQ-32B

oah/qwq
Open Source

Deploy Qwen QwQ-32B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: $1.20/M · Output: $1.20/M

Qwen 2 (1.5B)

oah/qwen2-1.5b
Open Source

Deploy Qwen 2 (1.5B) with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: $0.02/M · Output: $0.02/M

Qwen 2 (72B)

oah/qwen2
Open Source

Deploy Qwen 2 (72B) with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: Free · Output: Free

Qwen2-VL (72B) Instruct

oah/qwen2-vl
Open Source

Deploy Qwen2-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: $1.20/M · Output: $1.20/M

Qwen2.5 1.5B

oah/qwen2.5-1.5b
Open Source

Deploy Qwen2.5 1.5B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: Free · Output: Free

Qwen2.5 14B

oah/qwen2.5
Open Source

Deploy Qwen2.5 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai · DeepInfra
Input: $0.12/M · Output: $0.30/M

Qwen 2.5 Coder 32B Instruct

oah/qwen2.5-coder
Open Source

Deploy Qwen 2.5 Coder 32B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: $0.80/M · Output: $0.80/M

Qwen2.5-VL (72B) Instruct

oah/qwen2.5-vl
Open Source

Deploy Qwen2.5-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai · DeepInfra
Input: $0.20/M · Output: $0.60/M

Qwen3 0.6B

oah/qwen3-0.6b
Open Source

Deploy Qwen3 0.6B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 0.6B Base

oah/qwen3-0.6b-base
Open Source

Deploy Qwen3 0.6B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 1.7B

oah/qwen3-1.7b
Open Source

Deploy Qwen3 1.7B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 1.7B Base

oah/qwen3-1.7b-base
Open Source

Deploy Qwen3 1.7B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 14B Base

oah/qwen3-14b-base
Open Source

Deploy Qwen3 14B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen/Qwen3-235B-A22B

oah/qwen3
Open Source

Deploy Qwen/Qwen3-235B-A22B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai · Groq · DeepInfra
Reasoning · Input: Free · Output: Free

Qwen3 235B A22B Instruct 2507 FP8 Throughput

oah/qwen3-235b-a22b-instruct-2507-tput
Open Source

Deploy Qwen3 235B A22B Instruct 2507 FP8 Throughput with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: $0.20/M · Output: $0.60/M

Qwen3 235B A22B Thinking 2507 FP8

oah/qwen3-235b-a22b-thinking
Open Source

Deploy Qwen3 235B A22B Thinking 2507 FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai · DeepInfra
Reasoning · Input: $0.30/M · Output: $2.90/M

Qwen3 30B A3b Base

oah/qwen3-30b-a3b-base
Open Source

Deploy Qwen3 30B A3b Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 30B A3B Instruct 2507 Lora

oah/qwen3-30b-a3b-instruct-2507-lora
Open Source

Deploy Qwen3 30B A3B Instruct 2507 Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 4B Base

oah/qwen3-4b-base
Open Source

Deploy Qwen3 4B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 8B Base

oah/qwen3-8b-base
Open Source

Deploy Qwen3 8B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 8B Lora

oah/qwen3-8b-lora
Open Source

Deploy Qwen3 8B Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: Free · Output: Free

Qwen3 Coder 30B A3b Instruct

oah/qwen3-coder
Open Source

Deploy Qwen3 Coder 30B A3b Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai · DeepInfra
Reasoning · Input: $0.29/M · Output: $1.20/M

Qwen3 Coder Next Fp8

oah/qwen3-coder-next
Open Source

Deploy Qwen3 Coder Next Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: $0.50/M · Output: $1.20/M

Qwen3 Next 80B A3b Instruct

oah/qwen3-next
Open Source

Deploy Qwen3 Next 80B A3b Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai · DeepInfra
Reasoning · Input: Free · Output: Free

Qwen3 Next 80B A3b Thinking

oah/qwen3-next-80b-a3b-thinking
Open Source

Deploy Qwen3 Next 80B A3b Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Reasoning · Input: $0.15/M · Output: $1.50/M

Qwen3-VL-235B-A22B-Instruct-FP8

oah/qwen3-vl
Open Source

Deploy Qwen3-VL-235B-A22B-Instruct-FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai · DeepInfra
Reasoning · Input: $0.18/M · Output: $0.68/M

Qwen3.5 35B A3b

oah/qwen3.5
Open Source

Deploy Qwen3.5 35B A3b with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai · DeepInfra
Reasoning · Input: Free · Output: Free

Arize AI Qwen 2 1.5B Instruct

oah/qwen-2-1.5b
Open Source

Deploy Arize AI Qwen 2 1.5B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: $0.10/M · Output: $0.10/M

Cogito V1 Preview Qwen 14B

oah/cogito-v1-preview-qwen
Open Source

Deploy Cogito V1 Preview Qwen 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: Free · Output: Free

Qwen/Qwen-Image-Edit

oah/qwen-image-edit
Open Source

Deploy Qwen/Qwen-Image-Edit with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Input: Free · Output: Free

Qwen/Qwen-Image-Edit-Max

oah/qwen-image-edit-max
Open Source

Deploy Qwen/Qwen-Image-Edit-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Input: Free · Output: Free

Qwen/Qwen-Image-Max

oah/qwen-image-max
Open Source

Deploy Qwen/Qwen-Image-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Input: Free · Output: Free

Qwen/Qwen3-Max

oah/qwen3-max
Open Source

Deploy Qwen/Qwen3-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Reasoning · Input: Free · Output: Free

Qwen/Qwen3-Max-Thinking

oah/qwen3-max-thinking
Open Source

Deploy Qwen/Qwen3-Max-Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Reasoning · Input: Free · Output: Free

Qwen/Qwen3.5-0.8B

oah/qwen3.5-0.8b
Open Source

Deploy Qwen/Qwen3.5-0.8B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Reasoning · Input: Free · Output: Free

Qwen Pricing Comparison (per 1M tokens, USD)

Input / Output pricing by provider. Managed Mode adds a 25% markup; Pro BYOK adds 0%.

Model | Virtual model name | Context | Vision | Input/Output price by provider
Qwen QwQ-32B | oah/qwq | 131K | No | Together.ai $1.20/$1.20
Qwen 2 (1.5B) | oah/qwen2-1.5b | 33K | No | Together.ai $0.02/$0.02
Qwen 2 (72B) | oah/qwen2 | 33K | No | Together.ai Free/Free
Qwen2-VL (72B) Instruct | oah/qwen2-vl | 33K | No | Together.ai $1.20/$1.20
Qwen2.5 1.5B | oah/qwen2.5-1.5b | 131K | No | Together.ai Free/Free
Qwen2.5 14B | oah/qwen2.5 | 131K | No | Together.ai $0.30/$0.30; DeepInfra $0.12/$0.39
Qwen 2.5 Coder 32B Instruct | oah/qwen2.5-coder | 16K | No | Together.ai $0.80/$0.80
Qwen2.5-VL (72B) Instruct | oah/qwen2.5-vl | 33K | No | Together.ai $1.95/$8.00; DeepInfra $0.20/$0.60
Qwen3 0.6B | oah/qwen3-0.6b | 41K | No | Together.ai Free/Free
Qwen3 0.6B Base | oah/qwen3-0.6b-base | 33K | No | Together.ai Free/Free
Qwen3 1.7B | oah/qwen3-1.7b | 41K | No | Together.ai Free/Free
Qwen3 1.7B Base | oah/qwen3-1.7b-base | 33K | No | Together.ai Free/Free
Qwen3 14B Base | oah/qwen3-14b-base | 33K | No | Together.ai Free/Free
Qwen/Qwen3-235B-A22B | oah/qwen3 | - | No | Together.ai Free/Free; DeepInfra $0.10/$0.28; Groq $0.29/$0.39
Qwen3 235B A22B Instruct 2507 FP8 Throughput | oah/qwen3-235b-a22b-instruct-2507-tput | 262K | No | Together.ai $0.20/$0.60
Qwen3 235B A22B Thinking 2507 FP8 | oah/qwen3-235b-a22b-thinking | 262K | No | Together.ai $0.65/$3.00; DeepInfra $0.30/$2.90
Qwen3 30B A3b Base | oah/qwen3-30b-a3b-base | 33K | No | Together.ai Free/Free
Qwen3 30B A3B Instruct 2507 Lora | oah/qwen3-30b-a3b-instruct-2507-lora | 262K | No | Together.ai Free/Free
Qwen3 4B Base | oah/qwen3-4b-base | 33K | No | Together.ai Free/Free
Qwen3 8B Base | oah/qwen3-8b-base | 33K | No | Together.ai Free/Free
Qwen3 8B Lora | oah/qwen3-8b-lora | 41K | No | Together.ai Free/Free
Qwen3 Coder 30B A3b Instruct | oah/qwen3-coder | 262K | No | Together.ai $2.00/$2.00; DeepInfra $0.29/$1.20
Qwen3 Coder Next Fp8 | oah/qwen3-coder-next | 262K | No | Together.ai $0.50/$1.20
Qwen3 Next 80B A3b Instruct | oah/qwen3-next | 262K | No | Together.ai Free/Free; DeepInfra $0.14/$1.40
Qwen3 Next 80B A3b Thinking | oah/qwen3-next-80b-a3b-thinking | 262K | No | Together.ai $0.15/$1.50
Qwen3-VL-235B-A22B-Instruct-FP8 | oah/qwen3-vl | 262K | No | $0.18/$0.68
Qwen3.5 35B A3b | oah/qwen3.5 | 262K | No | Free/Free
Arize AI Qwen 2 1.5B Instruct | oah/qwen-2-1.5b | 33K | No | Together.ai $0.10/$0.10
Cogito V1 Preview Qwen 14B | oah/cogito-v1-preview-qwen | 131K | No | Together.ai Free/Free
Qwen/Qwen-Image-Edit | oah/qwen-image-edit | - | No | -
Qwen/Qwen-Image-Edit-Max | oah/qwen-image-edit-max | - | No | -
Qwen/Qwen-Image-Max | oah/qwen-image-max | - | No | -
Qwen/Qwen3-Max | oah/qwen3-max | - | No | -
Qwen/Qwen3-Max-Thinking | oah/qwen3-max-thinking | - | No | -
Qwen/Qwen3.5-0.8B | oah/qwen3.5-0.8b | - | No | -

Qwen Direct vs AI ModelGate

What you get at each pricing tier. Hub adds security, governance, and multi-provider routing on top of raw API access.

Mode | What You Pay | PII Redaction | Budget Caps | Routing | Audit Trail
Direct to Alibaba Cloud | Provider pricing only | None | None | Manual | None
Hub (Managed Mode) | Provider + 25% markup | 28+ PII types | Per-key hard caps | Smart Router | Full compliance log
Hub (Pro BYOK, $29/mo) | Direct to provider (0% markup) | 28+ PII types | Per-key hard caps | Smart Router | Full compliance log
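The arithmetic behind the two Hub tiers can be sketched as follows. This assumes the 25% markup is applied multiplicatively to the provider cost and that the $29/mo Pro BYOK fee is the only other charge; both are simplifying assumptions, and the function names are hypothetical.

```python
MANAGED_MARKUP = 0.25         # Managed Mode markup from the table above
PRO_BYOK_MONTHLY_USD = 29.00  # Pro BYOK subscription from the table above

def request_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float,
                 markup: float = 0.0) -> float:
    """Cost in USD for one request, given per-1M-token rates and a markup."""
    provider_cost = (input_tokens * input_per_m
                     + output_tokens * output_per_m) / 1_000_000
    return provider_cost * (1 + markup)

def break_even_monthly_spend() -> float:
    """Monthly provider spend at which the markup equals the subscription fee."""
    return PRO_BYOK_MONTHLY_USD / MANAGED_MARKUP

# 1M input + 1M output tokens on oah/qwq at $1.20/$1.20 per 1M tokens.
byok = request_cost(1_000_000, 1_000_000, 1.20, 1.20)
managed = request_cost(1_000_000, 1_000_000, 1.20, 1.20, markup=MANAGED_MARKUP)
print(f"BYOK: ${byok:.2f}  Managed: ${managed:.2f}")
print(f"Break-even provider spend: ${break_even_monthly_spend():.2f}/month")
```

Under these assumptions the same workload costs $2.40 on BYOK and $3.00 in Managed Mode, and the Pro BYOK fee pays for itself once provider spend exceeds $116 per month.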

Popular Use Cases

  1. Chinese/Asian language applications
  2. Multilingual content generation and translation
  3. Budget-friendly open-source deployments
  4. Fine-tuning base models for domain-specific tasks

Integration — 2 Lines

from openai import OpenAI

client = OpenAI(
    base_url="https://api.aimodelgate.ai/v1",
    api_key="your_hub_api_key"
)

# Use any virtual model name from the pricing table above
response = client.chat.completions.create(
    model="oah/qwq",
    messages=[{"role": "user", "content": "Hello!"}]
)

Use any virtual model name from the pricing table above (prefixed with oah/). Works with the standard OpenAI SDK. Every request is PII-scanned before it reaches the upstream provider.
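Where the OpenAI SDK is not available, the same call can be sketched with the Python standard library. This assumes the gateway exposes the OpenAI-style POST /chat/completions route under the base URL shown above, which this page implies but does not spell out. The `build_request` helper is hypothetical, and the request is only constructed here, not sent.

```python
import json
import urllib.request

BASE_URL = "https://api.aimodelgate.ai/v1"  # base_url from the snippet above

def build_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completion request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",  # assumed OpenAI-compatible route
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_request("your_hub_api_key", "oah/qwq", "Hello!")
print(req.get_method(), req.full_url)
# To send it: urllib.request.urlopen(req), then json-decode the response body.
```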

Frequently Asked Questions

What is the Qwen API pricing?
Qwen API pricing varies by model size and provider. In Managed Mode, we add a 25% markup. With Pro BYOK, pay the provider directly at 0% markup. See the pricing table above for current rates.
What is the Qwen 2.5 cost?
Qwen 2.5 cost depends on the parameter count (0.5B to 72B) and provider. In the listing above, Qwen2.5 1.5B is free and Qwen2.5 14B starts at $0.12/M input and $0.30/M output. Check the pricing comparison table above for the other variants.
Is Qwen good for English tasks?
Yes. Qwen 2.5 72B is competitive with Llama 3.3 70B on English benchmarks. For Chinese and multilingual tasks, Qwen is often the best open-source choice.

Deploy Qwen with Enterprise-Grade Security

Get started with 1,000,000 free credits. Every Qwen request is PII-scanned, cost-optimized, and fully logged — zero configuration.


Pricing shown is the lowest available rate across providers (per 1M tokens, USD). Actual pricing depends on provider and plan.