All Models
18 Models ยท 3 Providers ยท PII Redacted

๐ŸฎQwen Models

by Alibaba Cloud (Open Source)

Alibaba's Qwen family offers strong multilingual performance with a particular edge in Chinese and Asian languages. Compare Qwen API pricing and Qwen 2.5 cost across providers. Qwen 2.5 brings competitive performance on English benchmarks while maintaining multilingual excellence.

From $0.10/M tokens
3 providers
28+ PII entities redacted

Why deploy Qwen through OpenSourceAIHub?

Automatic PII Redaction

Every Qwen request is scanned for 28+ PII entity types โ€” SSNs, credit cards, emails, API keys, and more โ€” before it reaches any provider.

Smart Cost Routing

Qwen is available across 3 providers. Our Smart Router picks the cheapest one per-request. 25% managed markup / 0% on Pro BYOK.

Zero Code Changes

Change two lines in your OpenAI SDK โ€” base_url and api_key โ€” and every request flows through the Hub. Full backward compatibility.

Full Observability

Per-request logging of token counts, latency, DLP violations, and cost. Never wonder what your AI spend is again.

Qwen Strengths

  • Best-in-class Chinese and Asian language support
  • Competitive English performance in Qwen 2.5 series
  • Open-weights for transparency and fine-tuning
  • Strong coding variants (Qwen-Coder)
  • Multiple sizes from 0.5B to 72B for flexible deployment

Available Qwen Models (18)

Qwen2-VL (72B) Instruct

oah/qwen2-vl
Open Source

Deploy Qwen2-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: $1.20/MOutput: $1.20/M

Qwen 2.5 14B Instruct

oah/qwen2.5
Open Source

Deploy Qwen 2.5 14B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra
Input: $0.12/MOutput: $0.30/M

Qwen/Qwen3-235B-A22B

oah/qwen3
Open Source

Deploy Qwen/Qwen3-235B-A22B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiGroqDeepInfra
ReasoningInput: Free/MOutput: Free/M

Qwen3 235B A22B Instruct 2507 FP8 Throughput

oah/qwen3-235b-a22b-instruct-2507-tput
Open Source

Deploy Qwen3 235B A22B Instruct 2507 FP8 Throughput with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
ReasoningInput: $0.20/MOutput: $0.60/M

Qwen3 235B A22B Thinking 2507 FP8

oah/qwen3-235b-a22b-thinking
Open Source

Deploy Qwen3 235B A22B Thinking 2507 FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra
ReasoningInput: $0.30/MOutput: $2.90/M

Qwen3 Coder 480B A35B Instruct Fp8

oah/qwen3-coder
Open Source

Deploy Qwen3 Coder 480B A35B Instruct Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra
ReasoningInput: $0.29/MOutput: $1.20/M

Qwen3 Coder Next Fp8

oah/qwen3-coder-next
Open Source

Deploy Qwen3 Coder Next Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
ReasoningInput: $0.50/MOutput: $1.20/M

Qwen3 Next 80B A3b Instruct

oah/qwen3-next
Open Source

Deploy Qwen3 Next 80B A3b Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra
ReasoningInput: $0.14/MOutput: $1.40/M

Qwen3-VL-8B-Instruct

oah/qwen3-vl
Open Source

Deploy Qwen3-VL-8B-Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra
ReasoningInput: $0.18/MOutput: $0.68/M

Qwen3.5 35B A3b

oah/qwen3.5
Open Source

Deploy Qwen3.5 35B A3b with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra
ReasoningInput: Free/MOutput: Free/M

Arize AI Qwen 2 1.5B Instruct

oah/qwen-2-1.5b
Open Source

Deploy Arize AI Qwen 2 1.5B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai
Input: $0.10/MOutput: $0.10/M

Qwen/Qwen-Image-Edit

oah/qwen-image-edit
Open Source

Deploy Qwen/Qwen-Image-Edit with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Input: Free/MOutput: Free/M

Qwen/Qwen-Image-Edit-Max

oah/qwen-image-edit-max
Open Source

Deploy Qwen/Qwen-Image-Edit-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Input: Free/MOutput: Free/M

Qwen/Qwen-Image-Max

oah/qwen-image-max
Open Source

Deploy Qwen/Qwen-Image-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Input: Free/MOutput: Free/M

Qwen/Qwen2.5-VL-32B-Instruct

oah/qwen2.5-vl
Open Source

Deploy Qwen/Qwen2.5-VL-32B-Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
Input: $0.20/MOutput: $0.60/M

Qwen/Qwen3-Max

oah/qwen3-max
Open Source

Deploy Qwen/Qwen3-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
ReasoningInput: Free/MOutput: Free/M

Qwen/Qwen3-Max-Thinking

oah/qwen3-max-thinking
Open Source

Deploy Qwen/Qwen3-Max-Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
ReasoningInput: Free/MOutput: Free/M

Qwen/Qwen3.5-0.8B

oah/qwen3.5-0.8b
Open Source

Deploy Qwen/Qwen3.5-0.8B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra
ReasoningInput: Free/MOutput: Free/M

Qwen Pricing Comparison (per 1M tokens, USD)

Input / Output pricing by provider. Managed Mode adds a 25% managed markup. Pro BYOK = 0% markup.

ModelParamsContextVisionTogether.aiDeepInfraGroq
Qwen2-VL (72B) Instruct
oah/qwen2-vl
โ€”33KNo
$1.20/$1.20
โ€”โ€”
Qwen 2.5 14B Instruct
oah/qwen2.5
โ€”33KNo
$0.30/$0.30
$0.12/$0.39
โ€”
Qwen/Qwen3-235B-A22B
oah/qwen3
โ€”โ€”No
Free/Free
$0.10/$0.28
$0.29/$0.39
Qwen3 235B A22B Instruct 2507 FP8 Throughput
oah/qwen3-235b-a22b-instruct-2507-tput
โ€”262KNo
$0.20/$0.60
โ€”โ€”
Qwen3 235B A22B Thinking 2507 FP8
oah/qwen3-235b-a22b-thinking
โ€”262KNo
$0.65/$3.00
$0.30/$2.90
โ€”
Qwen3 Coder 480B A35B Instruct Fp8
oah/qwen3-coder
โ€”262KNo
$2.00/$2.00
$0.29/$1.20
โ€”
Qwen3 Coder Next Fp8
oah/qwen3-coder-next
โ€”262KNo
$0.50/$1.20
โ€”โ€”
Qwen3 Next 80B A3b Instruct
oah/qwen3-next
โ€”262KNo
$0.15/$1.50
$0.14/$1.40
โ€”
Qwen3-VL-8B-Instruct
oah/qwen3-vl
โ€”262KNo
$0.18/$0.68
โ€”โ€”
Qwen3.5 35B A3b
oah/qwen3.5
โ€”262KNo
Free/Free
โ€”โ€”
Arize AI Qwen 2 1.5B Instruct
oah/qwen-2-1.5b
โ€”33KNo
$0.10/$0.10
โ€”โ€”
Qwen/Qwen-Image-Edit
oah/qwen-image-edit
โ€”โ€”Noโ€”โ€”โ€”
Qwen/Qwen-Image-Edit-Max
oah/qwen-image-edit-max
โ€”โ€”Noโ€”โ€”โ€”
Qwen/Qwen-Image-Max
oah/qwen-image-max
โ€”โ€”Noโ€”โ€”โ€”
Qwen/Qwen2.5-VL-32B-Instruct
oah/qwen2.5-vl
โ€”โ€”Noโ€”
$0.20/$0.60
โ€”
Qwen/Qwen3-Max
oah/qwen3-max
โ€”โ€”Noโ€”โ€”โ€”
Qwen/Qwen3-Max-Thinking
oah/qwen3-max-thinking
โ€”โ€”Noโ€”โ€”โ€”
Qwen/Qwen3.5-0.8B
oah/qwen3.5-0.8b
โ€”โ€”Noโ€”โ€”โ€”

Qwen Direct vs OpenSourceAIHub

What you get at each pricing tier. Hub adds security, governance, and multi-provider routing on top of raw API access.

ModeWhat You PayPII RedactionBudget CapsRoutingAudit Trail
Direct to Alibaba CloudProvider pricing onlyNoneNoneManualNone
Hub โ€” Managed ModeProvider + 25% markup28+ PII typesPer-key hard capsSmart RouterFull compliance log
Hub โ€” Pro BYOK ($29/mo)Direct to provider (0% markup)28+ PII typesPer-key hard capsSmart RouterFull compliance log

Popular Use Cases

1

Chinese/Asian language applications

2

Multilingual content generation and translation

3

Budget-friendly open-source deployments

4

Fine-tuning base models for domain-specific tasks

Integration โ€” 2 Lines

from openai import OpenAI

client = OpenAI(
    base_url="https://api.opensourceaihub.ai/v1",
    api_key="your_hub_api_key"
)

# Use any virtual model name from the pricing table above
response = client.chat.completions.create(
    model="oah/qwen2-vl",
    messages=[{"role": "user", "content": "Hello!"}]
)

Use any virtual model name from the pricing table above (prefixed with oah/). Works with the standard OpenAI SDK. Every request is PII-scanned before reaching Alibaba Cloud (Open Source).

Frequently Asked Questions

What is the Qwen API pricing?
Qwen API pricing varies by model size and provider. In Managed Mode, we add a 25% markup. With Pro BYOK, pay the provider directly at 0% markup. See the pricing table above for current rates.
What is the Qwen 2.5 cost?
Qwen 2.5 cost depends on the parameter count (0.5B to 72B) and provider. Smaller variants are extremely affordable. Check the pricing comparison table above.
Is Qwen good for English tasks?
Yes. Qwen 2.5 72B is competitive with Llama 3.3 70B on English benchmarks. For Chinese and multilingual tasks, Qwen is often the best open-source choice.

Deploy Qwen with Enterprise-Grade Security

Get started with 1,000,000 free credits. Every Qwen request is PII-scanned, cost-optimized, and fully logged โ€” zero configuration.

Not ready yet? Get notified about Qwen updates:

Explore Other Model Families

Model registry last updated: . Pricing shown is the lowest available rate across providers (per 1M tokens, USD). Actual pricing depends on provider and plan.