Qwen2-VL (72B) Instruct
oah/qwen2-vlDeploy Qwen2-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
by Alibaba Cloud (Open Source)
Alibaba's Qwen family offers strong multilingual performance with a particular edge in Chinese and Asian languages. Compare Qwen API pricing and Qwen 2.5 cost across providers. Qwen 2.5 brings competitive performance on English benchmarks while maintaining multilingual excellence.
Every Qwen request is scanned for 28+ PII entity types โ SSNs, credit cards, emails, API keys, and more โ before it reaches any provider.
Qwen is available across 3 providers. Our Smart Router picks the cheapest one per-request. 25% managed markup / 0% on Pro BYOK.
Change two lines in your OpenAI SDK โ base_url and api_key โ and every request flows through the Hub. Full backward compatibility.
Per-request logging of token counts, latency, DLP violations, and cost. Never wonder what your AI spend is again.
oah/qwen2-vlDeploy Qwen2-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5Deploy Qwen 2.5 14B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3Deploy Qwen/Qwen3-235B-A22B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-235b-a22b-instruct-2507-tputDeploy Qwen3 235B A22B Instruct 2507 FP8 Throughput with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-235b-a22b-thinkingDeploy Qwen3 235B A22B Thinking 2507 FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-coderDeploy Qwen3 Coder 480B A35B Instruct Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-coder-nextDeploy Qwen3 Coder Next Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-nextDeploy Qwen3 Next 80B A3b Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-vlDeploy Qwen3-VL-8B-Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.5Deploy Qwen3.5 35B A3b with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-2-1.5bDeploy Arize AI Qwen 2 1.5B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-image-editDeploy Qwen/Qwen-Image-Edit with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-image-edit-maxDeploy Qwen/Qwen-Image-Edit-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-image-maxDeploy Qwen/Qwen-Image-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5-vlDeploy Qwen/Qwen2.5-VL-32B-Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-maxDeploy Qwen/Qwen3-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-max-thinkingDeploy Qwen/Qwen3-Max-Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.5-0.8bDeploy Qwen/Qwen3.5-0.8B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
Input / Output pricing by provider. Managed Mode adds a 25% managed markup. Pro BYOK = 0% markup.
| Model | Params | Context | Vision | Together.ai | DeepInfra | Groq |
|---|---|---|---|---|---|---|
Qwen2-VL (72B) Instruct oah/qwen2-vl | โ | 33K | No | $1.20/$1.20 | โ | โ |
Qwen 2.5 14B Instruct oah/qwen2.5 | โ | 33K | No | $0.30/$0.30 | $0.12/$0.39 | โ |
Qwen/Qwen3-235B-A22B oah/qwen3 | โ | โ | No | Free/Free | $0.10/$0.28 | $0.29/$0.39 |
Qwen3 235B A22B Instruct 2507 FP8 Throughput oah/qwen3-235b-a22b-instruct-2507-tput | โ | 262K | No | $0.20/$0.60 | โ | โ |
Qwen3 235B A22B Thinking 2507 FP8 oah/qwen3-235b-a22b-thinking | โ | 262K | No | $0.65/$3.00 | $0.30/$2.90 | โ |
Qwen3 Coder 480B A35B Instruct Fp8 oah/qwen3-coder | โ | 262K | No | $2.00/$2.00 | $0.29/$1.20 | โ |
Qwen3 Coder Next Fp8 oah/qwen3-coder-next | โ | 262K | No | $0.50/$1.20 | โ | โ |
Qwen3 Next 80B A3b Instruct oah/qwen3-next | โ | 262K | No | $0.15/$1.50 | $0.14/$1.40 | โ |
Qwen3-VL-8B-Instruct oah/qwen3-vl | โ | 262K | No | $0.18/$0.68 | โ | โ |
Qwen3.5 35B A3b oah/qwen3.5 | โ | 262K | No | Free/Free | โ | โ |
Arize AI Qwen 2 1.5B Instruct oah/qwen-2-1.5b | โ | 33K | No | $0.10/$0.10 | โ | โ |
Qwen/Qwen-Image-Edit oah/qwen-image-edit | โ | โ | No | โ | โ | โ |
Qwen/Qwen-Image-Edit-Max oah/qwen-image-edit-max | โ | โ | No | โ | โ | โ |
Qwen/Qwen-Image-Max oah/qwen-image-max | โ | โ | No | โ | โ | โ |
Qwen/Qwen2.5-VL-32B-Instruct oah/qwen2.5-vl | โ | โ | No | โ | $0.20/$0.60 | โ |
Qwen/Qwen3-Max oah/qwen3-max | โ | โ | No | โ | โ | โ |
Qwen/Qwen3-Max-Thinking oah/qwen3-max-thinking | โ | โ | No | โ | โ | โ |
Qwen/Qwen3.5-0.8B oah/qwen3.5-0.8b | โ | โ | No | โ | โ | โ |
What you get at each pricing tier. Hub adds security, governance, and multi-provider routing on top of raw API access.
| Mode | What You Pay | PII Redaction | Budget Caps | Routing | Audit Trail |
|---|---|---|---|---|---|
| Direct to Alibaba Cloud | Provider pricing only | None | None | Manual | None |
| Hub โ Managed Mode | Provider + 25% markup | 28+ PII types | Per-key hard caps | Smart Router | Full compliance log |
| Hub โ Pro BYOK ($29/mo) | Direct to provider (0% markup) | 28+ PII types | Per-key hard caps | Smart Router | Full compliance log |
Chinese/Asian language applications
Multilingual content generation and translation
Budget-friendly open-source deployments
Fine-tuning base models for domain-specific tasks
from openai import OpenAI
client = OpenAI(
base_url="https://api.opensourceaihub.ai/v1",
api_key="your_hub_api_key"
)
# Use any virtual model name from the pricing table above
response = client.chat.completions.create(
model="oah/qwen2-vl",
messages=[{"role": "user", "content": "Hello!"}]
)Use any virtual model name from the pricing table above (prefixed with oah/). Works with the standard OpenAI SDK. Every request is PII-scanned before reaching Alibaba Cloud (Open Source).
Get started with 1,000,000 free credits. Every Qwen request is PII-scanned, cost-optimized, and fully logged โ zero configuration.
Not ready yet? Get notified about Qwen updates:
Meta's open-weights Llama family is the most widely deployed open-source LLM series. Compare Llama API pricing across Grโฆ
OpenAI's GPT family powers the majority of commercial AI applications. Compare GPT-4 API cost and OpenAI API pricing acrโฆ
Google's Gemini family offers powerful multimodal capabilities with large context windows. Compare Gemini API pricing anโฆ
Anthropic's Claude family is built with safety and reliability at its core. Compare Claude API pricing and Claude Sonnetโฆ
DeepSeek has rapidly risen as a leading open-source model family, known for exceptional coding performance and cost effiโฆ
Model registry last updated: . Pricing shown is the lowest available rate across providers (per 1M tokens, USD). Actual pricing depends on provider and plan.