Alibaba AI models

Maker of the Qwen family — open-weight and hosted via Alibaba Cloud Model Studio.

Cheapest GAQwen-Flash$0.138 /M blended Biggest contextQwen3.7-Max1M tokens FlagshipQwen-Max (Qwen2.5-Max)GA Tracked18 models13 generally available

Lifecycle watch · 5

Qwen3-Max — GA, shuts down 8 Sep 2026 → migrate to qwen3.7-max
Qwen3-Max-Preview — Deprecated, shuts down 8 Sep 2026 → migrate to qwen3.7-max
Qwen3.6-Max-Preview — Deprecated, shuts down 8 Sep 2026 → migrate to qwen3.7-max
Qwen-Turbo — Deprecated → migrate to qwen-flash
Qwen3-235B-A22B (open weights) — Deprecated → migrate to qwen3.7-plus

The lineup

All Alibaba models

Official pricing ↗

Model ↕	Status ↕	Context ↕	Input $/M ↕	Output $/M ↕	Blended $/M ↕	Cutoff ↕
Qwen-Max (Qwen2.5-Max) Alibaba · text	GA	33K	$1.60	$6.40	$2.80	—
Qwen3.7-Max Alibaba · text, image, video	GA	1M	$2.50	$7.50	$3.75	—
Qwen3.7-Plus Alibaba · text, image, video	GA	1M	$0.4	$1.60	$0.7	—
Qwen3.6-Flash Alibaba · text, image, video	GA	1M	$0.25	$1.50	$0.563	—
Qwen3-Max Alibaba · text	GA	262K	$1.20	$6	$2.40	2025-06
Qwen3.7-Plus (snapshot 2026-05-26) Alibaba · text, image, video	GA	1M	$0.4	$1.60	$0.7	—
Qwen3.5-Plus Alibaba · text, image	GA	262K	$0.4	$2.40	$0.9	—
Qwen3.6-Plus Alibaba · text, image, video	GA	1M	$0.5	$3	$1.13	—
Qwen3.5-Flash Alibaba · text, image	GA	1M	$0.1	$0.4	$0.175	—
Qwen-Plus (Qwen3-series) Alibaba · text	GA	1M	$0.4	$1.20	$0.6	—
Qwen-Flash Alibaba · text	GA	1M	$0.05	$0.4	$0.138	—
Qwen3.5-Omni-Plus Alibaba · text, image, audio, video	GA	—	—	—	—	—
Qwen3-Rerank Alibaba · text	GA	—	—	—	—	—
Qwen3.7-Max-Preview Alibaba · text	Preview	1M	$2.50	$7.50	$3.75	—
Qwen3-Max-Preview Alibaba · text	Deprecated	262K	$1.20	$6	$2.40	—
Qwen3.6-Max-Preview Alibaba · text	Deprecated	262K	$1.30	$7.80	$2.92	—
Qwen-Turbo Alibaba · text	Deprecated	1M	$0.05	$0.2	$0.088	—
Qwen3-235B-A22B (open weights) Alibaba · text	Deprecated	—	—	—	—	—

Blended = 0.75 × input + 0.25 × output $/M tokens (a fair single-number cost proxy). Click any header to sort.

FAQ

Alibaba pricing & models

What is the cheapest Alibaba model?

Qwen-Flash is the cheapest generally-available Alibaba model we track, at $0.05 per 1M input tokens and $0.4 per 1M output tokens ($0.138/1M blended).

What is Alibaba's flagship model?

Qwen-Max (Qwen2.5-Max) is Alibaba's most prominent model in our catalog, with a 33K-token context window and pricing of $1.60/$6.40 per 1M input/output tokens.

How many Alibaba models are there?

We track 18 Alibaba models, of which 13 are generally available and 5 are deprecated or scheduled for retirement.

Which Alibaba models are being deprecated?

Qwen3-Max (retires 8 Sep 2026), Qwen3-Max-Preview (retires 8 Sep 2026), Qwen3.6-Max-Preview (retires 8 Sep 2026), Qwen-Turbo, Qwen3-235B-A22B (open weights).

Alibaba AI models

All Alibaba models

Alibaba pricing & models

Track Alibaba price & deprecation changes