Alibaba AI models
Maker of the Qwen family — open-weight and hosted via Alibaba Cloud Model Studio.
- Qwen3-Max — GA, shuts down 8 Sep 2026 → migrate to qwen3.7-max
- Qwen3-Max-Preview — Deprecated, shuts down 8 Sep 2026 → migrate to qwen3.7-max
- Qwen3.6-Max-Preview — Deprecated, shuts down 8 Sep 2026 → migrate to qwen3.7-max
- Qwen-Turbo — Deprecated → migrate to qwen-flash
- Qwen3-235B-A22B (open weights) — Deprecated → migrate to qwen3.7-plus
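The migration targets listed above can be captured in a small lookup table. The following is a minimal Python sketch, not an official mapping: `MIGRATIONS` and `migration_target` are hypothetical names, and the keys are the display names used on this page rather than confirmed API identifiers.

```python
from typing import Optional

# Display name on this page -> suggested replacement (copied from the list above).
# Keys are display names, NOT verified API model IDs (assumption for illustration).
MIGRATIONS = {
    "Qwen3-Max": "qwen3.7-max",
    "Qwen3-Max-Preview": "qwen3.7-max",
    "Qwen3.6-Max-Preview": "qwen3.7-max",
    "Qwen-Turbo": "qwen-flash",
    "Qwen3-235B-A22B": "qwen3.7-plus",
}


def migration_target(model: str) -> Optional[str]:
    """Return the suggested replacement, or None if no migration is listed."""
    return MIGRATIONS.get(model)


print(migration_target("Qwen-Turbo"))   # qwen-flash
print(migration_target("Qwen3.7-Max"))  # None
```

Returning `None` for models without a listed migration lets calling code distinguish "no action needed" from "switch to the named replacement".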
All Alibaba models
| Model | Modalities | Status | Context (tokens) | Input ($/1M tokens) | Output ($/1M tokens) | Blended ($/1M tokens) | Cutoff |
|---|---|---|---|---|---|---|---|
| Qwen-Max (Qwen2.5-Max) | text | GA | 33K | $1.60 | $6.40 | $2.80 | — |
| Qwen3.7-Max | text, image, video | GA | 1M | $2.50 | $7.50 | $3.75 | — |
| Qwen3.7-Plus | text, image, video | GA | 1M | $0.40 | $1.60 | $0.70 | — |
| Qwen3.6-Flash | text, image, video | GA | 1M | $0.25 | $1.50 | $0.563 | — |
| Qwen3-Max | text | GA | 262K | $1.20 | $6.00 | $2.40 | 2025-06 |
| Qwen3.7-Plus (snapshot 2026-05-26) | text, image, video | GA | 1M | $0.40 | $1.60 | $0.70 | — |
| Qwen3.5-Plus | text, image | GA | 262K | $0.40 | $2.40 | $0.90 | — |
| Qwen3.6-Plus | text, image, video | GA | 1M | $0.50 | $3.00 | $1.13 | — |
| Qwen3.5-Flash | text, image | GA | 1M | $0.10 | $0.40 | $0.175 | — |
| Qwen-Plus (Qwen3-series) | text | GA | 1M | $0.40 | $1.20 | $0.60 | — |
| Qwen-Flash | text | GA | 1M | $0.05 | $0.40 | $0.138 | — |
| Qwen3.5-Omni-Plus | text, image, audio, video | GA | — | — | — | — | — |
| Qwen3-Rerank | text | GA | — | — | — | — | — |
| Qwen3.7-Max-Preview | text | Preview | 1M | $2.50 | $7.50 | $3.75 | — |
| Qwen3-Max-Preview | text | Deprecated | 262K | $1.20 | $6.00 | $2.40 | — |
| Qwen3.6-Max-Preview | text | Deprecated | 262K | $1.30 | $7.80 | $2.92 | — |
| Qwen-Turbo | text | Deprecated | 1M | $0.05 | $0.20 | $0.088 | — |
| Qwen3-235B-A22B (open weights) | text | Deprecated | — | — | — | — | — |
Blended = 0.75 × input + 0.25 × output, in $ per 1M tokens. It is a single-number cost proxy that assumes a 3:1 ratio of input to output tokens.
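As a worked example of the blended formula, here is a minimal Python sketch. The function name `blended_price` is an illustrative assumption; the weights and the sample prices are taken from this page.

```python
def blended_price(input_per_m: float, output_per_m: float) -> float:
    """Blended $ per 1M tokens: 0.75 x input + 0.25 x output."""
    return 0.75 * input_per_m + 0.25 * output_per_m


# Qwen3.7-Max: $2.50 input, $7.50 output per 1M tokens.
print(blended_price(2.50, 7.50))  # 3.75

# Qwen3.6-Plus: $0.50 input, $3 output per 1M tokens.
print(blended_price(0.50, 3.00))  # 1.125 (the table rounds this to $1.13)
```

A workload whose input-to-output ratio differs much from 3:1 would need different weights, so the blended figure is best read as a rough comparison aid.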
Alibaba pricing & models
What is the cheapest Alibaba model?
Qwen-Flash is the cheapest generally available Alibaba model we track, at $0.05 per 1M input tokens and $0.40 per 1M output tokens ($0.138 per 1M blended).
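This selection can be reproduced with a short Python sketch. The `Model` dataclass, the `cheapest_ga` helper, and the hand-copied subset of rows are illustrative assumptions; the prices come from the table on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    status: str          # "GA", "Preview", or "Deprecated"
    input_per_m: float   # $ per 1M input tokens
    output_per_m: float  # $ per 1M output tokens

    @property
    def blended(self) -> float:
        # Same weighting as the page: 0.75 x input + 0.25 x output.
        return 0.75 * self.input_per_m + 0.25 * self.output_per_m


# Hand-copied subset of the table (illustrative, not the full catalog).
MODELS = [
    Model("Qwen3.7-Max", "GA", 2.50, 7.50),
    Model("Qwen3.5-Flash", "GA", 0.10, 0.40),
    Model("Qwen-Flash", "GA", 0.05, 0.40),
    Model("Qwen-Turbo", "Deprecated", 0.05, 0.20),
]


def cheapest_ga(models: list) -> Model:
    """Return the GA model with the lowest blended price."""
    ga = [m for m in models if m.status == "GA"]
    return min(ga, key=lambda m: m.blended)


print(cheapest_ga(MODELS).name)  # Qwen-Flash
```

Qwen-Turbo has a lower blended price than Qwen-Flash, but the status filter excludes it because it is deprecated.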
What is Alibaba's flagship model?
Qwen-Max (Qwen2.5-Max) is Alibaba's most prominent model in our catalog, with a 33K-token context window and pricing of $1.60/$6.40 per 1M input/output tokens.
How many Alibaba models are there?
We track 18 Alibaba models: 13 are generally available, 1 is in preview, and 4 are deprecated. One generally available model, Qwen3-Max, is also scheduled for retirement.
Which Alibaba models are being deprecated?
Qwen3-Max-Preview and Qwen3.6-Max-Preview are deprecated and shut down on 8 Sep 2026. Qwen-Turbo and Qwen3-235B-A22B (open weights) are deprecated with no shutdown date listed. Qwen3-Max is still GA but is also scheduled to shut down on 8 Sep 2026.