Models / Alibaba

Qwen-Flash

GA

Legacy stable Flash alias = qwen-flash-2025-07-28; the recommended replacement for the discontinued Qwen-Turbo. Official International TIERED pricing: 0<token<=256K $0.05 in / $0.4 out; 256K<token<=1M $0.25 in / $2 out per 1M. Supports 50% batch-inference discount and context caching. Context window 1M. Max output not on accessible official page (null). Still GA on pricing page Jun 22, 2026. Text-only.

Provider
Alibaba
Status
GA
Input price
$0.05 / 1M tokens
Output price
$0.4 / 1M tokens
Cached input
Blended price
$0.138 / 1M tokens
Context window
1,000,000 tokens (1M)
Max output
Modality
text
Knowledge cutoff
Released
28 Jul 2025
API string
qwen-flash

Source: Alibaba official documentation ↗