Models / Alibaba

Qwen3.6-Flash

GA

Most cost-effective current Flash model, alias = qwen3.6-flash-2026-04-16; native vision-language (multimodal text/image/video). Official International TIERED pricing: tier1 0<token<=256K: $0.25 in / $1.5 out; tier2 256K<token<=1M: $1 in / $4 out. Supports 50% batch-inference discount and context caching. Has an open-weight sibling listed: qwen3.6-35b-a3b (per official release notes, the qwen3.6-flash entry groups qwen3.6-flash / qwen3.6-flash-2026-04-16 / qwen3.6-35b-a3b). Context window 1M (tier to 1M). Max output not published on accessible official page (null). Released 2026-04-16 (Interna

Provider
Alibaba
Status
GA
Input price
$0.25 / 1M tokens
Output price
$1.50 / 1M tokens
Cached input
Blended price
$0.563 / 1M tokens
Context window
1,000,000 tokens (1M)
Max output
Modality
text, image, video
Knowledge cutoff
Released
16 Apr 2026
API string
qwen3.6-flash

Source: Alibaba official documentation ↗