Models / Alibaba

Qwen3.6-Flash

GA

Most cost-effective current Flash model, alias = qwen3.6-flash-2026-04-16; native vision-language (multimodal text/image/video). Official International TIERED pricing: tier1 0<token<=256K: $0.25 in / $1.5 out; tier2 256K<token<=1M: $1 in / $4 out. Supports 50% batch-inference discount and context caching. Has an open-weight sibling listed: qwen3.6-35b-a3b (per official release notes, the qwen3.6-flash entry groups qwen3.6-flash / qwen3.6-flash-2026-04-16 / qwen3.6-35b-a3b). Context window 1M (tier to 1M). Max output not published on accessible official page (null). Released 2026-04-16 (Interna

Provider

Alibaba

Status

GA

Input price

$0.25 / 1M tokens

Output price

$1.50 / 1M tokens

Cached input

—

Blended price

$0.563 / 1M tokens

Context window

1,000,000 tokens (1M)

Max output

—

Modality

text, image, video

Knowledge cutoff

—

Released

16 Apr 2026

API string

qwen3.6-flash

Source: Alibaba official documentation ↗

Compare Qwen3.6-Flash with…

Qwen3.6-Flash vs Claude Opus 4.8→

$0.563 vs $10 blended /M

Qwen3.6-Flash vs Claude Opus 4.7→

$0.563 vs $10 blended /M

Qwen3.6-Flash vs Claude Opus 4.6→

$0.563 vs $10 blended /M

Qwen3.6-Flash vs Claude Opus 4.5→

$0.563 vs $10 blended /M

Qwen3.6-Flash

Compare Qwen3.6-Flash with…

Track Qwen3.6-Flash price & status changes