Models / Alibaba

Qwen3.7-Max

GA

Flagship Max model, API-only/proprietary (no open weights). Official International (Singapore) list price $2.5 in / $7.5 out per 1M tokens, single tier 0<token<=1M, Non-Thinking and Thinking modes; alias currently = qwen3.7-max-2026-05-20 (the 2026-06-08 snapshot added visual-modal understanding -> text+image+video). Thinking enabled by default; supports explicit/context cache (caching priced as a discount, no separate cached-input column published) and Function Calling. Max output: official Alibaba blog/config shows 65536; a benchmark footnote on the same blog mentions max_tokens=80K (unconfi

Provider
Alibaba
Status
GA
Input price
$2.50 / 1M tokens
Output price
$7.50 / 1M tokens
Cached input
Blended price
$3.75 / 1M tokens
Context window
1,000,000 tokens (1M)
Max output
65,536 tokens
Modality
text, image, video
Knowledge cutoff
Released
21 May 2026
API string
qwen3.7-max

Source: Alibaba official documentation ↗