DeepSeek-V4-Pro
Preview
The larger, most capable V4 model: about 1.6T total and about 49B active parameters, per the HuggingFace model card and authoritative third-party reports; MIT License; mixed FP4/FP8 precision. Context length is 1M tokens, with a maximum output of 384K tokens. Input is priced at $0.435 per million tokens on a cache miss and $0.003625 per million tokens on a cache hit; output is priced at $0.87 per million tokens (USD). Supports three reasoning-effort modes (non-think, think high, think max), JSON output, and tool calls; FIM completion is available in non-thinking mode only. Concurrency limit: 500. Part of the 'DeepSeek V4 Preview' generation (released 2026-04-24), hence status=preview. DeepSeek has not officially published a knowledge cutoff, so that field is left null. Pr
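To make the per-million-token prices above concrete, here is a minimal sketch of a cost estimate for a single request. The function name `estimate_cost_usd` and its parameters are hypothetical, chosen for illustration; only the three price constants come from this listing. It assumes billing is strictly linear in token counts, which the listing does not confirm.

```python
from decimal import Decimal

# USD per one million tokens, copied from the listing above.
PRICE_INPUT_CACHE_MISS = Decimal("0.435")
PRICE_INPUT_CACHE_HIT = Decimal("0.003625")
PRICE_OUTPUT = Decimal("0.87")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(cache_miss_tokens: int, cache_hit_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes purely linear per-token billing with no minimums or rounding,
    which is an assumption rather than a documented billing rule.
    """
    counts = (cache_miss_tokens, cache_hit_tokens, output_tokens)
    if any(n < 0 for n in counts):
        raise ValueError("token counts must be non-negative")

    # Sum price * tokens for each billing category, then scale to per-token.
    total = (
        PRICE_INPUT_CACHE_MISS * cache_miss_tokens
        + PRICE_INPUT_CACHE_HIT * cache_hit_tokens
        + PRICE_OUTPUT * output_tokens
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Example: 100K uncached input tokens, 900K cached input tokens, 50K output tokens.
    print(estimate_cost_usd(100_000, 900_000, 50_000))
```

`Decimal` is used instead of floats so that small cache-hit amounts are not distorted by binary rounding. At the listed prices, a cache-hit input token costs 1/120 of a cache-miss one, so the cached share of a prompt tends to dominate the input side of the estimate.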