Llama 4 Maverick (17B-128E Instruct) vs Qwen-Max (Qwen2.5-Max)

Llama 4 Maverick (17B-128E Instruct) is about 10.7× cheaper than Qwen-Max (Qwen2.5-Max) on blended token cost ($0.262 vs $2.80 per 1M).

SpecLlama 4 Maverick (17B-128E Instruct)Qwen-Max (Qwen2.5-Max)
ProviderMetaAlibaba
StatusGAGA
Input $/1M$0.15$1.60
Output $/1M$0.6$6.40
Blended $/1M$0.262$2.80
Context1M33K
Max output8K
Cutoff2024-08