Models / Meta

Llama 3.2 11B Vision Instruct

GA

Open-weight multimodal (text+image), 10.6B params, 128k context. License: Llama 3.2 Community License. Image+text tasks English-primary; text-only adds German, French, Italian, Portuguese, Hindi, Spanish, Thai. Per-token hosted price not separately captured this pass (null).

Provider
Meta
Status
GA
Input price
Output price
Cached input
Blended price
Context window
128,000 tokens (128K)
Max output
Modality
text, image
Knowledge cutoff
2023-12
Released
25 Sep 2024
API string
meta-llama/Llama-3.2-11B-Vision-Instruct

Source: Meta official documentation ↗