Llama 3.2 90B Vision Instruct

Open-weight multimodal model (text and image), about 88.8B parameters, 128K-token context window. License: Llama 3.2 Community License. The details here come from the Llama 3.2 11B Vision model card, which also documents the 90B sibling (same architecture, same 128K context, same December 2023 knowledge cutoff). Image+text tasks are primarily English; text-only tasks additionally support German, French, Italian, Portuguese, Hindi, Spanish, and Thai. Per-token hosted pricing was not captured in this pass, so the price fields carry no value. Dedicated model card: huggingface.co/meta-llama/Llama-3.2-90B-Vision-Instruct.

Provider: Meta
Status: GA
Input price: not captured
Output price: not captured
Cached input: not captured
Blended price: not captured
Context window: 128,000 tokens (128K)
Max output: not listed
Modality: text, image
Knowledge cutoff: 2023-12
Released: 25 Sep 2024
API string: meta-llama/Llama-3.2-90B-Vision-Instruct
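
The limits on this card (128,000-token context window, primarily English for image+text, eight languages for text-only) can be turned into a simple pre-flight check before a request is sent. The following is a minimal sketch, not part of any Meta or Hugging Face API: the function name check_request, the constant names, and the two-letter language codes are hypothetical, and it assumes that generated tokens share the context window with the prompt and that the caller has already counted any image tokens into prompt_tokens.

```python
# Values copied from this card; all names below are illustrative only.
MODEL_ID = "meta-llama/Llama-3.2-90B-Vision-Instruct"
CONTEXT_WINDOW_TOKENS = 128_000

# Text-only prompts: English plus the seven additional languages on the card.
TEXT_ONLY_LANGUAGES = {"en", "de", "fr", "it", "pt", "hi", "es", "th"}
# Image+text prompts: the card describes these as English-primary.
IMAGE_TEXT_LANGUAGES = {"en"}


def check_request(prompt_tokens, max_new_tokens, language, has_image):
    """Return a list of warnings; an empty list means nothing was flagged."""
    warnings = []

    # Assumption: prompt and generated tokens must fit in one shared window.
    if prompt_tokens + max_new_tokens > CONTEXT_WINDOW_TOKENS:
        warnings.append(
            "prompt plus requested output exceeds the "
            f"{CONTEXT_WINDOW_TOKENS:,}-token context window"
        )

    # Pick the language set that matches the request's modality.
    allowed = IMAGE_TEXT_LANGUAGES if has_image else TEXT_ONLY_LANGUAGES
    if language not in allowed:
        mode = "image+text" if has_image else "text-only"
        warnings.append(f"language '{language}' is not listed for {mode} tasks")

    return warnings


if __name__ == "__main__":
    # A German prompt with an image is flagged; the same prompt without one is not.
    print(check_request(1_000, 500, "de", has_image=True))
    print(check_request(1_000, 500, "de", has_image=False))
```

A warning from this check is advisory: the card lists documented language support, and a language outside those lists may still produce output of unverified quality.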

Source: Meta official documentation