Models / OpenAI

GPT-Realtime-2

GA

Speech-to-speech realtime model. Input: text/audio/image; Output: text/audio. Prices listed are the TEXT-token rates ($4 in / $24 out / $0.40 cached). Audio tokens are billed separately at $32 input / $64 output per 1M; image input $5/1M. Max output 32K tokens.

Provider
OpenAI
Status
GA
Input price
$4 / 1M tokens
Output price
$24 / 1M tokens
Cached input
$0.4 / 1M tokens
Blended price
$9 / 1M tokens
Context window
128,000 tokens (128K)
Max output
32,000 tokens
Modality
text, audio, image
Knowledge cutoff
2024-09-30
Released
API string
gpt-realtime-2

Source: OpenAI official documentation ↗