GPT-Realtime-Translate is OpenAI's purpose-built live speech translation model in the Realtime API, released May 7, 2026 alongside GPT-Realtime-2 and GPT-Realtime-Whisper.
GPT-Realtime-Translate is OpenAI's purpose-built live speech translation model in the Realtime API, released May 7, 2026 alongside GPT-Realtime-2 and GPT-Realtime-Whisper. It translates speech from 70+ input languages into 13 output languages while keeping pace with the speaker, intentionally constrained to the translation task — it will not respond conversationally and will not summarize.
The model was trained on thousands of hours of professional interpreter audio, which helps it (a) remain translation-only and (b) wait for enough context before producing speech, mimicking the rhythm of human simultaneous interpretation.
gpt-realtime-translate (Realtime API)Live Simultaneous Interpretation Pace: Trained on professional human interpreter audio so it waits for enough source-language context before speaking — closer to human conference interpretation than to lagged dub-style translation.
Translation-Only Behavior: Hard-constrained to translation. It will not respond conversationally to the speaker, summarize, or volunteer information. This is deliberate — production live-translation workflows require deterministic behavior.
Per-Minute Pricing: At $0.034/minute, OpenAI is pricing aggressively against existing dedicated speech translation services (Deepgram, AssemblyAI, Microsoft, Google) — particularly for bidirectional or multi-party meetings where prior solutions ran on per-character or per-second tariffs.
May 11, 2026