GPT-Realtime-2 is OpenAI's first realtime voice model with GPT-5-class reasoning, released May 7, 2026 in the OpenAI Realtime API.
GPT-Realtime-2 is OpenAI's first realtime voice model with GPT-5-class reasoning, released May 7, 2026 in the OpenAI Realtime API. It is a major upgrade to the prior GPT-Realtime line, designed to handle harder requests and carry the conversation forward naturally — calling tools, handling corrections or interruptions, and responding in a way that fits the moment.
The model headlines OpenAI's "advancing voice intelligence" launch alongside [[GPT-Realtime-Translate]] (live speech translation) and GPT-Realtime-Whisper (low-latency transcription). The product framing: voice is now the surface where OpenAI competes to convert frontier reasoning into real-world agentic workflows.
gpt-realtime-2 (Realtime API)GPT-5-Class Reasoning In-Voice: First voice model to inherit reasoning capability from the GPT-5 family, enabling multi-step tool use, plan adjustment, and live correction within a continuous voice conversation.
128K Context Window: 4× the prior generation's context, supporting longer conversations and more complex multi-turn workflows without losing earlier state.
Continuous Conversation: The model keeps the conversation moving while it reasons through a request, calls tools, handles corrections or interruptions, and produces responses that fit the conversational moment — closing the gap between "voice assistant" and "voice-native agent."
Tool Use During Voice Turn: Supports tool calls inline with speech generation, enabling agentic actions (lookups, transactions, MCP-style integrations) without breaking conversational flow.
May 11, 2026