OpenAI ships gpt-realtime and declares GA for the Realtime API. It's a speech-to-speech model that tightens instruction-following, steadies tool calling, and lifts voice fidelity. Latency drops. True realtime agents become possible. The release prescribes prompt skeletons, JSON envelope tool outputs, session.update flows, phonetic hints, and per-tool rules.










