OpenAI's WebSocket mode: persistent connections for agentic workloads OpenAI's Responses API now supports WebSocket transport. For tool-call-heavy agent loops, it cuts end-to-end latency by up to 40%. 4 Mar 2026 · 6 min read aiopenaiwebsockets