#My transcripts between ElevenLabs agent and OpenAI realtime agent have some of the EL messages trun
1 messages · Page 1 of 1 (latest)
Short answer: the “...” you’re seeing in agent turns is expected when an agent response was interrupted. ElevenLabs emits an agent_response first, and if an interruption occurs, it follows with an agent_response_correction that truncates the text to what should be considered the final utterance; UI/history prefer the corrected (truncated) text even if some audio for the longer response was already streamed. This is why your locally captured audio can contain the full phrase while the transcript shows an ellipsis. See the agent_response_correction client event, which “contains truncated response after interruption.” (elevenlabs.io)
How to confirm it’s this case
- Inspect your WebSocket logs for agent_response_correction and interruption events during those turns. (elevenlabs.io)
- When fetching history, prefer GET /v1/convai/conversations/{conversation_id} (which returns the transcript array for that call) over list endpoints that only provide summaries. (elevenlabs.io)
Ways to retrieve the final, analyzed record
- Use the Post‑call Transcription webhook (post_call_transcription). It is sent only after analysis is complete and includes the “complete conversation transcript with turns” plus metadata. (elevenlabs.io)
- If you need exactly what was spoken (regardless of text truncation), fetch the call recording via GET /v1/convai/conversations/{conversation_id}/audio or enable the Post‑call Audio webhook (post_call_audio). (elevenlabs.io)
How to reduce/avoid truncation going forward
- Adjust Interruption handling in the agent’s settings (Advanced tab → Client Events). Disabling or relaxing interruptions prevents mid‑utterance cuts that trigger agent_response_correction. (elevenlabs.io)
Related note
- If Zero Retention Mode is enabled (per‑agent or workspace), transcripts may be unavailable by design; use post‑call webhooks in that case. (elevenlabs.io)
If you share a conversation_id for an affected call, you can check: (a) whether an agent_response_correction was emitted, and (b) compare the GET conversation transcript vs. the streamed audio to verify this behavior. (elevenlabs.io)
Sources:
Client events | ElevenLabs Documentation
Client events | ElevenLabs Documentation
Get conversation details | ElevenLabs Documentation
Post-call webhooks | ElevenLabs Documentation
Get conversation audio | ElevenLabs Documentation
Conversation flow | ElevenLabs Documentation
Zero Retention Mode (per-agent) | ElevenLabs Documentation
Get conversation details | ElevenLabs Documentation
Ok. I am struggling to understand why ElevenLabs audio logs seem to have all of my QA agent responses clustered at the start of the dialogue, overlapping with EL agent responses, despite them actually being correctly conditioned on the EL words. I.e. it seems like two audio streams are being incorrectly merged on the EL side. Can you search for similar issues and make hypothesis why this can be happening?