type field when creating a call.
FIXED_TTS
Convert text to speech and play it during the call. The simplest call type — just provide a message and a voice.FIXED_AUDIO
Play a pre-recorded audio file (Base64 encoded MP3 or WAV). Best when you need a polished, branded voice message.DYNAMIC_STREAM
Stream real-time PCM or MP3 audio frames over a WebSocket. Use this when you want full control over the audio pipeline — connect your own TTS, translation layer, or live human agent.GET /v1/calls/:callId/stream:
- Client → Server: Send PCM or MP3 audio frames
- Server → Client: Receive real-time transcripts and call events
WebSocket streaming is currently in preview. See Stream endpoint for details.
DYNAMIC_AGENT
Deploy an AI agent that conducts the call autonomously based on natural-language instructions. The agent handles the conversation flow without any manual input.Comparison
| Feature | FIXED_TTS | FIXED_AUDIO | DYNAMIC_STREAM | DYNAMIC_AGENT |
|---|---|---|---|---|
| Pre-written message | ✓ | ✓ | — | — |
| Custom audio file | — | ✓ | — | — |
| Real-time audio control | — | — | ✓ | — |
| AI-driven conversation | — | — | — | ✓ |
| Requires WebSocket | — | — | ✓ | — |
Start a call
POST /v1/calls — full request reference