Class ElevenLabsVoiceAgent

Composable voice agent with ElevenLabs-opinionated defaults.

Not to be confused with ElevenLabsAgentAdapter (above) which talks to ElevenLabs' hosted ConvAI endpoint. This class is local: you compose ElevenLabsSTTProvider + any LLM + ElevenLabs TTS yourself.

Default stack:

STT: ElevenLabsSTTProvider with the same API key.
LLM: openai("gpt-5.4-mini") — text-only chat completion.
TTS: elevenlabs/EXAVITQu4vr4xnSDxMaL (Sarah — free-tier premade). Override via the ELEVENLABS_VOICE_ID env var or the voice arg.

Example

// Defaults — all ElevenLabs STT, gpt-5.4-mini, EL TTS
const agent = new ElevenLabsVoiceAgent({ apiKey: process.env.ELEVENLABS_API_KEY! });

// Override just the LLM
import { anthropic } from "@ai-sdk/anthropic";
const agent = new ElevenLabsVoiceAgent({ apiKey, llm: anthropic("claude-sonnet-4-6") });

// Bring your own STT
const agent = new ElevenLabsVoiceAgent({ apiKey, stt: new MyCustomSTT() });

Hierarchy (View Summary)

ComposableVoiceAgent
- ElevenLabsVoiceAgent

Constructors

constructor

new ElevenLabsVoiceAgent(
options: ElevenLabsVoiceAgentOptions,
): ElevenLabsVoiceAgent
Parameters
- options: ElevenLabsVoiceAgentOptions
Returns ElevenLabsVoiceAgent
Overrides ComposableVoiceAgent.constructor
- Defined in work/scenario/scenario/javascript/src/voice/adapters/elevenlabs.ts:427

Properties

`Optional`agentSpeakingEvent

agentSpeakingEvent?: AgentSpeakingEvent

Set when the adapter has emitted its first agent audio chunk for the current turn — gates timing-based barge-in. Concrete adapters expose this so scenario.interrupt can wait for real speech before firing the interruption. Optional: adapters without server-VAD-style interrupt sequencing can leave it undefined.

`Readonly`capabilities

capabilities: AdapterCapabilities = ...

Declaration of what this adapter can and cannot do. Concrete subclasses MUST publish a non-default value; the base instance defaults to "nothing supported" so capability-gated steps fail safely when an adapter forgets to declare.

`Protected` `Readonly`history

history: ModelMessage[]

lastLlmResponse

lastLlmResponse: string | null = null

lastUserTranscript

lastUserTranscript: string | null = null

`Readonly`llm

llm: LanguageModel

`Optional`name

name?: string

responseMaxDuration

responseMaxDuration: number = 30.0

Hard cap on a single agent turn's audio. Prevents runaway loops if a transport never signals end-of-stream. 30s = a long sentence.

responseTailSilence

responseTailSilence: number = 0.6

Tail silence: once the first agent chunk arrives, keep draining receiveAudio until no chunk shows up within this many seconds — that's how we detect the agent finished talking.

responseTimeout

responseTimeout: number = 30.0

Seconds to wait for agent audio after sending user audio.

role

role: AgentRole = AgentRole.AGENT

`Optional`streamingTranscript

streamingTranscript?: string

Incremental transcript text emitted while the agent speaks. Populated by adapters that advertise capabilities.streamingTranscripts. Read by scenario.interrupt when afterWords: N is set.

`Readonly`stt

stt: STTProvider

`Readonly`tts

tts: string

`Protected` `Readonly`ttsOptions

ttsOptions: SynthesizeOptions

`Protected`turnOutputEmitted

turnOutputEmitted: boolean = false

Turn-output guard. The default call() drains receiveAudio until tail-silence; on this adapter that would kick a second LLM call. Reset by sendAudio (new user turn → new LLM call allowed), set by the end of receiveAudio.

`Readonly`voice

voice: string

`Static` `Readonly`DEFAULT_SYSTEM_PROMPT

DEFAULT_SYSTEM_PROMPT: string = ...

Methods

call

call(input: AgentInput): Promise<AgentReturnTypes>
Default call() body, ported from Python VoiceAgentAdapter.call.

Threads the latest user-message audio through sendAudio, drains the agent response on tail silence, records one user and one agent segment into the executor state, and returns the merged assistant audio message. Subclasses may override for specialised flows but will usually inherit it.
Parameters
- input: AgentInput
Returns Promise<AgentReturnTypes>
Inherited from ComposableVoiceAgent.call
- Defined in work/scenario/scenario/javascript/src/voice/adapter.ts:67

connect

connect(): Promise<void>
Open the transport and prepare to exchange audio.

Returns Promise<void>
Inherited from ComposableVoiceAgent.connect
- Defined in work/scenario/scenario/javascript/src/voice/adapters/composable.ts:173

disconnect

disconnect(): Promise<void>
Close the transport and release resources.

Returns Promise<void>
Inherited from ComposableVoiceAgent.disconnect
- Defined in work/scenario/scenario/javascript/src/voice/adapters/composable.ts:177

interrupt

interrupt(): Promise<void>
Send a first-class interrupt signal to the agent under test.

Adapters that advertise capabilities.interruption === true override this to send the transport-native interrupt (e.g. Twilio clear, OpenAI Realtime response.cancel). The default raises UnsupportedCapabilityError; callers (scenario.interrupt()) check capabilities.interruption and fall back to timing-based barge-in when this returns false.

Returns Promise<void>
Inherited from ComposableVoiceAgent.interrupt
- Defined in work/scenario/scenario/javascript/src/voice/adapter.ts:158

isConnected

isConnected(): boolean
Whether the transport is currently open and ready to exchange audio (Gap #11). The default call flow (defaultVoiceCall) consults this BEFORE sending audio and raises PendingTransportError uniformly when it returns false — so a call() issued before the executor's connect() fails with one clear error across every transport instead of a transport-specific null-dereference or silent hang.

Base default is true: adapters with no meaningful "not connected" state (in-process composable, test doubles) never trip the gate. Network transport leaves override this to report their real socket/session state.

Returns boolean
Inherited from ComposableVoiceAgent.isConnected
- Defined in work/scenario/scenario/javascript/src/voice/adapter.ts:105

receiveAudio

receiveAudio(timeout: number): Promise<AudioChunk>
Receive the next AudioChunk from the agent.
Parameters
- timeout: number
Returns Promise<AudioChunk>
Inherited from ComposableVoiceAgent.receiveAudio
- Defined in work/scenario/scenario/javascript/src/voice/adapters/composable.ts:188

sendAudio

sendAudio(chunk: AudioChunk): Promise<void>
Transmit an AudioChunk to the agent under test.
Parameters
- chunk: AudioChunk
Returns Promise<void>
Inherited from ComposableVoiceAgent.sendAudio
- Defined in work/scenario/scenario/javascript/src/voice/adapters/composable.ts:181

sendDtmf

sendDtmf(_tones: string): Promise<void>
Transmit DTMF tones to the telephony peer. Adapters that advertise capabilities.dtmf MUST implement this; the default raises UnsupportedCapabilityError so an adapter that forgot to ship sendDtmf while claiming the capability fails loudly instead of silently routing through a PCM fallback.
Parameters
- _tones: string
Returns Promise<void>
Inherited from ComposableVoiceAgent.sendDtmf
- Defined in work/scenario/scenario/javascript/src/voice/adapter.ts:138

toString

toString(): string
Returns a string representation of an object.

Returns string
Overrides ComposableVoiceAgent.toString
- Defined in work/scenario/scenario/javascript/src/voice/adapters/elevenlabs.ts:447

Class ElevenLabsVoiceAgent

Example

Hierarchy (View Summary)

Index

Constructors

Properties

Methods

Constructors

constructor

Parameters

Returns ElevenLabsVoiceAgent

Properties

OptionalagentSpeakingEvent

Readonlycapabilities

Protected Readonlyhistory

lastLlmResponse

lastUserTranscript

Readonlyllm

Optionalname

responseMaxDuration

responseTailSilence

responseTimeout

role

OptionalstreamingTranscript

Readonlystt

Readonlytts

Protected ReadonlyttsOptions

ProtectedturnOutputEmitted

Readonlyvoice

Static ReadonlyDEFAULT_SYSTEM_PROMPT

Methods

call

Parameters

Returns Promise<AgentReturnTypes>

connect

Returns Promise<void>

disconnect

Returns Promise<void>

interrupt

Returns Promise<void>

isConnected

Returns boolean

receiveAudio

Parameters

Returns Promise<AudioChunk>

sendAudio

Parameters

Returns Promise<void>

sendDtmf

Parameters

Returns Promise<void>

toString

Returns string

Settings

On This Page

`Optional`agentSpeakingEvent

`Readonly`capabilities

`Protected` `Readonly`history

`Readonly`llm

`Optional`name

`Optional`streamingTranscript

`Readonly`stt

`Readonly`tts

`Protected` `Readonly`ttsOptions

`Protected`turnOutputEmitted

`Readonly`voice

`Static` `Readonly`DEFAULT_SYSTEM_PROMPT