action.start
Start a live translation session.
An object that accepts the start parameters.
start Parameters
The webhook URI to be called. Authentication can also be set in the url in the format of username:password@url.
enThe language to translate from.
Learn more about our supported Voices & Languages here.
enThe language to translate to.
Learn more about our supported Voices & Languages here.
elevenlabs.joshThe TTS voice you want to use for the source language.
Learn more about our supported Voices & Languages here.
elevenlabs.joshThe TTS voice you want to use for the target language.
Learn more about our supported Voices & Languages here.
Translation filter to apply to the source language direction. Adjusts the tone or style of translated speech.
Preset Values:
polite- Translates to a polite version, removing anything insulting while maintaining sentimentrude- Translates to a rude and insulting version while maintaining sentimentprofessional- Translates to sound professional, removing slang or lingoshakespeare- Translates to sound like Shakespeare, speaking in iambic pentametergen-z- Translates to use Gen-Z slang and expressions
Custom: Use prompt: prefix for custom instructions (e.g., prompt:Use formal business language).
Translation filter to apply to the target language direction. Adjusts the tone or style of translated speech.
Preset Values:
polite- Translates to a polite version, removing anything insulting while maintaining sentimentrude- Translates to a rude and insulting version while maintaining sentimentprofessional- Translates to sound professional, removing slang or lingoshakespeare- Translates to sound like Shakespeare, speaking in iambic pentametergen-z- Translates to use Gen-Z slang and expressions
Custom: Use prompt: prefix for custom instructions.
falseWhether to enable automatic AI summarization. When enabled, AI-generated summaries in both languages will be sent to your webhook when the translation session ends.
60000The timeout for speech recognition.
Possible Values: [Minimum value: 1500, Maximum Value: None]
300 | 500Voice activity detection silence time in milliseconds.
Default depends on the speech engine: 300 for Deepgram, 500 for Google.
Possible Values: [Minimum value: 1, Maximum Value: None]
400Voice activity detection threshold.
Possible Values: [Minimum value: 0, Maximum Value: 1800]
local-callerThe direction of the call that should be translated.
Possible Values: [remote-caller, local-caller]
deepgramThe speech recognition engine to use.
Possible Values: [deepgram, google]
The AI prompt that instructs how to summarize the conversation when ai_summary is enabled.
This prompt is sent to an AI model to guide how it generates the summary.
Example
- YAML
- JSON
live_translate:
action:
start:
webhook: 'https://example.com/webhook'
from_lang: en-US
to_lang: es-ES
from_voice: elevenlabs.josh
to_voice: elevenlabs.josh
filter_from: professional
live_events: true
ai_summary: true
ai_summary_prompt: Summarize this conversation
speech_timeout: 60000
vad_silence_ms: 500
vad_thresh: 400
debug_level: 0
direction:
- remote-caller
- local-caller
speech_engine: deepgram
{
"live_translate": {
"action": {
"start": {
"webhook": "https://example.com/webhook",
"from_lang": "en-US",
"to_lang": "es-ES",
"from_voice": "elevenlabs.josh",
"to_voice": "elevenlabs.josh",
"filter_from": "professional",
"live_events": true,
"ai_summary": true,
"ai_summary_prompt": "Summarize this conversation",
"speech_timeout": 60000,
"vad_silence_ms": 500,
"vad_thresh": 400,
"debug_level": 0,
"direction": [
"remote-caller",
"local-caller"
],
"speech_engine": "deepgram"
}
}
}
}