LiveTranslate Python SDK

Call Qwen-LiveTranslate with the DashScope Python SDK for real-time speech translation.

Prerequisites
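Install the DashScope Python SDK (pip install dashscope) and, for the microphone example at the end of this page, PyAudio (pip install pyaudio). You also need a valid API key: either pass it through the api_key constructor parameter or set the DASHSCOPE_API_KEY environment variable, which the SDK reads by default.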

Request parameters

Set these in the OmniRealtimeConversation constructor. The example below defines a callback class first, then constructs the conversation:
from dashscope.audio.qwen_omni import (
  OmniRealtimeConversation,
  OmniRealtimeCallback,
  MultiModality,
)
from dashscope.audio.qwen_omni.omni_realtime import TranslationParams


class MyCallback(OmniRealtimeCallback):
  """Callback handler for real-time translation"""
  def __init__(self, conversation=None):
    self.conversation = conversation
    self.handlers = {
      'session.created': self._handle_session_created,
      'response.audio_transcript.done': self._handle_translation_done,
      'response.audio.delta': self._handle_audio_delta,
      'response.done': lambda r: print('======Response Done======'),
      'input_audio_buffer.speech_started': lambda r: print('======Speech Start======'),
      'input_audio_buffer.speech_stopped': lambda r: print('======Speech Stop======'),
    }

  def on_open(self):
    print('Connection opened')

  def on_close(self, code, msg):
    print(f'Connection closed, code: {code}, msg: {msg}')

  def on_event(self, response):
    try:
      handler = self.handlers.get(response['type'])
      if handler:
        handler(response)
    except Exception as e:
      print(f'[Error] {e}')

  def _handle_session_created(self, response):
    print(f"Session created: {response['session']['id']}")

  def _handle_translation_done(self, response):
    print(f"Translation result: {response['transcript']}")

  def _handle_audio_delta(self, response):
    # Incremental Base64-encoded audio chunk. Decode it for playback
    # or to save it to a file; the complete example below plays it back.
    audio_b64 = response.get('delta', '')

conversation = OmniRealtimeConversation(
  model='qwen3-livetranslate-flash-realtime',
  url='wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime',
  callback=MyCallback(conversation=None)  # Temporarily pass None. It will be injected later.
)
# Give the callback a reference to the conversation.
conversation.callback.conversation = conversation

Parameter | Type | Required | Description
model | str | Yes | Model name. Set to qwen3-livetranslate-flash-realtime.
callback | OmniRealtimeCallback | Yes | Callback object that handles server events.
url | str | No | Service endpoint, for example wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime. Defaults to the DashScope endpoint.
api_key | str | No | API key for authentication. If this parameter is not provided, the SDK uses the DASHSCOPE_API_KEY environment variable.
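If you do not want to rely on the DASHSCOPE_API_KEY environment variable, pass the key explicitly through api_key. A minimal sketch (the key value is a placeholder; MyCallback is the class defined above):

conversation = OmniRealtimeConversation(
  model='qwen3-livetranslate-flash-realtime',
  url='wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime',
  api_key='sk-xxx',  # Placeholder: your DashScope API key.
  callback=MyCallback(conversation=None),
)
conversation.callback.conversation = conversation
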
Set these with OmniRealtimeConversation.update_session:
# Set translation parameters
translation_params = TranslationParams(
  language='en',  # Target language
  corpus=TranslationParams.Corpus(
    phrases={
      'Inteligencia Artificial': 'Artificial Intelligence',
      'Aprendizaje Automático': 'Machine Learning'
    }
  )
)

# Update session configuration
conversation.update_session(
  output_modalities=[MultiModality.TEXT, MultiModality.AUDIO],
  voice='Cherry',
  translation_params=translation_params,
)

Parameter | Type | Required | Description
output_modalities | List[MultiModality] | No | Output types. Default: [MultiModality.TEXT, MultiModality.AUDIO]. Valid values: [MultiModality.TEXT] (text only) or [MultiModality.TEXT, MultiModality.AUDIO] (text and audio).
voice | str | No | Voice for audio output. Default: Cherry. See Supported voices.
input_audio_transcription_model | str | No | Set to qwen3-asr-flash-realtime to get speech recognition results for the source language.
translation_params | TranslationParams | No | Translation settings.
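For example, a text-only session that also returns the source-language transcript could be configured like this (a sketch using only the parameters documented above):

conversation.update_session(
  output_modalities=[MultiModality.TEXT],  # Translated text only, no synthesized audio.
  input_audio_transcription_model='qwen3-asr-flash-realtime',  # Also emit source-language ASR events.
  translation_params=TranslationParams(language='en'),
)
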
Set these in the TranslationParams constructor:
translation_params = TranslationParams(
  language='en',  # Target language code
  corpus=TranslationParams.Corpus(
    phrases={
      'Inteligencia Artificial': 'Artificial Intelligence',  # Source phrase: Target translation
      'Aprendizaje Automático': 'Machine Learning'
    }
  )
)

Parameter | Type | Required | Description
language | str | No | Target language code. Default: en. See Supported languages.
corpus | TranslationParams.Corpus | No | Hotword settings that improve accuracy for specific terms.
corpus.phrases | dict | No | Hotword map (key: source term, value: target translation). Example: {'Inteligencia Artificial': 'Artificial Intelligence'}
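For a larger glossary, the hotword map can be loaded from a file instead of being hard-coded. A hypothetical helper (glossary.json and its format are assumptions, not part of the SDK):

import json

def load_phrases(path: str) -> dict:
  # Expects a JSON object that maps source terms to target translations,
  # e.g. {"Inteligencia Artificial": "Artificial Intelligence"}.
  with open(path, encoding='utf-8') as f:
    return json.load(f)

translation_params = TranslationParams(
  language='en',
  corpus=TranslationParams.Corpus(phrases=load_phrases('glossary.json')),  # Hypothetical file.
)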

Key interfaces

OmniRealtimeConversation class

Import: from dashscope.audio.qwen_omni import OmniRealtimeConversation

Method signature | Server event (via callback) | Description
def connect(self) -> None | Session created; session config updated | Connects to the server.
def update_session(self, output_modalities: List[MultiModality], voice: str = None, translation_params: TranslationParams = None, **kwargs) -> None | Session config updated | Updates session settings. Call it right after connecting; if not called, the defaults apply. See the update_session parameters above.
def end_session(self, timeout: int = 20) -> None | session.finished (the server finishes translation and ends the session) | Ends the session. The server finishes any remaining translation before closing.
def append_audio(self, audio_b64: str) -> None | None | Sends Base64-encoded audio to the input buffer. The server auto-detects speech boundaries and triggers translation.
def close(self) -> None | None | Stops the task and closes the connection.
def get_session_id(self) -> str | None | Returns the current session ID.
def get_last_response_id(self) -> str | None | Returns the last response ID.
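Taken together, a typical lifecycle is connect, update_session, repeated append_audio calls, then end_session and close. A minimal sketch that streams a local raw PCM file (input.pcm is a placeholder; 16 kHz, 16-bit mono audio is assumed, matching the microphone settings in the complete example; conversation, MultiModality, and TranslationParams come from the setup above):

import base64

conversation.connect()
conversation.update_session(
  output_modalities=[MultiModality.TEXT],
  translation_params=TranslationParams(language='en'),
)

with open('input.pcm', 'rb') as f:  # Placeholder: raw 16 kHz, 16-bit mono PCM.
  while chunk := f.read(3200):  # About 100 ms of audio per chunk.
    conversation.append_audio(base64.b64encode(chunk).decode('ascii'))

conversation.end_session()  # Let the server finish any in-flight translation.
conversation.close()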

Callback interface (OmniRealtimeCallback)

The server sends events to the client through callbacks. Inherit this class and implement its methods to handle them. Import: from dashscope.audio.qwen_omni import OmniRealtimeCallback

Method signature | Parameters | Description
def on_open(self) -> None | None | Called when the WebSocket connection opens.
def on_event(self, message: dict) -> None | message: the server event | Called when a server event arrives.
def on_close(self, close_status_code, close_msg) -> None | close_status_code: status code; close_msg: log message | Called when the WebSocket connection closes.
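For instance, instead of playing translated audio, a callback can write it to disk. A sketch that appends response.audio.delta chunks to a WAV file (output.wav is a placeholder; the 24 kHz, 16-bit mono format is an assumption that matches the speaker settings in the complete example below):

import base64
import wave

from dashscope.audio.qwen_omni import OmniRealtimeCallback

class WavFileCallback(OmniRealtimeCallback):
  """Writes translated audio deltas to a WAV file."""

  def __init__(self, path='output.wav'):  # Placeholder output path.
    self.wav = wave.open(path, 'wb')
    self.wav.setnchannels(1)      # Mono.
    self.wav.setsampwidth(2)      # 16-bit samples.
    self.wav.setframerate(24000)  # Assumed output sample rate.

  def on_open(self):
    print('Connection opened')

  def on_event(self, response):
    if response.get('type') == 'response.audio.delta':
      self.wav.writeframes(base64.b64decode(response.get('delta', '')))

  def on_close(self, code, msg):
    self.wav.close()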

Complete example

Record and translate audio from a microphone in real time:
import os
import sys
import base64
import signal
import pyaudio
from dashscope.audio.qwen_omni import (
  OmniRealtimeConversation,
  OmniRealtimeCallback,
  MultiModality,
)
from dashscope.audio.qwen_omni.omni_realtime import TranslationParams


class Callback(OmniRealtimeCallback):
  """Callback handler class for real-time translation"""

  def __init__(self, speaker):
    self.speaker = speaker

  def on_open(self):
    print("[Connection established]")

  def on_close(self, code, msg):
    print(f"[Connection closed] code: {code}, msg: {msg}")

  def on_event(self, response):
    event_type = response.get("type", "")
    if event_type == "input_audio_buffer.speech_started":
      print("====== Speech input detected ======")
    elif event_type == "input_audio_buffer.speech_stopped":
      print("====== Speech input ended ======")
    elif event_type == "conversation.item.input_audio_transcription.text":
      # text: confirmed text, stash: temporary text being processed
      print(f"[Original text] {response.get('text', '')}{response.get('stash', '')}")
    elif event_type == "response.audio_transcript.text":
      # text: confirmed text, stash: temporary text being processed
      print(f"[Translation result] {response.get('text', '')}{response.get('stash', '')}")
    elif event_type == "response.audio.delta":
      audio_b64 = response.get("delta", "")
      if audio_b64:
        self.speaker.write(base64.b64decode(audio_b64))
    elif event_type == "error":
      print(f"[Error] {response.get('error', {}).get('message', '')}")


def main():
  # Check for the API key.
  if not os.environ.get("DASHSCOPE_API_KEY"):
    print("Set the DASHSCOPE_API_KEY environment variable.")
    sys.exit(1)

  # Initialize PyAudio.
  pya = pyaudio.PyAudio()

  # Initialize the speaker for playing back the translated audio.
  speaker = pya.open(
    format=pyaudio.paInt16,
    channels=1,
    rate=24000,
    output=True,
    frames_per_buffer=2400
  )

  # Initialize the microphone for capturing speech input.
  mic = pya.open(
    format=pyaudio.paInt16,
    channels=1,
    rate=16000,
    input=True,
    frames_per_buffer=1600
  )

  # Create a callback instance.
  callback = Callback(speaker=speaker)

  # Create a real-time session.
  conversation = OmniRealtimeConversation(
    model="qwen3-livetranslate-flash-realtime",
    url="wss://dashscope-intl.aliyuncs.com/api-ws/v1/realtime",
    callback=callback
  )

  # Connect to the server.
  conversation.connect()

  # Configure translation parameters.
  translation_params = TranslationParams(
    language="en",  # Target language for translation: English
    corpus=TranslationParams.Corpus(
      phrases={
        "Source Term 1": "Target Translation 1",
        "Source Term 2": "Target Translation 2"
      }
    )
  )

  # Update the session configuration.
  conversation.update_session(
    output_modalities=[MultiModality.TEXT, MultiModality.AUDIO],
    input_audio_transcription_model="qwen3-asr-flash-realtime",
    voice="Cherry",
    translation_params=translation_params,
  )

  # Register the exit signal handler.
  def on_exit(sig, frame):
    print("\n[Exiting...]")
    mic.stop_stream()
    mic.close()
    speaker.stop_stream()
    speaker.close()
    pya.terminate()
    conversation.close()
    sys.exit(0)

  signal.signal(signal.SIGINT, on_exit)

  print("[Starting real-time translation] Speak into the microphone. Press Ctrl+C to exit.")

  # Continuously capture and send microphone audio.
  while True:
    audio_data = mic.read(1600, exception_on_overflow=False)
    conversation.append_audio(base64.b64encode(audio_data).decode("ascii"))


if __name__ == "__main__":
  main()
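
When you run the script with a valid DASHSCOPE_API_KEY, the console prints the speech start and stop markers, the incremental source transcript from qwen3-asr-flash-realtime, and the incremental translation, while the translated audio plays through the default output device. Press Ctrl+C to release the audio devices and close the connection.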