github Azure/azure-sdk-for-python azure-ai-voicelive_1.2.0b3

latest releases: azure-cosmos_4.14.6, azure-monitor-opentelemetry-exporter_1.0.0b47
pre-release17 hours ago

1.2.0b3 (2026-02-02)

Features Added

  • Support for Explicit Null Values: Enhanced RequestSession to properly serialize explicitly set None values (e.g., turn_detection=None now correctly sends "turn_detection": null in the WebSocket message)
  • Filler Response Configuration: Added support for filler response generation during latency or tool calls:
    • BasicFillerResponseConfig for static filler texts that are randomly selected
    • LlmFillerResponseConfig for LLM-generated context-aware filler responses
    • FillerTrigger enum with latency and tool triggers
    • filler_response field in RequestSession and ResponseSession
  • Foundry Agent Integration: Added support for Azure AI Foundry agents:
    • FoundryAgentTool for defining Foundry agent configurations
    • ResponseFoundryAgentCallItem for Foundry agent call responses
    • FoundryAgentContextType enum for context management (no_context, agent_context)
    • Server events for Foundry agent call lifecycle: ServerEventResponseFoundryAgentCallArgumentsDelta, ServerEventResponseFoundryAgentCallArgumentsDone, ServerEventResponseFoundryAgentCallInProgress, ServerEventResponseFoundryAgentCallCompleted, ServerEventResponseFoundryAgentCallFailed
  • Reasoning Effort Control: Added reasoning_effort field to RequestSession, ResponseSession, and ResponseCreateParams for controlling reasoning models effort levels with ReasoningEffort enum (none, minimal, low, medium, high, xhigh)
  • Response Metadata: Added metadata field to Response and ResponseCreateParams for attaching up to 16 key-value pairs (max 64 chars for keys, 512 chars for values)
  • Array Encoding Support: Enhanced serialization to support pipe, space, comma, and newline-delimited array encoding formats
  • Custom Text Normalization: Added custom_text_normalization_url field to AzureStandardVoice, AzureCustomVoice, and AzurePersonalVoice for custom text normalization configurations
  • Avatar Scene Configuration: Added Scene model for controlling avatar's zoom level, position (x/y), rotation (x/y/z pitch/yaw/roll), and movement amplitude in the video frame
  • Enhanced Avatar Configuration: Added scene and output_audit_audio fields to AvatarConfig for scene control and audit audio forwarding via WebSocket

Other Changes

  • Dependency Update: Updated minimum azure-core version from 1.36.0 to 1.37.0
  • Security Enhancement: Removed eval() usage in serialization utilities, replaced with explicit type checking for improved security
  • Serialization Improvements: Enhanced model_base deserialization for mutable types and array-encoded strings

Bug Fixes

  • Audio Format Values: Fixed OutputAudioFormat enum values to use underscore format (pcm16_8000hz, pcm16_16000hz) instead of hyphenated format for consistency with wire protocol and backward compatibility

Don't miss a new azure-sdk-for-python release

NewReleases is sending notifications on new releases.