Azure/azure-sdk-for-python azure-ai-voicelive

1.2.0b3 (2026-02-02)

Features Added

Support for Explicit Null Values: Enhanced RequestSession to properly serialize explicitly set None values (e.g., turn_detection=None now correctly sends "turn_detection": null in the WebSocket message)
Filler Response Configuration: Added support for filler response generation during latency or tool calls:
- BasicFillerResponseConfig for static filler texts that are randomly selected
- LlmFillerResponseConfig for LLM-generated context-aware filler responses
- FillerTrigger enum with latency and tool triggers
- filler_response field in RequestSession and ResponseSession
Foundry Agent Integration: Added support for Azure AI Foundry agents:
- FoundryAgentTool for defining Foundry agent configurations
- ResponseFoundryAgentCallItem for Foundry agent call responses
- FoundryAgentContextType enum for context management (no_context, agent_context)
- Server events for Foundry agent call lifecycle: ServerEventResponseFoundryAgentCallArgumentsDelta, ServerEventResponseFoundryAgentCallArgumentsDone, ServerEventResponseFoundryAgentCallInProgress, ServerEventResponseFoundryAgentCallCompleted, ServerEventResponseFoundryAgentCallFailed
Reasoning Effort Control: Added reasoning_effort field to RequestSession, ResponseSession, and ResponseCreateParams for controlling reasoning models effort levels with ReasoningEffort enum (none, minimal, low, medium, high, xhigh)
Response Metadata: Added metadata field to Response and ResponseCreateParams for attaching up to 16 key-value pairs (max 64 chars for keys, 512 chars for values)
Array Encoding Support: Enhanced serialization to support pipe, space, comma, and newline-delimited array encoding formats
Custom Text Normalization: Added custom_text_normalization_url field to AzureStandardVoice, AzureCustomVoice, and AzurePersonalVoice for custom text normalization configurations
Avatar Scene Configuration: Added Scene model for controlling avatar's zoom level, position (x/y), rotation (x/y/z pitch/yaw/roll), and movement amplitude in the video frame
Enhanced Avatar Configuration: Added scene and output_audit_audio fields to AvatarConfig for scene control and audit audio forwarding via WebSocket

Other Changes

Dependency Update: Updated minimum azure-core version from 1.36.0 to 1.37.0
Security Enhancement: Removed eval() usage in serialization utilities, replaced with explicit type checking for improved security
Serialization Improvements: Enhanced model_base deserialization for mutable types and array-encoded strings

Bug Fixes

Audio Format Values: Fixed OutputAudioFormat enum values to use underscore format (pcm16_8000hz, pcm16_16000hz) instead of hyphenated format for consistency with wire protocol and backward compatibility

Azure/azure-sdk-for-python azure-ai-voicelive_1.2.0b3 on GitHub

1.2.0b3 (2026-02-02)

Features Added

Other Changes

Bug Fixes

Azure/azure-sdk-for-python azure-ai-voicelive_1.2.0b3
on GitHub