1.2.0b3 (2026-02-02)
Features Added
- Support for Explicit Null Values: Enhanced
RequestSessionto properly serialize explicitly setNonevalues (e.g.,turn_detection=Nonenow correctly sends"turn_detection": nullin the WebSocket message) - Filler Response Configuration: Added support for filler response generation during latency or tool calls:
BasicFillerResponseConfigfor static filler texts that are randomly selectedLlmFillerResponseConfigfor LLM-generated context-aware filler responsesFillerTriggerenum withlatencyandtooltriggersfiller_responsefield inRequestSessionandResponseSession
- Foundry Agent Integration: Added support for Azure AI Foundry agents:
FoundryAgentToolfor defining Foundry agent configurationsResponseFoundryAgentCallItemfor Foundry agent call responsesFoundryAgentContextTypeenum for context management (no_context,agent_context)- Server events for Foundry agent call lifecycle:
ServerEventResponseFoundryAgentCallArgumentsDelta,ServerEventResponseFoundryAgentCallArgumentsDone,ServerEventResponseFoundryAgentCallInProgress,ServerEventResponseFoundryAgentCallCompleted,ServerEventResponseFoundryAgentCallFailed
- Reasoning Effort Control: Added
reasoning_effortfield toRequestSession,ResponseSession, andResponseCreateParamsfor controlling reasoning models effort levels withReasoningEffortenum (none,minimal,low,medium,high,xhigh) - Response Metadata: Added
metadatafield toResponseandResponseCreateParamsfor attaching up to 16 key-value pairs (max 64 chars for keys, 512 chars for values) - Array Encoding Support: Enhanced serialization to support pipe, space, comma, and newline-delimited array encoding formats
- Custom Text Normalization: Added
custom_text_normalization_urlfield toAzureStandardVoice,AzureCustomVoice, andAzurePersonalVoicefor custom text normalization configurations - Avatar Scene Configuration: Added
Scenemodel for controlling avatar's zoom level, position (x/y), rotation (x/y/z pitch/yaw/roll), and movement amplitude in the video frame - Enhanced Avatar Configuration: Added
sceneandoutput_audit_audiofields toAvatarConfigfor scene control and audit audio forwarding via WebSocket
Other Changes
- Dependency Update: Updated minimum
azure-coreversion from 1.36.0 to 1.37.0 - Security Enhancement: Removed
eval()usage in serialization utilities, replaced with explicit type checking for improved security - Serialization Improvements: Enhanced model_base deserialization for mutable types and array-encoded strings
Bug Fixes
- Audio Format Values: Fixed
OutputAudioFormatenum values to use underscore format (pcm16_8000hz,pcm16_16000hz) instead of hyphenated format for consistency with wire protocol and backward compatibility