1.3.0b1 (2026-05-28)
Features Added
- Azure Realtime Native Voice Support: Added AzureRealtimeNativeVoice and AzureRealtimeNativeVoiceName, and expanded voice fields to accept Azure realtime native voices.
- WebRTC Call Negotiation Support: Added ClientEventRtcCallSdpCreate, ServerEventRtcCallSdpCreated, ServerEventRtcCallError, and RtcCallErrorDetails for SDP-based WebRTC call setup.
- Input Text Streaming Support: Added ClientEventInputTextDelta and ClientEventInputTextDone for incrementally streaming text input into existing conversation items.
- Hosted Agent Invocation Input: Added invoke_input to ResponseCreateParams and ServerEventResponseInvocationDelta for hosted agent invocation passthrough data.
- Audio Playback Lifecycle Events: Added ServerEventOutputAudioBufferStarted and ServerEventOutputAudioBufferStopped to track model audio playback start and stop.
- Echo Cancellation Configuration: Added EchoCancellationReferenceSource and new reference_source/channels options on AudioEchoCancellation to support both the default server loopback reference path and client-provided stereo echo reference input.
- Smart End-of-Turn Detection: Added SmartEndOfTurnDetection as an audio-based end-of-turn detection option.
- Parallel Tool Call Control: Added parallel_tool_calls to session models so callers can control whether tool calls may run in parallel.
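
The input text streaming entry above describes a delta-then-done pattern. As a rough illustration of that pattern only, the sketch below splits a string into incremental chunks followed by a completion marker. The TextDelta and TextDone classes, their item_id and delta fields, and the stream_text helper are hypothetical stand-ins written for this example; they are not the SDK's ClientEventInputTextDelta or ClientEventInputTextDone, whose actual fields are not listed in these notes.

```python
from dataclasses import dataclass
from typing import Iterator, Union


# Hypothetical stand-ins for illustration only. The real SDK event classes
# may take different constructor arguments.
@dataclass
class TextDelta:
    item_id: str  # assumed: identifies the conversation item being extended
    delta: str    # assumed: the next chunk of text


@dataclass
class TextDone:
    item_id: str  # assumed: marks the item's text input as complete


def stream_text(
    item_id: str, text: str, chunk_size: int = 16
) -> Iterator[Union[TextDelta, TextDone]]:
    """Yield one TextDelta per chunk of `text`, then a single TextDone."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(text), chunk_size):
        yield TextDelta(item_id=item_id, delta=text[start:start + chunk_size])
    yield TextDone(item_id=item_id)
```

In a real client, each yielded object would presumably be mapped to the corresponding SDK event and sent over the open connection; that step is left out here because the send call is not described in these notes.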
Breaking Changes
- Image Input Field Rename: Renamed RequestImageContentPart.url to image_url. Update image input construction to use image_url= instead of url=.
- Default API Version Update: Changed the SDK default API version from 2026-04-10 to 2026-06-01-preview. Pass api_version="2026-04-10" explicitly to keep the previous default behavior.
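
For codebases that build image parts and connection options from keyword dictionaries, a small migration shim can ease the transition. The sketch below is one possible approach, not part of the SDK: migrate_image_part_kwargs and pin_previous_api_version are hypothetical helper names, and the shim assumes the keyword arguments are later passed to RequestImageContentPart and to whichever entry point accepts api_version.

```python
import warnings
from typing import Any, Dict

# The previous default API version, as stated in the notes above.
PREVIOUS_API_VERSION = "2026-04-10"


def migrate_image_part_kwargs(kwargs: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of `kwargs` with the legacy `url` key renamed to `image_url`."""
    migrated = dict(kwargs)
    if "url" in migrated:
        if "image_url" in migrated:
            raise ValueError("pass either 'url' or 'image_url', not both")
        warnings.warn(
            "'url' was renamed to 'image_url' in 1.3.0b1",
            DeprecationWarning,
            stacklevel=2,
        )
        migrated["image_url"] = migrated.pop("url")
    return migrated


def pin_previous_api_version(kwargs: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of `kwargs` that keeps the pre-1.3.0b1 default API version.

    An explicit `api_version` supplied by the caller is left untouched.
    """
    pinned = dict(kwargs)
    pinned.setdefault("api_version", PREVIOUS_API_VERSION)
    return pinned
```

Both helpers copy their input rather than mutating it, so they can be dropped into existing call sites without side effects and removed once every caller uses the new names.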
Bug Fixes
- Deserialization Improvements: Improved XML model deserialization and common scalar header
deserialization paths for better compatibility and lower overhead.