1.0.0-beta.4 (2026-02-09)
Features Added
- Added
VoiceLiveRequestOptionsclass for per-request customization:- Supports custom query parameters via
addCustomQueryParameter(String key, String value)method - Supports custom headers via
addCustomHeader(String name, String value)andsetCustomHeaders(HttpHeaders)methods - Custom parameters and headers can be passed to session creation methods
- Supports custom query parameters via
- Enhanced session creation with new overloads:
- Added
startSession(String model, VoiceLiveRequestOptions requestOptions)for model with custom options - Added
startSession(VoiceLiveRequestOptions requestOptions)for custom options without explicit model parameter - Original
startSession(String model)andstartSession()methods preserved for backward compatibility
- Added
- Added Foundry Agent tool support:
FoundryAgentToolfor integrating Foundry agents as tools in VoiceLive sessionsFoundryAgentContextTypeenum for configuring agent context (no_context, agent_context)ResponseFoundryAgentCallItemfor tracking Foundry agent call responses- Foundry agent call lifecycle events:
ServerEventResponseFoundryAgentCallArgumentsDelta,ServerEventResponseFoundryAgentCallArgumentsDone,ServerEventResponseFoundryAgentCallInProgress,ServerEventResponseFoundryAgentCallCompleted,ServerEventResponseFoundryAgentCallFailed ItemType.FOUNDRY_AGENT_CALLandToolType.FOUNDRY_AGENTdiscriminator values
- Added filler response configuration for handling latency and tool calls:
FillerResponseConfigBasebase class for filler response configurationsBasicFillerResponseConfigfor static/random text filler responsesLlmFillerResponseConfigfor LLM-generated context-aware filler responsesFillerResponseConfigTypeenum (static_filler, llm_filler)FillerTriggerenum for trigger conditions (latency, tool)- Added
fillerResponseproperty toVoiceLiveSessionOptionsandVoiceLiveSessionResponse
- Added reasoning effort configuration for reasoning models:
ReasoningEffortenum with levels: none, minimal, low, medium, high, xhigh- Added
reasoningEffortproperty toVoiceLiveSessionOptions,VoiceLiveSessionResponse, andResponseCreateParams
- Added metadata support:
- Added
metadataproperty toResponseCreateParamsandSessionResponsefor attaching key-value pairs
- Added
- Added custom text normalization URL support for Azure voices:
- Added
customTextNormalizationUrlproperty toAzureCustomVoice,AzurePersonalVoice, andAzureStandardVoice
- Added
Bugs Fixed
- Fixed
OutputAudioFormatenum values from dash-separated to underscore-separated:pcm16-8000hz→pcm16_8000hzpcm16-16000hz→pcm16_16000hz