1.0.0b5 (2025-09-26)
Features Added
- Enhanced Semantic Detection Type Safety: Added new
EouThresholdLevel
enum for better type safety in end-of-utterance detection:LOW
for low sensitivity threshold levelMEDIUM
for medium sensitivity threshold levelHIGH
for high sensitivity threshold levelDEFAULT
for default sensitivity threshold level
- Improved Semantic Detection Configuration: Enhanced semantic detection classes with better type annotations:
threshold_level
parameter now supports both string values andEouThresholdLevel
enum- Cleaner type definitions for
AzureSemanticDetection
,AzureSemanticDetectionEn
, andAzureSemanticDetectionMultilingual
- Improved documentation for threshold level parameters
- Comprehensive Unit Test Suite: Added extensive unit test coverage with 200+ test cases covering:
- All enum types and their functionality
- Model creation, validation, and serialization
- Async connection functionality with proper mocking
- Client event handling and workflows
- Voice configuration across all supported types
- Message handling with content part hierarchy
- Integration scenarios and real-world usage patterns
- Recent changes validation and backwards compatibility
- API Version Update: Updated to API version
2025-10-01
(from2025-05-01-preview
) - Enhanced Type Safety: Added new
AzureVoiceType
enum with values for better Azure voice type categorization:AZURE_CUSTOM
for custom voice configurationsAZURE_STANDARD
for standard voice configurationsAZURE_PERSONAL
for personal voice configurations
- Improved Message Handling: Added
MessageRole
enum for better role type safety in message items - Enhanced Model Documentation: Comprehensive documentation improvements across all models:
- Added detailed docstrings for model classes and their parameters
- Enhanced enum value documentation with descriptions
- Improved type annotations and parameter descriptions
- Enhanced Semantic Detection: Added improved configuration options for all semantic detection classes:
- Added
threshold_level
parameter with options:"low"
,"medium"
,"high"
,"default"
(recommended over deprecatedthreshold
) - Added
timeout_ms
parameter for timeout configuration in milliseconds (recommended over deprecatedtimeout
)
- Added
- Video Background Support: Added new
Background
model for video background customization:- Support for solid color backgrounds in hex format (e.g.,
#00FF00FF
) - Support for image URL backgrounds
- Mutually exclusive color and image URL options
- Support for solid color backgrounds in hex format (e.g.,
- Enhanced Video Parameters: Extended
VideoParams
model with:background
parameter for configuring video backgrounds using the newBackground
modelgop_size
parameter for Group of Pictures (GOP) size control, affecting compression efficiency and seeking performance
- Improved Type Safety: Added
TurnDetectionType
enum for better type safety and IntelliSense support - Package Structure Modernization: Simplified package initialization with namespace package support
- Enhanced Error Handling: Added
ConnectionError
andConnectionClosed
exception classes to the async API for better WebSocket error management
Breaking Changes
- Cross-Language Package Identity Update: Updated package ID from
VoiceLive
toVoiceLive.WebSocket
for better cross-language consistency - Model Refactoring:
- Renamed
UserContentPart
toMessageContentPart
for clearer content part hierarchy - All message items now require a
content
field with list ofMessageContentPart
objects OutputTextContentPart
now inherits fromMessageContentPart
instead of being standalone
- Renamed
- Enhanced Type Safety:
- Azure voice classes now use
AzureVoiceType
enum discriminators instead of string literals - Message role discriminators now use
MessageRole
enum values for better type safety
- Azure voice classes now use
- Removed Deprecated Parameters: Completely removed deprecated parameters from semantic detection classes:
- Removed
threshold
parameter from all semantic detection classes (AzureSemanticDetection
,AzureSemanticDetectionEn
,AzureSemanticDetectionMultilingual
) - Removed
timeout
parameter from all semantic detection classes - Users must now use
threshold_level
andtimeout_ms
parameters respectively
- Removed
- Removed Synchronous API: Completely removed synchronous WebSocket operations to focus exclusively on async patterns:
- Removed sync
connect()
function and syncVoiceLiveConnection
class from main patch implementation - Removed sync
basic_voice_assistant.py
sample (only async version remains) - Simplified sync patch to minimal structure with empty exports
- All functionality now available only through async patterns
- Removed sync
- Updated Dependencies: Modified package dependencies to reflect async-only architecture:
- Moved
aiohttp>=3.9.0,<4.0.0
from optional to required dependency - Removed
websockets
optional dependency as sync API no longer exists - Removed optional dependency groups
websockets
,aiohttp
, andall-websockets
- Moved
- Model Rename:
- Renamed
AudioInputTranscriptionSettings
toAudioInputTranscriptionOptions
for consistency with naming conventions - Renamed
AzureMultilingualSemanticVad
toAzureSemanticVadMultilingual
for naming consistency with other multilingual variants
- Renamed
- Enhanced Type Safety: Turn detection discriminator types now use enum values instead of string literals for better type safety
Bug Fixes
- Serialization Improvements: Fixed type casting issue in serialization utilities for better enum handling and type safety
Other Changes
- Testing Infrastructure: Added comprehensive unit test suite with extensive coverage:
- 8 main test files with 200+ individual test methods
- Tests for all enums, models, async operations, client events, voice configurations, and message handling
- Integration tests covering real-world scenarios and recent changes
- Proper mocking for async WebSocket connections
- Backwards compatibility validation
- Test coverage for all recent changes and enhancements
- API Documentation: Updated API view properties to reflect model structure changes, new enums, and cross-language package identity
- Documentation Updates: Comprehensive updates to all markdown documentation:
- Updated README.md to reflect async-only nature with updated examples and installation instructions
- Updated samples README.md to remove sync sample references
- Enhanced BASIC_VOICE_ASSISTANT.md with comprehensive async implementation guide
- Added MIGRATION_GUIDE.md for users upgrading from previous versions