QuentinFuxa/WhisperLiveKit 0.2.6 on GitHub

Voice Activity Control (VAC) by Default: VAC is now enabled by default to improve transcription accuracy by filtering out non-speech segments before processing transcription & diarization. You can disable it with the --no-vac flag.
Simulstreaming Backend Enhancements:
- The simulstreaming backend is now the default transcription backend.
- Improved timestamp accuracy for audio segments longer than 30 seconds.
- Backends models are now recycled to optimize resource usage, by removing whisper hooks at the end of a transcription
- Added the ability to preload multiple backend models using the --preloaded_model_count argument, when several users are espected
Diarization with Silences: The diart diarization backend now correctly handles pauses and silences, improving speaker turn detection.
Time Handling: Aligned time handling between the backend and the frontend for better synchronization.
WebSocket Communication: Buffering is disabled during silent periods.
Default Model: The default model is now base.