github QuentinFuxa/WhisperLiveKit 0.2.6

4 months ago
  • Voice Activity Control (VAC) by Default: VAC is now enabled by default. It improves transcription accuracy by filtering out non-speech segments before transcription and diarization. You can disable it with the --no-vac flag.

  • Simulstreaming Backend Enhancements:

    • The simulstreaming backend is now the default transcription backend.
    • Improved timestamp accuracy for audio segments longer than 30 seconds.
    • Backend models are now recycled to optimize resource usage: Whisper hooks are removed at the end of a transcription.
    • Added the ability to preload multiple backend models with the --preloaded_model_count argument, for cases where several concurrent users are expected.
  • Diarization with Silences: The diart diarization backend now correctly handles pauses and silences, improving speaker turn detection.

  • Time Handling: Aligned time handling between the backend and the frontend for better synchronization.

  • WebSocket Communication: Buffering is disabled during silent periods.

  • Default Model: The default model is now base.
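
The two flags mentioned above (--no-vac and --preloaded_model_count) can be combined on one command line. The following is a minimal sketch in Python that only assembles and prints such a command. It assumes the server is started through an executable named whisperlivekit-server and that the model is chosen with a --model flag; neither name appears in these notes, and build_server_command is a hypothetical helper, so check both against the project's own help output before relying on them.

```python
import shlex


def build_server_command(model="base", use_vac=True, preloaded_model_count=1):
    """Assemble an argv list for the server CLI (hypothetical helper).

    Assumptions not stated in the release notes: the executable name
    `whisperlivekit-server` and the `--model` flag.
    """
    argv = ["whisperlivekit-server", "--model", model]
    if not use_vac:
        # VAC is on by default in 0.2.6, so only the opt-out flag is needed.
        argv.append("--no-vac")
    if preloaded_model_count > 1:
        # Preload several backend models when several users are expected.
        argv += ["--preloaded_model_count", str(preloaded_model_count)]
    return argv


# Print a copy-pasteable command line: VAC disabled, four preloaded models.
print(shlex.join(build_server_command(use_vac=False, preloaded_model_count=4)))
```

With the defaults left untouched, the helper emits neither flag, which matches the new default behavior described in these notes (VAC enabled, base model).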
