github QuentinFuxa/WhisperLiveKit 0.2.7

3 months ago

0.2.7: Diarization Improvements

  • New default backend: Sortformer is now the default diarization backend, replacing Diart
  • Roughly 6x faster processing: latency reduced from ~2s to ~0.3s on CPU
  • Significantly improved speaker detection (limitation: currently supports at most 4 speakers)
  • Shared model loading: a single Sortformer model (SortformerDiarization) is now shared across users and instances to reduce memory footprint. Per-user state such as speaker caches and frames is handled in SortformerDiarizationOnline
  • Enhanced alignment: Improved time and token synchronization between transcription and diarization results
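
The shared-model change can be pictured with a minimal sketch: one heavy object is loaded once, and each user gets a lightweight session that holds only its own frames and speaker cache. The class names, the `predict` and `push` methods, and the placeholder inference below are all hypothetical and chosen for illustration; they are not WhisperLiveKit's actual API.

```python
from dataclasses import dataclass, field


class SharedDiarizationModel:
    """Hypothetical stand-in for a heavy model that is loaded once."""

    load_count = 0  # counts how many times the "weights" were loaded

    def __init__(self):
        SharedDiarizationModel.load_count += 1

    def predict(self, frame):
        # Placeholder inference (assumption): map a frame value to one of
        # 4 speaker indices, mirroring the 4-speaker limit in the notes.
        return int(frame) % 4


@dataclass
class OnlineSession:
    """Hypothetical per-user state that borrows the shared model."""

    model: SharedDiarizationModel
    frames: list = field(default_factory=list)
    speaker_cache: dict = field(default_factory=dict)

    def push(self, frame):
        # Run inference on the shared model, but record results locally.
        speaker = self.model.predict(frame)
        self.frames.append(frame)
        self.speaker_cache[speaker] = self.speaker_cache.get(speaker, 0) + 1
        return speaker


# One model instance, two independent user sessions.
shared = SharedDiarizationModel()
alice = OnlineSession(shared)
bob = OnlineSession(shared)

alice.push(1)
alice.push(5)
bob.push(2)
```

In this arrangement memory for the model is paid once regardless of the number of sessions, while the frames and caches of one user never leak into another's.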
