Bug Fixes
- Fix SimulStreaming VRAM leak : cross-attention tensors were accumulating unboundedly during the decoding loop (up to env 5GB on repetition loops). Now capped to a rolling window (#283, #275)
- Fix
--direct-english-translationnot working with LocalAgreement backends :task="translate"was never propagated to the ASR backend'stranscribe_kargs, so whisper always ran in transcribe mode regardless of the flag. (#306) - Fix
model_cache_dirignored in SimulStreaming : download path was hardcoded toNone. Also fixedtaskbeing passed as a boolean instead of a string ("translate"/"transcribe"). (#310) - Fix
NoneTypeconcatenation inadd_translation: crash whents.textwasNoneduring translation overlay. (#296) - Fix
Segment().from_tokens()bad classmethod call : was incorrectly called on an instance instead of the class. (#331) - Fix
token.has_endedtype narrowing : replacedtoken.is_silence()(non-existent method) with properisinstance(token, Silence)check. (#329)
Improvements
- Add
probabilityfield toASRTokenFasterWhisperASR now passes word-level probabilities through (#330, #313) - Add
insert_audio_chunktoDiartDiarization, brings the interface in line withSortformerDiarization(#332) - Initialize
OpenaiApiASR.task: was previously referenced but never set, causing potentialAttributeError.