New Features
-
Live Translation with NLLB:
- Backend flexibility: Add possibility to switch to the 'transformers' backend using
--nllb-backendfor improved performance on certain systems. See here for guidance on choosing the optimal backend. - Configurable model size: Adjust NLLB model size from 600M to 1.3B using
--nllb-sizeparameter for enhanced translation accuracy.
- Backend flexibility: Add possibility to switch to the 'transformers' backend using
-
Frontend AudioWorklet support: The frontend now automatically uses AudioWorklet instead of MediaRecorder when
--pcm-inputis enabled, useful for environments without ffmpeg or for custom implementations.
Bug Fixes
- Fixed
--warmup-fileparameter compatibility with backends like faster-whisper - Translation now runs in a dedicated asyncio thread to prevent transcription and diarization slowdowns
- Translation processing now correctly manages silent periods
- Resolved websocket timeout issues on low-end devices
- Corrected transcription/translation buffer positioning that was appearing below the translation output
Thank you to all contributors and community members who reported issues and provided valuable feedback.