New:
-
Replace ffmpeg-python with raw ffmpeg calls:
- Fixes systematic crashes after 9 minutes on some machines
- Improves reboot and restart handling
- Allows ffmpeg to restart without crashing the server on conversion errors
-
Update to latest SimulWhisper:
- Adds compatibility with English-only models
- Infers word-level timestamps for better diarization alignment
- Other improvements: https://github.com/ufal/SimulStreaming/commits/main/
-
Prevent buffer from growing indefinitely when no tokens are created
-
Fix Hugging Face token file handling in Docker
-
Remove default 8000 port in WebSocket when no port is provided