- You can now voice your subtitle files! Simply add
.srt,.assor.vttfiles to generate timed audio. Alternatively, add a text file with timestamps inHH:MM:SSorHH:MM:SS,msformat to generate audio that matches the timestamps. See here for detailed instructions.- New option: "Use silent gaps between subtitles": Prevents unnecessary audio speed-up by letting speech continue into the silent gaps between subtitles.
- New option: "Subtitle speed adjustment method": Choose how to speed up audio when needed:
- TTS Regeneration (better quality): Re-generates the audio at a faster speed for more natural sound.
- FFmpeg Time-stretch (better speed): Quickly speeds up the generated audio.
- Added support for embedding cover images in M4B files. Abogen now automatically extracts cover images from EPUB and PDF files. You can also manually specify a cover image using the
<<METADATA_COVER_PATH:path>>tag in your text file. (To prevent MPV from showing the cover image, you can addaudio-display=noto your MPV config file.) - Fixed
[WinError 1114] A dynamic link library (DLL) initialization routine failederror on Windows, pre-loading PyTorch DLLs before initializing PyQt6 to avoid DLL initialization errors, mentioned in #98 by @ephr0n. - Potential fix for
CUDA GPU is not availableissue, by ensuring PyTorch is installed correctly with CUDA support on Windows using the installer script. - Improvements in code and documentation.