Improved logging
Some information are now logged under INFO
and DEBUG
levels. The logging level can be configured like this:
import logging
logging.basicConfig()
logging.getLogger("faster_whisper").setLevel(logging.DEBUG)
More control over model downloads
New arguments were added to the WhisperModel
constructor to better control how the models are downloaded:
download_root
to specify where the model should be downloaded.local_files_only
to avoid downloading the model and directly return the path to the cached model, it it exists.
Other changes
- Improve the default VAD behavior to prevent some words from being assigned to the incorrect speech chunk in the original audio
- Fix incorrect application of option
condition_on_previous_text=False
(note that the bug still exists in openai/whisper v20230314) - Fix segment timestamps that are sometimes inconsistent with the words timestamps after VAD
- Extend the
Segment
structure with additional properties to match openai/whisper - Rename
AudioInfo
toTranscriptionInfo
and add a new propertyoptions
to summarize the transcription options that were used