Faster Whisper implementation using optimized CTranslate2 models.
GPU execution requires cuBLAS and cuDNN libs.
Last included commit: #149
Includes PR #163
Some new stuff:
no_repeat_ngram_size
arg.
Fix memory leak when batch processing.
Faster Whisper implementation using optimized CTranslate2 models.
GPU execution requires cuBLAS and cuDNN libs.
Last included commit: #149
Includes PR #163
no_repeat_ngram_size
arg.
Fix memory leak when batch processing.
Don't miss a new whisper-standalone-win release
NewReleases is sending notifications on new releases.