Overview
This is a minor release. The main reason for it is a fix for a critical bug that causes the software to crash randomly when the language auto-detect option is used (i.e. whisper_lang_auto_detect()).
Other than that, the release includes a refactoring of the examples, Ruby bindings, and some minor changes to the C API.
You can provide feedback in the existing v1.2.0 discussion.
What's Changed
Core ggml / whisper
- whisper : add "split_on_word" flag when using "max_len" option by @mightymatth in #455 and @boolemancer in #476
- whisper : add whisper_full_lang_id() for getting the context lang by @kamranjon in #461
- whisper : fixed Beam Search Strategy and exposed whisper_pcm_to_mel_phase_vocoder by @sandrohanea in #474
- whisper : suppress non-speech-related token outputs by @shibukazu in #473
- cmake : install whisper.h header by @aviks in #485
- whisper : fix signedness compiler warning by @shikokuchuo in #506
- whisper : by default disable non-speech tokens suppression #473
- whisper : add API for applying custom logits filters during decoding 0d22916
- whisper : fix uninitialized exp_n_audio_ctx by @Finnvoor in #520
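To illustrate what the "split_on_word" flag changes when "max_len" is set, here is a minimal standalone sketch. It does not use whisper.cpp and is not the library's implementation, which works on tokens and timestamps rather than raw characters. The function name split_segments and its parameters are hypothetical and exist only for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper, NOT part of whisper.cpp: break `text` into lines of at
// most `max_len` characters. With `split_on_word` set, lines break only
// between words (a single word longer than max_len is kept whole).
std::vector<std::string> split_segments(const std::string & text,
                                        std::size_t max_len,
                                        bool split_on_word) {
    assert(max_len > 0);
    std::vector<std::string> out;

    if (!split_on_word) {
        // Hard cut every max_len characters, even in the middle of a word.
        for (std::size_t i = 0; i < text.size(); i += max_len) {
            out.push_back(text.substr(i, max_len));
        }
        return out;
    }

    std::istringstream iss(text);
    std::string word;
    std::string line;
    while (iss >> word) {
        // Start a new line if appending " word" would exceed the limit.
        if (!line.empty() && line.size() + 1 + word.size() > max_len) {
            out.push_back(line);
            line.clear();
        }
        if (!line.empty()) {
            line += ' ';
        }
        line += word;
    }
    if (!line.empty()) {
        out.push_back(line);
    }
    return out;
}
```

With a limit of 8 characters, the input "hello world again" is cut mid-word when the flag is off, and kept as three whole words when it is on.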
Bindings
- bindings : add Ruby by @taf2 in #500
- readme : add .NET repos (#303)
- readme : add cython bindings (#9)
- readme : add pybind11 bindings by @aarnphm in #538
Examples
- ci : add node addon test and optimize compilation configuration by @chenqianhe in #468
- yt-wsp.sh : add unique filename generation by @genevera in #495
- examples : refactor in order to reuse code and reduce duplication by @ggerganov in #482
- main : fix stdin pipe stream by @conradg in #503
- make : add "-mcpu=native" when building for aarch64 (#532)
C-style API
- Add whisper_pcm_to_mel_phase_vocoder()
- Add *(whisper_logits_filter_callback)()
- Change struct whisper_full_params
- Add whisper_full_lang_id()
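As a rough idea of what a custom logits filter can do during decoding, the sketch below suppresses a set of token ids by forcing their logits to negative infinity before the next token is picked. It is standalone and deliberately simplified: suppress_tokens and pick_greedy are hypothetical names, and their signatures are not those of whisper_logits_filter_callback, whose actual declaration is in whisper.h.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical, simplified filter (NOT the whisper.h callback signature):
// make the banned token ids impossible to select by setting their logits
// to negative infinity.
void suppress_tokens(std::vector<float> & logits, const std::vector<int> & banned_ids) {
    for (int id : banned_ids) {
        assert(id >= 0 && static_cast<std::size_t>(id) < logits.size());
        logits[static_cast<std::size_t>(id)] = -std::numeric_limits<float>::infinity();
    }
}

// Hypothetical greedy selection: index of the largest logit.
int pick_greedy(const std::vector<float> & logits) {
    assert(!logits.empty());
    std::size_t best = 0;
    for (std::size_t i = 1; i < logits.size(); ++i) {
        if (logits[i] > logits[best]) {
            best = i;
        }
    }
    return static_cast<int>(best);
}
```

Running the filter before the selection step changes which token wins: a token that would have been picked greedily is skipped once it is banned, and the next-best candidate is chosen instead.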
New Contributors
- @mightymatth made their first contribution in #455
- @kamranjon made their first contribution in #461
- @sandrohanea made their first contribution in #474
- @shibukazu made their first contribution in #473
- @genevera made their first contribution in #495
- @shikokuchuo made their first contribution in #506
- @conradg made their first contribution in #503
- @taf2 made their first contribution in #500
- @Finnvoor made their first contribution in #520
- @aarnphm made their first contribution in #538
- @FlippFuzz made their first contribution in #532
Full Changelog: v1.2.0...v1.2.1
Highlights
Recently, I have been making progress on adding integer quantisation support to the ggml tensor library. This will eventually allow the use of quantised models, which require less memory and will hopefully run faster. I think the next major release, v1.3.0, will officially add quantisation support. For now, you can keep track of the progress in #540.
- 🎙️ MacWhisper by @jordibruin powered by whisper.cpp