Overview
Minor update:
- With Metal, auto-fallback to CPU if device does not support Apple7 family
- Add server example
What's Changed
- ISSUE-1329: replace " with ' so it doesn't try to execute code in backticks by @spullara in #1364
- sync : ggml (ggml-alloc + linker + gguf fixes) by @ggerganov in #1501
- Fixed with_state methods, to use the correct state by @sandrohanea in #1519
- #1517 Redistribute CUDA DLLs by @tamo in #1522
- whisper : reuse whisper_decode_with_state by @ggerganov in #1521
- sdl : fix audio callback by @ggerganov in #1523
- update deprecated example by @MightyStud in #1529
- Super Simple Whisper Server by @felrock in #1380
- Close file after writing in server application by @felrock in #1533
- bench : multi-thread memcpy by @ggerganov in #1534
- Change temp file name for server application by @felrock in #1535
- Fixed Makefile for MacOS ARM 64 Go bindings by @gleicon in #1530
- Fixed metal build on macos-latest by @sandrohanea in #1544
- fix(server): typo in temperature parameter by @Okabintaro in #1545
- Request to add a new function to get the full language name by @bradmit in #1546
- server : add --print-realtime param by @ecneladis in #1541
- cuda : sync some minor stuff from llama.cpp by @ggerganov in #1548
- metal : add backend function to check device family support by @ggerganov in #1547
New Contributors
- @spullara made their first contribution in #1364
- @MightyStud made their first contribution in #1529
- @felrock made their first contribution in #1380
- @gleicon made their first contribution in #1530
- @Okabintaro made their first contribution in #1545
- @bradmit made their first contribution in #1546
- @ecneladis made their first contribution in #1541
Full Changelog: v1.5.0...v1.5.1