What's Changed
- Add GPU acceleration probe by @aittalam in #953
- Add executable directory to the list of valid dylib paths by @aittalam in #959
- Update README with Windows executable size note by @ParthProLegend in #962
- Update llama.cpp submodule to 7b8443ac7 by @aittalam in #951
- Reduce CUDA dylib size by @aittalam in #963
- Clarify Linux GPU offload diagnostics by @adminlip in #967
- docs: rename example_llamfiles to pre-built-llamafiles for better SEO by @angpt in #972
- ggml-cpu: re-hook llamafile MoE matmul on x86 (partial fix for #973) by @aittalam in #974
- CPU flash-attention fixes for #975 (workaround + AVX-512 helpers + simd_gemm) by @aittalam in #978
- common.cpp: fix Apple Silicon physical-core detection under cosmocc (#976) by @aittalam in #977
- Update version to 0.10.2 by @aittalam in #984
- Modernise Diffusionfile Support by @cbk-res in #970
- Update llama.cpp submodule to dbe9c0c (+ embed real web UI) by @aittalam in #983
New Contributors
- @ParthProLegend made their first contribution in #962
- @adminlip made their first contribution in #967
- @cbk-res made their first contribution in #970
Full Changelog: 0.10.1...0.10.2