ggml-org/llama.cpp b8266
on GitHub

3 months ago

Details

llama-quant : fail early on missing imatrix, refactor type selection, code cleanup (#19770)

quantize : imatrix-fail early + code cleanup
fix manual override printing

it's in the preliminary loop now, so needs to be on its own line

revert header changes per ggerganov
remove old #includes
clarify naming

rename tensor_quantization to tensor_type_option to describe its
functionality

fix per barto
