github ggml-org/llama.cpp b8266

latest release: b8267
2 hours ago
Details

llama-quant : fail early on missing imatrix, refactor type selection, code cleanup (#19770)

  • quantize : imatrix-fail early + code cleanup

  • fix manual override printing

it's in the preliminary loop now, so needs to be on its own line

  • revert header changes per ggerganov

  • remove old #includes

  • clarify naming

rename tensor_quantization to tensor_typo_option to descirbe its
functionality

  • fix per barto

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.