ggml-org/llama.cpp b8048 on GitHub

Details

mmap: Fix Windows handle lifetime (#19598)

ggml: added cleanups in ggml_quantize_free
Add missing cleanup calls for IQ2_S, IQ1_M quantization types and IQ3XS with 512 blocks during quantization cleanup.
mmap: Fix Windows handle lifetime
Move hMapping from local variable to member variable so it stays alive for the entire lifetime of the mapping.
The file mapping handle must remain valid until UnmapViewOfFile is called.
Fixes cleanup order in destructor.
Update llama-mmap.cpp
Update llama-mmap.cpp

Remove trailing whitespace from line 567

macOS/iOS:

Linux:

Windows:

openEuler: