github ggml-org/llama.cpp b8048

3 hours ago

mmap: Fix Windows handle lifetime (#19598)

  • ggml: added cleanups in ggml_quantize_free
    Add the missing cleanup calls for the IQ2_S and IQ1_M quantization types, and for IQ3XS with 512 blocks, to the quantization cleanup path.

  • mmap: Fix Windows handle lifetime
    Move hMapping from a local variable to a member variable so that it stays alive for the entire lifetime of the mapping.
    The file mapping handle must remain valid until UnmapViewOfFile is called.
    Also fix the cleanup order in the destructor.

  • Update llama-mmap.cpp

  • Update llama-mmap.cpp
    Remove trailing whitespace from line 567
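
The handle-lifetime change in the mmap item can be illustrated with a small, portable sketch. This is not the code from llama-mmap.cpp: the `mapped_view` struct, the `fake_*` functions, and `run_lifetime_demo` are hypothetical stand-ins that only record the order of calls, so the example runs on any platform. It assumes the pattern the notes describe, in which the mapping handle is stored as a member and is closed only after the view has been unmapped.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-ins for the Win32 calls named in the notes.
// They do no real mapping; they only record the order of invocation.
static std::vector<std::string> g_calls;

using fake_handle = int;  // stands in for a Win32 HANDLE

static fake_handle fake_create_file_mapping() {
    g_calls.push_back("CreateFileMapping");
    return 1;
}
static void * fake_map_view(fake_handle) {
    static char buf[16];
    g_calls.push_back("MapViewOfFile");
    return buf;
}
static void fake_unmap_view(void *) { g_calls.push_back("UnmapViewOfFile"); }
static void fake_close_handle(fake_handle) { g_calls.push_back("CloseHandle"); }

// Illustrative wrapper (an assumption, not the llama.cpp class).
// The mapping handle is a member, so it outlives the constructor,
// and the destructor unmaps the view before closing the handle.
struct mapped_view {
    fake_handle h_mapping = 0;
    void *      addr      = nullptr;

    mapped_view() {
        h_mapping = fake_create_file_mapping();
        addr      = fake_map_view(h_mapping);
    }
    ~mapped_view() {
        if (addr)      { fake_unmap_view(addr); }        // release the view first
        if (h_mapping) { fake_close_handle(h_mapping); } // then the mapping handle
    }
    mapped_view(const mapped_view &) = delete;
    mapped_view & operator=(const mapped_view &) = delete;
};

// Creates and destroys one mapping and returns the recorded call order.
static std::vector<std::string> run_lifetime_demo() {
    g_calls.clear();
    { mapped_view view; }  // destructor runs at the end of this scope
    return g_calls;
}
```

Holding the handle in a member ties its lifetime to the object rather than to the constructor's scope, which is what allows the destructor to enforce the unmap-then-close order.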

