github ggml-org/llama.cpp b8638

latest release: b8639
2 hours ago
Details

tests: allow exporting graph ops from HF file without downloading weights (#21182)

  • tests: allow exporting graph ops from HF file without downloading weights

  • use unique_ptr for llama_context in HF metadata case

  • fix missing non-required tensors falling back to type f32

  • use unique pointers where possible

  • use no_alloc instead of fixing f32 fallback

  • fix missing space

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.