ggml-org/llama.cpp b7429


Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
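
If a deployment script currently unzips the Linux asset, one low-risk option is to accept either archive type during the transition. The snippet below is a minimal, unofficial Python sketch, not part of this release; the asset file name in the trailing comment is illustrative only.

```python
import pathlib
import tarfile
import zipfile

# Sketch of a deployment step that tolerates both archive formats while the
# Linux releases move from .zip to .tar.gz.

def extract_release(archive: str, dest: str = "llama.cpp-bin") -> None:
    """Extract a downloaded llama.cpp release archive, whatever its format."""
    path = pathlib.Path(archive)
    pathlib.Path(dest).mkdir(parents=True, exist_ok=True)
    if path.suffix == ".zip":
        with zipfile.ZipFile(path) as zf:
            zf.extractall(dest)
    elif path.name.endswith((".tar.gz", ".tgz")):
        with tarfile.open(path, "r:gz") as tf:
            tf.extractall(dest)
    else:
        raise ValueError(f"unexpected archive format: {path.name}")

# e.g. extract_release("llama-b7429-bin-ubuntu-x64.zip")  # illustrative name
```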

Details

model: support GLM4V vision encoder (#18042)

  • convert ok

  • no deepstack

  • fewer new tensors

  • cgraph ok

  • add mrope for text model (see the M-RoPE sketch after this list)

  • faster patch merger

  • add GGML_ROPE_TYPE_MRNORM

  • add support for metal

  • move glm4v to dedicated graph

  • convert: add norm_embd

  • clip: add debugging fn

  • working correctly

  • fix style

  • use bicubic

  • fix mrope metal

  • improve cpu

  • convert to neox ordering at conversion time

  • revert backend changes

  • force stop if using old weights

  • support moe variant

  • fix conversion

  • fix convert (2)

  • Update tools/mtmd/clip-graph.h (Co-authored-by: Georgi Gerganov ggerganov@gmail.com)

  • process mrope_section on TextModel base class

  • resolve merge conflict


Co-authored-by: Georgi Gerganov ggerganov@gmail.com
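
Several commits above add M-RoPE plumbing for the text model (the new GGML_ROPE_TYPE_MRNORM rope type and mrope_section handling on the TextModel base class). The sketch below, referenced from the list, is only a conceptual NumPy illustration of how a multi-section rotary embedding maps separate temporal/row/column position ids onto one rotation per head dimension; it is not the llama.cpp implementation, it uses an interleaved pairing convention for brevity, and every size in it is invented for the example.

```python
import numpy as np

# Conceptual sketch of M-RoPE (multi-section rotary position embedding).
# NOT the llama.cpp code: names such as mrope_section follow the HF config
# convention and all sizes are made up for the example.

def mrope_angles(pos_t, pos_h, pos_w, head_dim, mrope_section, base=10000.0):
    """Rotation angle per rotary dimension, driven by three position ids.

    mrope_section splits the head_dim/2 frequency slots into groups that take
    their position from the temporal, height and width components respectively.
    """
    half = head_dim // 2
    inv_freq = base ** (-np.arange(half) / half)   # standard RoPE frequencies
    pos = np.empty(half)
    t_end = mrope_section[0]
    h_end = t_end + mrope_section[1]
    pos[:t_end] = pos_t        # text/temporal position drives the first slots
    pos[t_end:h_end] = pos_h   # image-row position drives the next slots
    pos[h_end:] = pos_w        # image-column position drives the rest
    return pos * inv_freq

def apply_rope(x, angles):
    """Rotate consecutive pairs (x[2i], x[2i+1]) by the given angles."""
    pairs = x.reshape(-1, 2)
    cos, sin = np.cos(angles), np.sin(angles)
    rotated = np.stack([pairs[:, 0] * cos - pairs[:, 1] * sin,
                        pairs[:, 0] * sin + pairs[:, 1] * cos], axis=-1)
    return rotated.reshape(-1)

# Example: 64-dim head, 16 temporal + 8 height + 8 width frequency slots.
head_dim, section = 64, [16, 8, 8]
q = np.random.randn(head_dim)
q_rot = apply_rope(q, mrope_angles(pos_t=3, pos_h=5, pos_w=7,
                                   head_dim=head_dim, mrope_section=section))
print(q_rot.shape)  # (64,)
```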

Prebuilt binaries: macOS/iOS, Linux, Windows, openEuler.
