github ggml-org/llama.cpp b7607

latest releases: b7609, b7608
10 hours ago
Details

model: support youtu-vl model (#18479)

  • Support Youtu-VL Model

  • merge code

  • fix bug

  • revert qwen2 code & support rsplit in minja.hpp

  • update warm info

  • fix annotation

  • u

  • revert minja.hpp

  • fix

  • Do not write routed_scaling_factor to gguf when routed_scaling_factor is None

  • fix expert_weights_scale

  • LGTM after whitespace fixes

  • fix

  • fix

  • fix

  • layers to layer_index

  • enum fix


Co-authored-by: Xuan-Son Nguyen son@huggingface.co
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.