github ggml-org/llama.cpp b9265

2 hours ago
Details

hexagon: ssm-conv fix for large prompts (#23307)

  • hexagon: remove gathers and better handling of vtcm in ssm-conv

  • hexagon: relax ssm-conv gating requirements

  • hexagon: add new prefill ssm-conv backend test

  • hexagon: remove trailing white space

  • hex-rope: uninline rope_cache_init, otherwise it breaks after rebaseing with SSM_CONV changes


Co-authored-by: Max Krasnyansky maxk@qti.qualcomm.com

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.