github ggml-org/llama.cpp b7519

latest release: b7520
5 hours ago
Details

ggml-hexagon: create generalized functions for cpu side op (#17500)

  • refactor: replace ggml_hexagon_mul_mat with template-based binary operation for improved flexibility

  • refactor: replace ggml_hexagon_mul_mat_id with template-based binary operation for improved flexibility

  • refactor: initialize buffer types and streamline dspqueue_buffers_init calls for clarity

  • add comment

  • refactor: remove redundant buffer checks in hexagon supported operations

  • wip

  • add missing include to fix weak symbol warning

  • add ggml_hexagon_op_generic

  • refactor: simplify tensor operation initialization and buffer management in hexagon implementation

  • refactor: streamline hexagon operation initialization and buffer management

  • refactor: update function signatures and streamline request handling in hexagon operations

  • wip

  • ggml-hexagon: clean up code formatting and improve unary operation handling

  • wip

  • rename

  • fix: add support for permuted F16 tensors and enhance quantization checks in matrix operations

  • refactor: replace ggml_hexagon_mul_mat with template-based binary operation for improved flexibility

refactor: replace ggml_hexagon_mul_mat_id with template-based binary operation for improved flexibility

refactor: initialize buffer types and streamline dspqueue_buffers_init calls for clarity

refactor: remove redundant buffer checks in hexagon supported operations

add missing include to fix weak symbol warning

add ggml_hexagon_op_generic

refactor: simplify tensor operation initialization and buffer management in hexagon implementation

refactor: streamline hexagon operation initialization and buffer management

refactor: update function signatures and streamline request handling in hexagon operations

ggml-hexagon: clean up code formatting and improve unary operation handling

fix: add support for permuted F16 tensors and enhance quantization checks in matrix operations

Conflicts:

ggml/src/ggml-hexagon/ggml-hexagon.cpp

  • hexagon: fix merge conflicts

  • hexagon: minor cleanup for buffer support checks

  • hexagon: factor out op_desc and the overal op logging

  • hexagon: further simplify and cleanup op dispatch logic

  • snapdragon: update adb scripts to use llama-cli and llama-completion

  • fix pipeline failure


Co-authored-by: Max Krasnyansky maxk@qti.qualcomm.com

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.