Details
ggml-hexagon: create generalized functions for cpu side op (#17500)
-
refactor: replace ggml_hexagon_mul_mat with template-based binary operation for improved flexibility
-
refactor: replace ggml_hexagon_mul_mat_id with template-based binary operation for improved flexibility
-
refactor: initialize buffer types and streamline dspqueue_buffers_init calls for clarity
-
add comment
-
refactor: remove redundant buffer checks in hexagon supported operations
-
wip
-
add missing include to fix weak symbol warning
-
add ggml_hexagon_op_generic
-
refactor: simplify tensor operation initialization and buffer management in hexagon implementation
-
refactor: streamline hexagon operation initialization and buffer management
-
refactor: update function signatures and streamline request handling in hexagon operations
-
wip
-
ggml-hexagon: clean up code formatting and improve unary operation handling
-
wip
-
rename
-
fix: add support for permuted F16 tensors and enhance quantization checks in matrix operations
-
refactor: replace ggml_hexagon_mul_mat with template-based binary operation for improved flexibility
refactor: replace ggml_hexagon_mul_mat_id with template-based binary operation for improved flexibility
refactor: initialize buffer types and streamline dspqueue_buffers_init calls for clarity
refactor: remove redundant buffer checks in hexagon supported operations
add missing include to fix weak symbol warning
add ggml_hexagon_op_generic
refactor: simplify tensor operation initialization and buffer management in hexagon implementation
refactor: streamline hexagon operation initialization and buffer management
refactor: update function signatures and streamline request handling in hexagon operations
ggml-hexagon: clean up code formatting and improve unary operation handling
fix: add support for permuted F16 tensors and enhance quantization checks in matrix operations
Conflicts:
ggml/src/ggml-hexagon/ggml-hexagon.cpp
-
hexagon: fix merge conflicts
-
hexagon: minor cleanup for buffer support checks
-
hexagon: factor out op_desc and the overal op logging
-
hexagon: further simplify and cleanup op dispatch logic
-
snapdragon: update adb scripts to use llama-cli and llama-completion
-
fix pipeline failure
Co-authored-by: Max Krasnyansky maxk@qti.qualcomm.com
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: