github ggml-org/llama.cpp b7360

4 hours ago

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.
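For scripts that unpack the Linux release, one way to get through the transition is to accept both formats and pick the extractor from the file name. The following is a minimal sketch under that assumption, not an official migration path; the helper name `extract_release` and the archive paths are hypothetical and do not come from the release itself.

```python
import tarfile
import zipfile
from pathlib import Path


def extract_release(archive: Path, dest: Path) -> None:
    """Unpack a release archive, accepting both .zip and .tar.gz.

    Hypothetical helper: the name and behaviour are illustrative only.
    """
    dest.mkdir(parents=True, exist_ok=True)
    name = archive.name.lower()

    if name.endswith((".tar.gz", ".tgz")):
        with tarfile.open(archive, "r:gz") as tar:
            # Use the safer "data" extraction filter where the running
            # Python version provides it; fall back to the default otherwise.
            if hasattr(tarfile, "data_filter"):
                tar.extractall(dest, filter="data")
            else:
                tar.extractall(dest)
    elif name.endswith(".zip"):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
    else:
        raise ValueError(f"unsupported archive format: {archive.name}")
```

Dispatching on the suffix keeps the same script working before and after the switch, so it does not have to be updated in lockstep with the release that changes the format.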

SOLVE_TRI extension to more dimensions (#17793)

  • Extended TRI

  • Fix whitespace

  • chore: update webui build output

  • Just use cuBLAS for everything...

  • Merge both versions

  • Remove incorrect imports causing failures for CI

  • Still failing... remove all direct cublas imports and rely on common imports from "common.cuh"

  • Defines for hipBlas

  • Aaaand MUSA defines...

  • I hate this job...

  • Stupid typo...

  • Update ggml/src/ggml-cuda/solve_tri.cu

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>



