ggml-org/llama.cpp b8787
on GitHub

latest releases: b9439, b9438, b9437...

one month ago

Details

ggml-webgpu: Update register tiling matmul to use f32 accumulation (#21644)

Update register tiling matmul to use f32 accumulation
fix profiling code
Fix register tiling matmul for chrome, i'm blaming dawn
Update batch tuning value for iOS
compile fix
Fix use of new load function

macOS/iOS:

Linux:

Windows:

openEuler:

Check out latest releases or
releases around ggml-org/llama.cpp b8787

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.

Get notifications