github ggml-org/llama.cpp b9014

latest release: b9015
3 hours ago

ggml-webgpu: add layer norm ops (#22406)

  • shader(norm): add layer norm ops

  • shader(norm): stabilize floating-point computation with Kahan summation and handle mixed types

  • shader(norm): remove the non-contiguous strides

  • shader(norm): use the original implementation rather than the Kahan summation
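
For readers unfamiliar with the terms in the commit list, the following is a minimal Python sketch of layer normalization with a pluggable summation routine, so that plain and Kahan (compensated) summation can be compared. It is not the WebGPU shader from #22406: the function names `layer_norm` and `kahan_sum`, the `eps` default, and the omission of any learned scale or shift are assumptions made for illustration only.

```python
import math

def kahan_sum(values):
    """Compensated (Kahan) summation: carries the rounding error of each add forward."""
    total = 0.0
    comp = 0.0  # running estimate of the low-order bits lost so far
    for v in values:
        y = v - comp            # apply the correction from the previous step
        t = total + y           # low-order bits of y may be lost here
        comp = (t - total) - y  # recover what was lost
        total = t
    return total

def layer_norm(x, eps=1e-5, summation=sum):
    """Normalize one row to zero mean and unit variance (eps value is an assumption)."""
    n = len(x)
    mean = summation(x) / n
    var = summation([(v - mean) ** 2 for v in x]) / n
    inv_std = 1.0 / math.sqrt(var + eps)
    return [(v - mean) * inv_std for v in x]

row = [1.0, 2.0, 3.0, 4.0]
plain = layer_norm(row)                             # built-in summation
compensated = layer_norm(row, summation=kahan_sum)  # Kahan summation
```

On short, well-scaled rows such as this one the two variants agree to within rounding error. Compensated summation mainly matters for long rows or values of widely differing magnitude, at the cost of extra arithmetic per element.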

macOS/iOS:

Linux:

Android:

Windows:

openEuler:
