ggml-org/llama.cpp b9491
on GitHub


one month ago


Avoid PDL race conditions by disabling restrict when PDL is used (#24030)

Removes restrict from PDL kernel headers because the qualifier is
incompatible with PDL (Programmatic Dependent Launch). Adds
architecture-based preprocessor directives in the kernel body that
re-add restrict, retaining performance on older architectures.
Simplifies the new restrict usage via a macro.
Adds Hopper to the PDL restrict fix.
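
The pattern described above can be sketched roughly as follows. This is an illustration only, not the code from the PR: the macro name PDL_SAFE_RESTRICT, the function scale_add, and the architecture cutoff (compute capability 9.0, i.e. Hopper, and newer) are assumptions made for the example. The function is written as plain C++ so it compiles on the host with GCC or Clang; in CUDA it would be a __global__ kernel.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical macro; the name and the cutoff used in the PR may differ.
// Assumption: PDL is used on Hopper (compute capability 9.0) and newer,
// so the qualifier is dropped there and kept everywhere else.
#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 900
#    define PDL_SAFE_RESTRICT               // PDL architectures: no restrict
#else
#    define PDL_SAFE_RESTRICT __restrict__  // older architectures and host builds
#endif

// The signature deliberately carries no restrict qualifier.
void scale_add(const float * x, float * y, float a, std::size_t n) {
    assert(n == 0 || (x != nullptr && y != nullptr));

    // The qualifier is re-introduced inside the body, and only where the
    // macro expands to it.
    const float * PDL_SAFE_RESTRICT xr = x;
    float       * PDL_SAFE_RESTRICT yr = y;

    for (std::size_t i = 0; i < n; ++i) {
        yr[i] = a * xr[i] + yr[i];
    }
}

// Small self-check: computes y = 2 * x + y on three elements.
bool scale_add_demo() {
    std::vector<float> x = {1.0f, 2.0f, 3.0f};
    std::vector<float> y = {10.0f, 20.0f, 30.0f};
    scale_add(x.data(), y.data(), 2.0f, x.size());
    return y[0] == 12.0f && y[1] == 24.0f && y[2] == 36.0f;
}
```

Keeping the architecture check in one macro means each kernel body states the intent once, and extending the fix to another architecture would only require changing the condition.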

Co-authored-by: Oliver Simons <osimons@nvidia.com>

macOS/iOS:

macOS Apple Silicon (arm64)
macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED
macOS Intel (x64)
iOS XCFramework

Linux:

Android:

Android arm64 (CPU)

Windows:

openEuler:

DISABLED
openEuler x86 (310p)
openEuler x86 (910b, ACL Graph)
openEuler aarch64 (310p)
openEuler aarch64 (910b, ACL Graph)

UI:

UI
