3.0.0-beta.28 (2024-06-15)
Features
- compress CUDA prebuilt binaries (#236) (b89ad2d)
- automatically solve more CUDA compilation errors (#236) (b89ad2d)
Shipped with llama.cpp release b3153
To use the latest
llama.cpprelease available, runnpx --no node-llama-cpp download --release latest. (learn more)