3.18.1 (2026-03-17)
Features
- customize postinstall behavior (#582) (57bea3d) (documentation: Customizing postinstall Behavior)
- experimental support for context KV cache type configurations (#582) (57bea3d) (documentation: LlamaContextOptions["experimentalKvCacheKeyType"])
- support NVFP4 quants (#582) (57bea3d)
Shipped with llama.cpp release b8390
To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)