github withcatai/node-llama-cpp v3.14.0


3.14.0 (2025-10-02)

Features

  • Qwen3 Reranker support (#506) (00305f7) (see #506 for prequantized Qwen3 Reranker models you can use)

Bug Fixes

  • handle HuggingFace rate limit responses (#506) (00305f7)
  • adapt to llama.cpp breaking changes (#506) (00305f7)
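
For readers wondering what handling a rate limit response usually involves, here is a minimal sketch of the general technique: retry on HTTP 429, honouring the `Retry-After` header when present and falling back to exponential backoff otherwise. It is not node-llama-cpp's actual implementation, which these notes do not describe. The names `fetchWithRateLimitRetry`, `Fetcher`, and `MinimalResponse`, the retry count, and the delay values are all assumptions made for illustration.

```typescript
// Hypothetical helper for illustration only; not part of node-llama-cpp.
// The response shape is reduced to the two members this sketch needs.
type MinimalResponse = {
    status: number;
    headers: { get(name: string): string | null };
};

// The fetcher is injected so the retry logic can be exercised without network access.
type Fetcher = (url: string) => Promise<MinimalResponse>;

async function fetchWithRateLimitRetry(
    url: string,
    fetcher: Fetcher,
    maxRetries: number = 3, // assumed default
    sleep: (ms: number) => Promise<void> =
        (ms) => new Promise<void>((resolve) => setTimeout(resolve, ms))
): Promise<MinimalResponse> {
    for (let attempt = 0; ; attempt++) {
        const response = await fetcher(url);

        // Return anything that is not a rate limit response,
        // and give up once the retry budget is spent.
        if (response.status !== 429 || attempt >= maxRetries)
            return response;

        // Prefer the server's Retry-After hint (in seconds);
        // otherwise back off exponentially: 1s, 2s, 4s, ...
        const retryAfterSeconds = Number(response.headers.get("retry-after"));
        const delayMs = Number.isFinite(retryAfterSeconds) && retryAfterSeconds > 0
            ? retryAfterSeconds * 1000
            : 1000 * 2 ** attempt;

        await sleep(delayMs);
    }
}
```

In Node.js 18 or later, the global `fetch` can be passed as the `fetcher` argument, since its `Response` objects provide `status` and `headers.get`. A `Retry-After` value given as an HTTP date rather than a number of seconds falls through to the exponential backoff branch in this sketch.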

Shipped with llama.cpp release b6673

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest.
