withcatai/node-llama-cpp v3.0.0-beta.18
on GitHub

latest releases: v3.18.1, v3.18.0, v3.17.1...

pre-release22 months ago

3.0.0-beta.18 (2024-05-09)

Bug Fixes

more efficient max context size finding algorithm (#214) (453c162)
make embedding-only models work correctly (#214) (453c162)
perform context shift on the correct token index on generation (#214) (453c162)
make context loading work for all models on Electron (#214) (453c162)

Features

split gguf files support (#214) (453c162)
pull command (#214) (453c162)
stopOnAbortSignal and customStopTriggers on LlamaChat and LlamaChatSession (#214) (453c162)
checkTensors parameter on loadModel (#214) (453c162)
improve Electron support (#214) (453c162)

Shipped with llama.cpp release b2834

To use the latest llama.cpp release available, run npx --no node-llama-cpp download --release latest. (learn more)

Check out latest releases or
releases around withcatai/node-llama-cpp v3.0.0-beta.18

Don't miss a new node-llama-cpp release

NewReleases is sending notifications on new releases.

Get notifications