3.0.0-beta.17 (2024-04-24)
Bug Fixes
- `FunctionaryChatWrapper` bugs (#205) (ef501f9)
- function calling syntax bugs (#205) (ef501f9)
- show `GPU layers` in the `Model` line in CLI commands (#205) (ef501f9)
- refactor: rename `LlamaChatWrapper` to `Llama2ChatWrapper`
Features
- Llama 3 support (#205) (ef501f9)
- `--gpu` flag in generation CLI commands (#205) (ef501f9)
- `specialTokens` parameter on `model.detokenize` (#205) (ef501f9)
Shipped with `llama.cpp` release `b2717`
To use the latest `llama.cpp` release available, run `npx --no node-llama-cpp download --release latest`.