github withcatai/node-llama-cpp v3.10.0

3.10.0 (2025-06-12)

Features

  • JSON Schema Grammar: $defs and $ref support with full inferred types (#472) (9cdbce9)
  • inspect gguf command: format and print the Jinja chat template with --key .chatTemplate (#472) (9cdbce9)
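
To illustrate the first item, here is a minimal sketch of a JSON Schema that uses $defs and $ref. The schema itself (a "team" made of "person" entries) and the inlineLocalRefs helper are hypothetical and are not part of node-llama-cpp. The helper only shows what a local "#/$defs/<name>" reference means by expanding it in place; it does not describe how the library resolves references internally. In node-llama-cpp, a schema of this shape would typically be passed to llama.createGrammarForJsonSchema(...).

```typescript
// Hypothetical example schema: "person" is defined once under $defs
// and reused twice through $ref.
const teamSchema = {
    type: "object",
    properties: {
        lead: {$ref: "#/$defs/person"},
        members: {type: "array", items: {$ref: "#/$defs/person"}}
    },
    $defs: {
        person: {
            type: "object",
            properties: {
                name: {type: "string"},
                age: {type: "number"}
            }
        }
    }
};

// Hypothetical helper (not a node-llama-cpp API): returns a copy of the schema
// in which every local "#/$defs/<name>" reference is replaced by its definition.
// Assumption: definitions are not recursive; a self-referencing definition
// would make this function recurse forever.
function inlineLocalRefs(node: unknown, defs: Record<string, unknown>): unknown {
    if (Array.isArray(node))
        return node.map((item) => inlineLocalRefs(item, defs));

    if (node === null || typeof node !== "object")
        return node;

    const obj = node as Record<string, unknown>;
    const ref = obj["$ref"];
    if (typeof ref === "string" && ref.startsWith("#/$defs/")) {
        const name = ref.slice("#/$defs/".length);
        if (!(name in defs))
            throw new Error("Unknown reference: " + ref);

        return inlineLocalRefs(defs[name], defs);
    }

    const result: Record<string, unknown> = {};
    for (const [key, value] of Object.entries(obj)) {
        if (key === "$defs")
            continue; // definitions are dropped once they have been inlined

        result[key] = inlineLocalRefs(value, defs);
    }

    return result;
}

const inlined = inlineLocalRefs(teamSchema, teamSchema.$defs);
console.log(JSON.stringify(inlined, null, 2));
```

The expanded form printed at the end is equivalent to writing the "person" object out twice, which is the repetition that $defs and $ref avoid.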

Bug Fixes

  • JinjaTemplateChatWrapper: first function call prefix detection (#472) (9cdbce9)
  • QwenChatWrapper: improve Qwen chat template detection (#472) (9cdbce9)
  • apply maxTokens on function calling parameters (#472) (9cdbce9)
  • adjust default prompt completion length based on SWA size when relevant (#472) (9cdbce9)
  • improve thought segmentation syntax extraction (#472) (9cdbce9)
  • adapt to llama.cpp changes (#472) (9cdbce9)

Shipped with llama.cpp release b5640

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest.