github ggml-org/llama.cpp b8744

2 hours ago

common : enable reasoning budget sampler for gemma4 (#21697)

  • fix: enable reasoning budget sampler for gemma4

Add thinking_start_tag and thinking_end_tag to
common_chat_params_init_gemma4(). Without these, the reasoning
budget sampler never activates for gemma4.

Make the newline after "thought" optional in the PEG parser to
handle budget=0 (sampler forces end tag before the newline).

Add test case for empty thinking block.

Fixes #21487
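
To illustrate why the tags matter, here is a minimal sketch of a budget limiter. It is not the llama.cpp sampler: `BudgetLimiter`, its `step()` method, the string-per-token simplification, and the `<think>` / `</think>` tags are all hypothetical stand-ins (the real gemma4 tags are not given in these notes). It shows only the two behaviors described above: with no start tag configured the limiter never activates, and with budget=0 the end tag is forced before any other token, including the newline.

```cpp
#include <cassert>
#include <string>
#include <utility>

// Hypothetical stand-in for a reasoning budget limiter (not llama.cpp code).
// Tokens are modeled as whole strings to keep the sketch short.
struct BudgetLimiter {
    std::string start_tag;  // opens a thinking block (placeholder value)
    std::string end_tag;    // forced once the budget is spent
    int  budget;            // maximum number of thinking tokens
    bool thinking = false;  // true while inside a thinking block
    int  used     = 0;      // thinking tokens emitted so far

    BudgetLimiter(std::string start, std::string end, int max_tokens)
        : start_tag(std::move(start)), end_tag(std::move(end)), budget(max_tokens) {
        assert(budget >= 0);
    }

    // Takes the token the model proposes and returns the token to emit.
    std::string step(const std::string & proposed) {
        if (!thinking) {
            // An empty start tag can never match, so the limiter stays inactive.
            if (!start_tag.empty() && proposed == start_tag) {
                thinking = true;
                used     = 0;
            }
            return proposed;
        }
        if (proposed == end_tag) {  // the model closed the block on its own
            thinking = false;
            return proposed;
        }
        if (used >= budget) {       // budget spent: override with the end tag
            thinking = false;
            return end_tag;
        }
        ++used;
        return proposed;
    }
};
```

With a budget of 0, the token right after the start tag is replaced by the end tag, so the output contains an empty thinking block with no newline in it.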

  • use p.space() instead of p.optional(p.literal("\n")) in gemma4 thought parser
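
The parser change can be sketched in plain C++ without the PEG combinators. This is an illustration under stated assumptions, not the llama.cpp parser: `parse_thought`, `ThoughtParse`, and the `<think>` / `</think>` tags in the usage below are hypothetical, and the whitespace loop assumes `p.space()` accepts zero or more whitespace characters. The point is that the newline after "thought" is skipped when present but not required, so an empty thinking block still parses.

```cpp
#include <cctype>
#include <cstddef>
#include <optional>
#include <string>

// Result of parsing one thinking block (hypothetical type, for illustration).
struct ThoughtParse {
    std::string reasoning;  // text between the optional whitespace and the end tag
    std::string rest;       // everything after the end tag
};

// Matches: start_tag "thought" <optional whitespace> reasoning end_tag rest
std::optional<ThoughtParse> parse_thought(const std::string & in,
                                          const std::string & start_tag,
                                          const std::string & end_tag) {
    const std::string prefix = start_tag + "thought";
    if (in.compare(0, prefix.size(), prefix) != 0) {
        return std::nullopt;  // input does not open a thinking block
    }

    // Skip zero or more whitespace characters instead of requiring "\n".
    std::size_t pos = prefix.size();
    while (pos < in.size() && std::isspace(static_cast<unsigned char>(in[pos]))) {
        ++pos;
    }

    const std::size_t end = in.find(end_tag, pos);
    if (end == std::string::npos) {
        return std::nullopt;  // thinking block was never closed
    }
    return ThoughtParse{ in.substr(pos, end - pos), in.substr(end + end_tag.size()) };
}
```

Calling `parse_thought("<think>thought</think>Hi", "<think>", "</think>")` yields an empty `reasoning` and `rest == "Hi"`, which is the budget=0 case; the same input with `"\nplan"` after "thought" yields `reasoning == "plan"`.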

