github ggml-org/llama.cpp b8559

one hour ago
Details

common : inhibit lazy grammar sampler while reasoning is active (#20970)

  • common : inhibit grammar while reasoning budget is active

  • cont : update force_pos in accept

  • cont : fix tests

  • cont : tweak should apply logic

  • cont : return early not using grammar sampler

  • Add tests

  • cont : prevent backend sampling when reasoning budget enabled

  • cont : fix typo


Co-authored-by: Piotr Wilkin piotr.wilkin@syndatis.com

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.