github ggml-org/llama.cpp b7746

latest releases: b7751, b7750, b7749...
3 hours ago
Details

context : reserve new scheduler when graph topology changes (#18547)

  • context : reserve new scheduler when graph topology changes

  • cont : fix

  • cont : fix reserve

  • cont : reserve only when changes occur + timing

  • context : add comments

  • llama : reserve on sampler changes

  • common : allow null common_sampler

  • server : task declares needs (embd, logits, sampling)

  • server : do not init sampler if not needed

  • llama : fix need_reserve when unsetting a sampler

  • server : consolidate slot reset/clear logic

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.