github ggml-org/llama.cpp b7746

latest releases: b8262, b8261, b8260...
one month ago
Details

context : reserve new scheduler when graph topology changes (#18547)

  • context : reserve new scheduler when graph topology changes

  • cont : fix

  • cont : fix reserve

  • cont : reserve only when changes occur + timing

  • context : add comments

  • llama : reserve on sampler changes

  • common : allow null common_sampler

  • server : task declares needs (embd, logits, sampling)

  • server : do not init sampler if not needed

  • llama : fix need_reserve when unsetting a sampler

  • server : consolidate slot reset/clear logic

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.