github ggml-org/llama.cpp b9761

3 hours ago
Details

server: (router) move model downloading to dedicated process (#24834)

  • server: real-time model load progress tracking via /models/sse

  • update docs

  • server: move model download to child process

  • rm unused

  • fix most problems

  • clean up

  • nit fixes

  • fix test case

  • do not detact() thread

  • shorter MODEL_DOWNLOAD_TIMEOUT in test

  • throttle

macOS/iOS:

Linux:

Android:

Windows:

openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

UI:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.