github ggml-org/llama.cpp b7574

2 hours ago
Details

server : Cmdline arg -to changes http read timeout from current 600sec default (#18279)

  • Prevent crash if TTFT >300sec, boosted to 90 days

  • server : allow configurable HTTP timeouts for child models

  • server : pass needed timeouts from params only


Co-authored-by: Greg Slocum fromgit@wbtek.slocum.net

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.