ggml-org/llama.cpp b7574
on GitHub

latest releases: b10099, b10098, b10094...

6 months ago

Details

server : Cmdline arg -to changes http read timeout from current 600sec default (#18279)

Prevent crash if TTFT >300sec, boosted to 90 days
server : allow configurable HTTP timeouts for child models
server : pass needed timeouts from params only

Co-authored-by: Greg Slocum fromgit@wbtek.slocum.net

macOS/iOS:

Linux:

Windows:

openEuler:

Check out latest releases or
releases around ggml-org/llama.cpp b7574

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.

Get notifications