github sgl-project/sglang v0.2.9
Release v0.2.9

13 months ago

Highlights

  • New feature: Chunked prefill (#800, #811)
  • New models: DeepSeek-V2
  • Performance improvement: vectorized logprob computation
  • Accuracy fixes: fixed the double BOS problem in the chat template; moved logits to float32; updated the flashinfer sampling kernels
  • Feature fixes: added many missing logprob-related features to the OpenAI API server
  • CI/CD infra is now fully ready. The tests cover the frontend, the backend, accuracy, and performance.
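
As a rough illustration of the idea behind chunked prefill, the sketch below splits a long prompt into fixed-size pieces that are processed one after another, so a single long prompt does not have to go through the model in one pass. This is not SGLang's implementation: the function names `chunk_prefill` and `prefill_in_chunks` and the `chunk_size` parameter are hypothetical, and the model call is replaced by a counter.

```python
from typing import Iterator, List


def chunk_prefill(prompt_token_ids: List[int], chunk_size: int) -> Iterator[List[int]]:
    """Yield consecutive slices of the prompt, each at most chunk_size tokens."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(prompt_token_ids), chunk_size):
        yield prompt_token_ids[start:start + chunk_size]


def prefill_in_chunks(prompt_token_ids: List[int], chunk_size: int) -> List[int]:
    """Simulate a chunked prefill (hypothetical helper, no real model involved).

    Returns the number of tokens held in the cache after each step.
    """
    cached = 0
    history = []
    for chunk in chunk_prefill(prompt_token_ids, chunk_size):
        # A real engine would run the model on `chunk` here, attending to the
        # `cached` tokens already stored in the KV cache. This sketch only
        # tracks how the cache grows.
        cached += len(chunk)
        history.append(cached)
    return history


if __name__ == "__main__":
    prompt = list(range(10))  # stand-in for 10 prompt token ids
    for chunk in chunk_prefill(prompt, chunk_size=4):
        print(chunk)
    print(prefill_in_chunks(prompt, chunk_size=4))
```

With a 10-token prompt and `chunk_size=4`, the prompt is processed in three steps of 4, 4, and 2 tokens. In a serving engine, the gaps between chunks are where other requests could be scheduled; that scheduling is outside the scope of this sketch.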

Full Changelog: v0.2.5...v0.2.9
