github sgl-project/sglang v0.4.0
Release v0.4.0


Highlights

Blog: https://lmsys.org/blog/2024-12-04-sglang-v0-4/

We’re excited to release SGLang v0.4, featuring significant performance improvements and new features:

  • Zero-overhead batch scheduler: 1.1x increase in throughput.
  • Cache-aware load balancer: up to 1.9x increase in throughput with 3.8x higher cache hit rate.
  • Data parallelism attention for DeepSeek models: up to 1.9x decoding throughput improvement.
  • Fast structured outputs with xgrammar: up to 10x faster.
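
To make the cache-aware load balancer highlight more concrete, here is a minimal sketch of the general idea: send a request to the worker that has already seen the longest matching prompt prefix, and fall back to the least-loaded worker otherwise. This is an illustration under stated assumptions, not SGLang's router. The names `Worker`, `CacheAwareRouter`, `shared_prefix_len` and the `min_match_ratio` threshold are all hypothetical, and a real system would likely match on tokens with a radix tree rather than scanning raw strings.

```python
from dataclasses import dataclass, field


def shared_prefix_len(a: str, b: str) -> int:
    """Return the length of the common leading substring of a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


@dataclass
class Worker:
    # Hypothetical stand-in for one inference server replica.
    name: str
    seen: list[str] = field(default_factory=list)  # prompts routed here so far

    def best_match(self, prompt: str) -> int:
        """Longest prefix this worker shares with any prompt it has seen."""
        return max((shared_prefix_len(prompt, p) for p in self.seen), default=0)


class CacheAwareRouter:
    """Toy router: prefer prefix reuse, otherwise balance by request count."""

    def __init__(self, workers: list[Worker], min_match_ratio: float = 0.5):
        self.workers = workers
        # Assumed threshold: fraction of the prompt that must already be cached
        # on a worker before we prefer it over the least-loaded worker.
        self.min_match_ratio = min_match_ratio

    def route(self, prompt: str) -> Worker:
        best = max(self.workers, key=lambda w: w.best_match(prompt))
        if prompt and best.best_match(prompt) / len(prompt) >= self.min_match_ratio:
            chosen = best  # likely cache hit: reuse the shared prefix
        else:
            chosen = min(self.workers, key=lambda w: len(w.seen))  # balance load
        chosen.seen.append(prompt)
        return chosen


router = CacheAwareRouter([Worker("a"), Worker("b")])
routes = [
    router.route(p).name
    for p in [
        "system A: hello",
        "system B: hello",
        "system A: world",
        "system B: again",
    ]
]
```

In this toy run, requests that share a system prompt end up on the same worker, which is the effect a higher cache hit rate relies on. The throughput and hit-rate figures quoted above come from the release notes, not from this sketch.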

What's Changed

New Contributors

Full Changelog: v0.3.6...v0.4.0
