github sgl-project/sglang v0.2.0
Release v0.2.0

latest releases: v0.4.2.post2, v0.4.2.post1, v0.4.2...
6 months ago

Highlights

  • We performed extensive engineering to improve the base performance. Compared to TensorRT-LLM and vLLM, SGLang now consistently delivers superior or competitive performance in both online and offline scenarios, handling models from Llama-8B to Llama-405B, on A100 and H100 GPUs, using FP8 and FP16. See the latest blog.
  • New models: Llama3 405B, Deepseek MoE, InternLM, GPTBigCode, Mistral-Nemo

What's Changed

New Contributors

Full Changelog: v0.1.20...v0.2.0

Don't miss a new sglang release

NewReleases is sending notifications on new releases.