github sgl-project/sglang gateway-v0.2.4
Release Gateway-v0.2.4

latest release: v0.5.6.post2
2 days ago

🚀 SGLang Model Gateway v0.2.4 Released!

We're excited to announce SGLang Model Gateway v0.2.4 – a massive release focused on performance, security, and production-ready observability!

✨ Headline Features

⚡ Major Performance Optimizations

We've invested heavily in performance across the entire stack:

  • Optimized radix tree for cache-aware load balancing – Smarter routing decisions with lower overhead
  • Tokenizer optimization – Dramatically reduced CPU and memory footprint during tokenization
  • Core module optimization – HTTP and gRPC routers now run leaner and faster
  • Efficient OTEL implementation – Production-grade observability with minimal performance impact

🔌 Industry-First WASM Middleware Support

Programmable middleware using WebAssembly! Extend your gateway with safe, isolated plugins. Build custom routing logic, transform requests/responses, or integrate proprietary systems – all without touching core code. Your gateway, your rules.

📊 Production-Grade Observability

Full OpenTelemetry integration with distributed tracing for both HTTP and gRPC. Track requests across your entire inference stack with native trace context propagation. Finally, real visibility into your LLM infrastructure.

⚡ Built for speed. Hardened for security. Ready for production.

Gateway Changes (98 commits)

New Contributors

Full Changelog: gateway-v0.2.3...gateway-v0.2.4

Don't miss a new sglang release

NewReleases is sending notifications on new releases.