DeepSpeed v0.4.0
- [Press release] DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression
- New inference API for setting up models for inference (a usage sketch follows this list)
- DeepSpeed Inference: Multi-GPU inference with customized inference kernels and quantization support
- Mixture-of-Quantization: A novel quantization approach for reducing model size with minimal accuracy impact
- See the MoQ tutorial for more details (a rough config sketch also follows this list).
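
A minimal sketch of the inference API, assuming a Hugging Face GPT-2 model and a GPU with fp16 support; the `deepspeed.init_inference` arguments shown (`mp_size`, `dtype`, `replace_method`) follow the inference tutorial and may differ across versions:

```python
# Minimal sketch: wrapping a pretrained model with DeepSpeed Inference.
# Assumes `transformers` is installed and a CUDA device is available.
import torch
import deepspeed
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")

# init_inference injects the customized inference kernels and, with mp_size > 1,
# partitions the model across GPUs for multi-GPU inference.
engine = deepspeed.init_inference(model,
                                  mp_size=1,            # number of GPUs for model parallelism
                                  dtype=torch.half,     # run the kernels in fp16
                                  replace_method="auto")

inputs = tokenizer("DeepSpeed is", return_tensors="pt").to(torch.cuda.current_device())
outputs = engine.module.generate(**inputs, max_length=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```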
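
And a rough sketch of what an MoQ section in a DeepSpeed training config could look like, written here as a Python dict; the key names (`quantize_training`, `quantize_bits`, `quantize_schedule`, `quantize_groups`) reflect my reading of the MoQ tutorial and are not verified against this release, so consult the tutorial for the authoritative schema:

```python
# Illustrative MoQ config sketch (key names are assumptions; verify against the
# MoQ tutorial for your DeepSpeed version before use).
moq_config = {
    "train_batch_size": 32,
    "optimizer": {"type": "AdamW", "params": {"lr": 2e-5}},
    "fp16": {"enabled": True},
    "quantize_training": {
        "enabled": True,
        # Start from 16-bit weights and anneal toward an 8-bit target precision.
        "quantize_bits": {"start_bits": 16, "target_bits": 8},
        # How often (in steps) the precision schedule advances, and when it starts.
        "quantize_schedule": {"quantize_period": 400, "schedule_offset": 0},
        "quantize_groups": 8,
    },
}

# This dict would normally be written out as the JSON config file that is passed
# to the DeepSpeed launcher / deepspeed.initialize for training with MoQ enabled.
```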