github microsoft/DeepSpeed v0.3.1
DeepSpeed v0.3.1

latest releases: v0.14.2, v0.14.1, v0.14.0...
3 years ago

Updates

  • Efficient and robust compressed training through progressive layer dropping
  • JIT compilation of C++/CUDA extensions
  • Python-only install support, ~10x faster install time
  • PyPI hosted installation via pip install deepspeed
  • Removed apex dependency
  • Bug fixes for ZeRO-offload and CPU-Adam
  • Transformer support for dynamic sequence length (#424)
  • Linear warmup+decay lr schedule (#414)

Don't miss a new DeepSpeed release

NewReleases is sending notifications on new releases.