github deepspeedai/DeepSpeed v0.17.3
v0.17.3 Patch Release

one month ago

What's Changed

  • [TiledMLP]: fix for bs>1 by @stas00 in #7412
  • Update version.txt after v0.17.2 release. by @loadams in #7417
  • Enable torch version dependent compilation of record_module and iter_params by @deepcharm in #7362
  • [BUGFIX] Reset bucket.elements after reduction in ZeRO Stage 3 by @rahul713rk in #7418
  • Align missing argument in AllReduceCoalescedHandle by @deepcharm in #7414
  • Improvements to Communication Logger by @alexk101 in #7404
  • trying to fix nv-accelerate-v100.yml CI job by @stas00 in #7424
  • fix: Propagate strip_tensor_paddings by @saforem2 in #7426
  • Use past_key_value when provided by @deepcharm in #7428
  • set device_id in torch's init_process_group by @stas00 in #7266
  • [Ulysses-ALST] add FA3 support by @stas00 in #7430
  • TiledMLP + SequenceTiledCompute: improve the bs>1 use-case by @stas00 in #7422
  • Remove unused yaml test configurations and update README by @loadams in #7441
  • [ALST] fix typo in the url by @stas00 in #7444
  • [ALST] fix typo in the url part2 by @stas00 in #7446
  • Remove additional unused tests (human-eval) by @loadams in #7445
  • Fix: Adapt Llama injection policy for newer transformers versions by @huanyuqu in #7443

New Contributors

Full Changelog: v0.17.2...v0.17.3
