deepspeedai/DeepSpeed v0.17.3 on GitHub

What's Changed

[TiledMLP]: fix for bs>1 by @stas00 in #7412
Update version.txt after v0.17.2 release. by @loadams in #7417
Enable torch version dependent compilation of record_module and iter_params by @deepcharm in #7362
[BUGFIX] Reset bucket.elements after reduction in ZeRO Stage 3 by @rahul713rk in #7418
Align missing argument in AllReduceCoalescedHandle by @deepcharm in #7414
Improvements to Communication Logger by @alexk101 in #7404
trying to fix nv-accelerate-v100.yml CI job by @stas00 in #7424
fix: Propagate strip_tensor_paddings by @saforem2 in #7426
Use past_key_value when provided by @deepcharm in #7428
set device_id in torch's init_process_group by @stas00 in #7266
[Ulysses-ALST] add FA3 support by @stas00 in #7430
TiledMLP + SequenceTiledCompute: improve the bs>1 use-case by @stas00 in #7422
Remove unused yaml test configurations and update README by @loadams in #7441
[ALST] fix typo in the url by @stas00 in #7444
[ALST] fix typo in the url part2 by @stas00 in #7446
Remove additional unused tests (human-eval) by @loadams in #7445
Fix: Adapt Llama injection policy for newer transformers versions by @huanyuqu in #7443

Full Changelog: v0.17.2...v0.17.3