What's Changed
- Fix DS Stable Diffusion for latest diffusers version by @lekurile in #4770
- Resolve any '..' in the file paths using os.path.abspath() by @rraminen in #4709
- Update dockerfile with updated versions by @loadams in #4780
- Run workflows when they are edited by @loadams in #4779
- BF16_Optimizer: add support for bf16 grad acc by @nelyahu in #4713
- fix autoTP issue for mpt (trust_remote_code=True) by @sywangyi in #4787
- Fix Hybrid Engine metrics printing by @lekurile in #4789
- [BUG] partition_balanced return wrong result. by @zjjMaiMai in #4312
- improve the way to determine whether a variable is None by @RUAN-ZX in #4782
- [NPU] Add HcclBackend for 1-bit adam, 1-bit lamb, 0/1 adam by @RUAN-ZX in #4733
- Fix for stage3 when setting different communication data type by @BacharL in #4540
- Add support of Falcon models (7b, 40b, 180b) to DeepSpeed-FastGen by @arashb in #4790
- Switch paths-ignore to single quotes, update paths-ignore on nv-pre-compile-ops by @loadams in #4805
- fix for tests using torch<2.1 by @mrwyattii in #4818
- Universal Checkpoint for Sequence Parallelism by @samadejacobs in #4752
- Accelerate CI fix by @mrwyattii in #4819
- fix [BUG] 'DeepSpeedGPTInference' object has no attribute 'dtype' for… by @jxysoft in #4814
- Update broken link in docs by @mrwyattii in #4822
- Update imports from Transformers by @loadams in #4817
- Minor updates to CI workflows by @mrwyattii in #4823
- fix falcon model load from_config meta_data error by @baodii in #4783
- mv DeepSpeedEngine param_names dict init post _configure_distributed_model by @nelyahu in #4803
- Refactor launcher user arg parsing by @mrwyattii in #4824
- Fix 4649 by @Alienfeel in #4650
New Contributors
- @zjjMaiMai made their first contribution in #4312
- @jxysoft made their first contribution in #4814
- @baodii made their first contribution in #4783
- @Alienfeel made their first contribution in #4650
Full Changelog: v0.12.4...v0.12.5