What's Changed
- Update pre-commit version by @loadams in #6821
- Update version.txt after 0.16.1 release by @loadams in #6826
- Pin HPU tests by @loadams in #6831
- Flops profiler support einops.einsum by @lvhoaa in #6755
- Pin pytest-subtests version for accelerate tests by @loadams in #6842
- Inference UTs check for triton support from accelerator by @raza-sikander in #6782
- Unpin pytest-subtests now that 0.14.1 is released by @loadams in #6844
- Merge LoCo with Zero++ by @XingyuXie in #6730
- Fix type error in ZeROOrderedDict by @oraluben in #6794
- Fix uneven head sequence parallelism bug (#6774) by @Eugene29 in #6797
- Fix nv-torch-nightly test by pinning transformers by @loadams in #6849
- Remove broken links to non-active site by @kaiksi-bb in #6854
- Avoid poisoning process with CUDA calls as soon as importing by @HollowMan6 in #6810
- Fix xpu tests workflow failure by changing pip index url by @Liangliang-Ma in #6864
- Domino updates by @GuanhuaWang in #6861
- Add domino navigation by @GuanhuaWang in #6866
- Update TSC by @tjruwase in #6867
- Remove warnings from autodoc and sphinx by @loadams in #6788
- Update real_accelerator.py by @keiwoo in #6845
- Fix assertion for offloading states by @tohtana in #6855
- Remove pin from transformers version and fix Processing/Threading issues in tests by @loadams in #6822
- Add MLP/lm_head tp grain size setting. by @Yejing-Lai in #6828
- Fix --enable_each_rank_log when used with PDSH multi-node runner by @akeshet in #6863
- Update transformers ops unit tests to use requried_torch_version by @loadams in #6884
- Don't error out when cpu accelerator doesn't have torch (as default for whl building) by @loadams in #6886
- Add arctic model support by adding w2 to all_reduce by @pi314ever in #6856
- Update code owners by @tjruwase in #6890
New Contributors
- @lvhoaa made their first contribution in #6755
- @XingyuXie made their first contribution in #6730
- @Eugene29 made their first contribution in #6797
- @kaiksi-bb made their first contribution in #6854
- @HollowMan6 made their first contribution in #6810
- @keiwoo made their first contribution in #6845
- @akeshet made their first contribution in #6863
- @pi314ever made their first contribution in #6856
Full Changelog: v0.16.1...v0.16.2