What's Changed
- Update the locator for Megatron-LM by @rapsealk in #2564
- use get_global_rank if available by @jeffra in #2567
- Add Determined to open-source DL frameworks by @sirredbeard in #2573
- Support fp32 gradient accumulation for bf16 model by @delock in #2566
- Drop Maxwell Support by @cmikeh2 in #2574
- Fix quantized inference and add generic support for checkpoint loading by @RezaYazdaniAminabadi in #2547
- Fix MegatronLayerPolicy to have megatron_v2=True by @lekurile in #2579
- Update barrier and reduce_scatter_base to conform to PyTorch signatures by @Quentin-Anthony in #2570
- Support N-dimension input in quantization kernel by @lokoppakmsft in #2575
- Add checkpoint sharding unit tests by @mrwyattii in #2561
- Updating docs README by @jomayeri in #2587
- Updating API docs by @jomayeri in #2586
- Fix issues with Python 3.6 + add py-version checks to CI by @jeffra in #2589
- [benchmarks] get mask token from tokenizer by @jeffra in #2592
New Contributors
- @rapsealk made their first contribution in #2564
- @sirredbeard made their first contribution in #2573
Full Changelog: v0.7.6...v0.7.7