What's Changed
- Fix for distributed tests on pytorch>=1.12 by @mrwyattii in #2141
- delay torch import for inference compatibility check by @jeffra in #2167
- Fix wrong unit of latency in flops-profiler (#2090) by @zionwu in #2095
- [docs] adoption updates by @jeffra in #2173
- Update for AMD CI workflow by @mrwyattii in #2172
- [docs] update offload docs to include stage 1 by @jeffra in #2178
- Fixing model partitioning without injection by @RezaYazdaniAminabadi in #2179
- Match compute and reduce dtype by @tjruwase in #2145
- Enable fused_lamb_cuda_kernel on ROCm by @rraminen in #2148
- Update README to latest Composer version by @hanlint in #2177
- [deepspeed/autotuner] Missing hjson import by @rahilbathwal5 in #2175
- [docs] add more models to adoption by @jeffra in #2189
- [CI] fix lightning tests by @jeffra in #2190
- Fix typos on README.md by @gasparitiago in #2192
- Fix the layer-past for GPT based models by @RezaYazdaniAminabadi in #2196
- Add gradient_average flag support for sparse grads by @Dipet in #2188
- Adding the compression tutorial on GPT distillation and quantization by @minjiaz in #2197
- Log user config exactly by @tjruwase in #2201
- Fix the tensor-slicing copy for qkv parameters by @RezaYazdaniAminabadi in #2198
- Refactor Distributed Tests by @mrwyattii in #2180
- fix table syntax by @kamalkraj in #2204
- Correctly detect offload configuration by @tjruwase in #2208
- add cuda 11.7 by @jeffra in #2211
- use torch 1.9 in accelerate tests by @jeffra in #2215
- [zero-3] print warning once and support torch parameter by @awan-10 in #2127
- Add support of OPT models by @arashb in #2205
- fix typos in readme. by @zhjohnchan in #2218
- Fix regression with dist_init_required by @jeffra in #2225
- add doc for new bert example by @conglongli in #2224
- Remove the random-generator from context during inference by @RezaYazdaniAminabadi in #2228
- allow saving checkpoint without checkpoint json + bloom copy fix by @jeffra in #2237
- Correctly detect zero_offload by @tjruwase in #2213
- [docs] update community videos by @jeffra in #2249
- Refactor dist tests: Checkpointing by @tjruwase in #2202
- Make OPT policy backward compatible with pre-OPT transformers versions by @arashb in #2254
- fix ds-inference without policy by @RezaYazdaniAminabadi in #2247
New Contributors
- @zionwu made their first contribution in #2095
- @hanlint made their first contribution in #2177
- @rahilbathwal5 made their first contribution in #2175
- @gasparitiago made their first contribution in #2192
- @arashb made their first contribution in #2205
- @zhjohnchan made their first contribution in #2218
Full Changelog: v0.7.0...v0.7.1