microsoft/DeepSpeed v0.7.1 on GitHub

What's Changed

Fix for distributed tests on pytorch>=1.12 by @mrwyattii in #2141
delay torch import for inference compatability check by @jeffra in #2167
Fix wrong unit of latency in flops-profiler (#2090) by @zionwu in #2095
[docs] adoption updates by @jeffra in #2173
Update for AMD CI workflow by @mrwyattii in #2172
[docs] update offload docs to include stage 1 by @jeffra in #2178
Fixing model partitioning without injection by @RezaYazdaniAminabadi in #2179
Match compute and reduce dtype by @tjruwase in #2145
Enable fused_lamb_cuda_kernel on ROCm by @rraminen in #2148
Update README to latest Composer version by @hanlint in #2177
[deepspeed/autotuner] Missing hjson import by @rahilbathwal5 in #2175
[docs] add more models to adoption by @jeffra in #2189
[CI] fix lightning tests by @jeffra in #2190
Fix typos on README.md by @gasparitiago in #2192
Fix the layer-past for GPT based models by @RezaYazdaniAminabadi in #2196
Add gradient_average flag support for sparse grads by @Dipet in #2188
Adding the compression tutorial on GPT distillation and quantization by @minjiaz in #2197
Log user config exactly by @tjruwase in #2201
Fix the tensor-slicing copy for qkv parameters by @RezaYazdaniAminabadi in #2198
Refactor Distributed Tests by @mrwyattii in #2180
fix table syntax by @kamalkraj in #2204
Correctly detect offload configuration by @tjruwase in #2208
add cuda 11.7 by @jeffra in #2211
use torch 1.9 in accelerate tests by @jeffra in #2215
[zero-3] print warning once and support torch parameter by @awan-10 in #2127
Add support of OPT models by @arashb in #2205
fix typos in readme. by @zhjohnchan in #2218
Fix regression w. dist_init_required by @jeffra in #2225
add doc for new bert example by @conglongli in #2224
Remove the random-generator from context during inference by @RezaYazdaniAminabadi in #2228
allow saving ckpt w/o ckpt json + bloom copy fix by @jeffra in #2237
Correctly detect zero_offload by @tjruwase in #2213
[docs] update community videos by @jeffra in #2249
Refactor dist tests: Checkpointing by @tjruwase in #2202
Make OPT policy backward compatible with pre-OPT transformers versions by @arashb in #2254
fix ds-inference without policy by @RezaYazdaniAminabadi in #2247

New Contributors

@zionwu made their first contribution in #2095
@hanlint made their first contribution in #2177
@rahilbathwal5 made their first contribution in #2175
@gasparitiago made their first contribution in #2192
@arashb made their first contribution in #2205
@zhjohnchan made their first contribution in #2218

Full Changelog: v0.7.0...v0.7.1

microsoft/DeepSpeed v0.7.1 v0.7.1: Patch release on GitHub

What's Changed

New Contributors

microsoft/DeepSpeed v0.7.1
v0.7.1: Patch release

on GitHub