deepspeed 0.13.5 on Python PyPI

What's Changed

Update version.txt after 0.13.4 release by @mrwyattii in #5196
Fix assertion to run pipeline engine with a compiled module by @tohtana in #5197
Allow specifying MII branch on MII CI by @mrwyattii in #5208
[zero++] Synchronize at the end of secondary partitioning and simplify the logic by @ByronHsu in #5216
Add fp16 support of Qwen1.5 models (0.5B to 72B) to DeepSpeed-FastGen by @ZonePG in #5219
Rename nv-torch-latest-cpu workflow to cpu-torch-latest by @loadams in #5226
Fix moe cpu offload by @RezaYazdaniAminabadi in #5220
Use deepspeed.comm instead of torch.distributed by @jinyouzhi in #5225
fix fused_qkv model accuracy issue by @Yejing-Lai in #5217

Full Changelog: v0.13.4...v0.13.5