A patch release was made for the following three commits: DeepSpeed ZeRO-3 handling when resizing embedding layers (#26259) [doc] Always call it Agents for consistency (#25958) deepspeed resume from ckpt fixes and adding support for deepspeed optimizer and HF scheduler (#25863)