What's Changed
Release
- [release] update version (#6195) by Hongxin Liu
Doc
- [doc] DeepSeek V3/R1 news (#6199) by binmakeswell
Application
- [application] add lora sft example data (#6198) by Hongxin Liu
- [application] Update README (#6196) by Tong Li
- [application] add lora sft example (#6192) by Hongxin Liu
Checkpointio
- [checkpointio] fix for async io (#6189) by flybird11111
- [checkpointio] fix checkpoint for 3d (#6187) by flybird11111
- [checkpointio] gather tensor before unpadding it if the tensor is both padded and distributed (#6168) by Lemon Qin
- [checkpointio] support load-pin overlap (#6177) by Hongxin Liu
Hotfix
- [hotfix] fix zero optim save (#6191) by Hongxin Liu
- [hotfix] fix hybrid checkpointio for sp+dp (#6184) by flybird11111
Shardformer
- [shardformer] support pipeline for deepseek v3 and optimize lora save (#6188) by Hongxin Liu
- [shardformer] support ep for deepseek v3 (#6185) by Hongxin Liu
CI
- [CI] Cleanup Dist Optim tests with shared helper funcs (#6125) by Wenxuan Tan
Issue template
- [Issue template] Add checkbox asking for details to reproduce error (#6104) by Wenxuan Tan
Inference
- [Inference] Fix example in readme (#6178) by Guangyao Zhang
Full Changelog: v0.4.7...v0.4.8