github hpcaitech/ColossalAI v0.4.1
Version v0.4.1 Release Today!

latest releases: v0.4.6, v0.4.5, v0.4.4...
3 months ago

What's Changed

Release

Misc

Compatibility

Chat

Shardformer

Auto parallel

  • [Auto Parallel]: Speed up intra-op plan generation by 44% (#5446) by Stephan Kö

Zero

Pre-commit.ci

Feature

Hotfix

  • [HotFix] CI,import,requirements-test for #5838 (#5892) by Runyu Lu
  • [Hotfix] Fix OPT gradient checkpointing forward by Edenzzzz
  • [hotfix] fix the bug that large tensor exceed the maximum capacity of TensorBucket (#5879) by Haze188

Feat

  • [Feat] Diffusion Model(PixArtAlpha/StableDiffusion3) Support (#5838) by Runyu Lu

Hoxfix

  • [Hoxfix] Fix CUDA_DEVICE_MAX_CONNECTIONS for comm overlap by Edenzzzz

Quant

Doc

  • [doc] Update llama + sp compatibility; fix dist optim table by Edenzzzz

Moe/zero

  • [MoE/ZeRO] Moe refactor with zero refactor (#5821) by Haze188

Full Changelog: v0.4.1...v0.4.0

Don't miss a new ColossalAI release

NewReleases is sending notifications on new releases.