github hpcaitech/ColossalAI v0.1.4
v0.1.4 Released!

Main Features

Here are the main improvements of this release:

  1. ColoTensor: a data structure that unifies the tensor representation across different parallel methods.
  2. Gemini: a more efficient Gemini implementation that reduces the overhead of collecting model data statistics.
  3. CLI: a command-line tool that helps users launch distributed training tasks more easily.
  4. Pipeline Parallelism (PP): a more user-friendly API for PP.

What's Changed

ColoTensor

Gemini + ZeRO

  • [zero] add zero tensor shard strategy by @1SAA in #793
  • Revert "[zero] add zero tensor shard strategy" by @feifeibear in #806
  • [gemini] a new tensor structure by @feifeibear in #818
  • [gemini] APIs to set cpu memory capacity by @feifeibear in #809
  • [DO NOT MERGE] [zero] init fp16 params directly in ZeroInitContext by @ver217 in #808
  • [gemini] collect cpu-gpu moving volume in each iteration by @feifeibear in #813
  • [gemini] add GeminiMemoryManger by @1SAA in #832
  • [zero] use GeminiMemoryManager when sampling model data by @ver217 in #850
  • [gemini] polish code by @1SAA in #855
  • [gemini] add stateful tensor container by @1SAA in #867
  • [gemini] polish stateful_tensor_mgr by @1SAA in #876
  • [gemini] accelerate adjust_layout() by @ver217 in #878

CLI

Pipeline Parallelism

Misc

Full Changelog: v0.1.3...v0.1.4
