github NVIDIA/Megatron-LM core_r0.10.0
NVIDIA Megatron Core 0.10.0

6 months ago
  • Adding MLA to MCore
  • Enable FP8 for GroupedMLP
  • MoE Parallel Folding
  • Enhance MoE Architecture: Support MoE Layer Frequency Patterns and Configurable MoE FFN Hidden Size
  • Multimodal: NVLM training and evaluation support in MCore
  • Mamba Hybrid
    • Increase performance and reduce memory footprint of Triton language/compiler distributed caching
    • Add more unit testing and fix bugs
