NVIDIA/Megatron-LM core_r0.10.0
NVIDIA Megatron Core 0.10.0



6 months ago

- Adding MLA to MCore
- Enable FP8 for GroupedMLP
- MoE Parallel Folding
- Enhance MoE Architecture: Support MoE Layer Frequency Patterns and Configurable MoE FFN Hidden Size
- Multimodal: NVLM training and evaluation support in MCore
- Mamba Hybrid
  - Increase performance and reduce memory footprint of Triton language/compiler distributed caching
  - Add more unit testing and fix bugs
