github NVIDIA/Megatron-LM core_v0.17.1
NVIDIA Megatron Core 0.17.1

21 hours ago
Changelog Details
  • beep boop 🤖: Bumping versions by @svcnvidia-nemo-ci :: PR: #4349
  • cp: NVFP4 native weights for DDP (4005) into core_r0.17.0 by @ko3n1g :: PR: #4290
  • docs: bump project.json and versions1.json to 0.17.0 by @ko3n1g :: PR: #4361
  • [docs] ci: fix version picker in 0.17.0 docs by @ko3n1g :: PR: #4363
  • [docs] ci: use parent-relative json_url for version picker by @ko3n1g :: PR: #4366
  • Backport NVRx async checkpoint compatibility to core_r0.17.0 by @sbak5 :: PR: #4453
  • cp: add permute fusion into hybrid ep (4089) into core_r0.17.0 by @ko3n1g :: PR: #4488
  • cp: get rid of weights_only=False (4434) into core_r0.17.0 by @ko3n1g :: PR: #4554
  • cp: SafeUnpickler class for safe pickle usage (4319) into core_r0.17.0 by @ko3n1g :: PR: #4555
  • cp: checkpoint integrity verification (4305) into core_r0.17.0 by @ko3n1g :: PR: #4556
  • fix(async_ckpt): import inspect in async_utils on core_r0.17.0 by @ko3n1g :: PR: #4597
  • chore(beep boop 🤖): Bump uv.lock (core_r0.17.0) (2026-05-04) by @svcnvidia-nemo-ci :: PR: #4598
  • cp: fix: Replace polynomial rolling hash with SHA-256 for prefix caching (#4158) by @chtruong814 :: PR: #4612
  • build: relax transformers cap to <=5.3.0 on core_r0.17.0 by @ko3n1g :: PR: #4701
  • chore: Bump TE to latest 2.14 by @chtruong814 :: PR: #4772
  • cp: additional tests for nvrx (#4522) by @chtruong814 :: PR: #4826
  • Release 0.17.0 by @ko3n1g
  • Bump mfsdp to 0.4.0 by @ko3n1g
  • cp: NVFP4 native weights for DDP (4005) into core_r0.17.0 (#4290) by @ko3n1g
  • docs: bump project.json and versions1.json to 0.17.0 (#4361) by @ko3n1g
  • [docs] ci: fix version picker in 0.17.0 docs (#4363) by @ko3n1g
  • [docs] ci: use parent-relative json_url for version picker (#4366) by @ko3n1g
  • chore(beep boop 🤖): Bump (core_r0.17.0) (2026-04-20) by @github-actions[bot]
  • Backport NVRx async checkpoint compatibility to core_r0.17.0 (#4453) by @sbak5
  • add permute fusion into hybrid ep (#4089) by @Autumn1998
  • Merge pull request #4488 from NVIDIA/cherry-pick-4089-core_r0.17.0 by @ko3n1g
  • get rid of weights_only=False (#4434) by @dimapihtar
  • SafeUnpickler class for safe pickle usage (#4319) by @dimapihtar
  • checkpoint integrity verification (#4305) by @dimapihtar
  • Merge pull request #4554 from NVIDIA/cherry-pick-4434-core_r0.17.0 by @ko3n1g
  • Merge pull request #4555 from NVIDIA/cherry-pick-4319-core_r0.17.0 by @ko3n1g
  • Merge pull request #4556 from NVIDIA/cherry-pick-4305-core_r0.17.0 by @ko3n1g
  • fix(async_ckpt): import inspect in async_utils on core_r0.17.0 (#4597) by @ko3n1g
  • chore(beep boop 🤖): Bump uv.lock (core_r0.17.0) (2026-05-04) (#4598) by @svcnvidia-nemo-ci
  • cp: fix: Replace polynomial rolling hash with SHA-256 for prefix caching (#4158) (#4612) by @chtruong814
  • build: relax transformers cap to <=5.3.0 on core_r0.17.0 (#4701) by @ko3n1g
  • chore(beep boop 🤖): Bump (core_r0.17.0) (2026-05-11) by @github-actions[bot]
  • chore: Bump TE to latest 2.14 (#4772) by @chtruong814
  • cp: additional tests for nvrx (#4522) (#4826) by @chtruong814
  • chore(beep boop 🤖): Bump (core_r0.17.0) (2026-05-18) by @github-actions[bot]
  • chore(beep boop 🤖): Bump (core_r0.17.0) (2026-05-25) by @github-actions[bot]
