App
Added
- add flow
fail()
(#18883)
Fixed
- Fix failing lightning cli entry point (#18821)
Fabric
Changed
- Calling a method other than
forward
that invokes submodules is now an error when the model is wrapped (e.g., with DDP) (#18819)
Fixed
- Fixed false-positive warnings about method calls on the Fabric-wrapped module (#18819)
- Refined the FSDP saving logic and error messaging when the path exists (#18884)
- Fixed layer conversion under
Fabric.init_module()
context manager when using theBitsandbytesPrecision
plugin (#18914)
PyTorch
Fixed
- Fixed an issue when replacing an existing
last.ckpt
file with a symlink (#18793) - Fixed an issue when
BatchSizeFinder
steps_per_trial
parameter ends up defining how many validation batches to run during the entire training (#18394) - Fixed an issue saving the
last.ckpt
file when usingModelCheckpoint
on a remote filesystem, and no logger is used (#18867) - Refined the FSDP saving logic and error messaging when the path exists (#18884)
- Fixed an issue parsing the version from folders that don't include a version number in
TensorBoardLogger
andCSVLogger
(#18897)
Contributors
@awaelchli, @Borda, @BoringDonut, @carmocca, @hiaoxui, @ioangatop, @nohalon, @rasbt, @tchaton
If we forgot someone due to not matching commit email with GitHub account, let us know :]
Full Changelog: 2.1.0...2.1.1