github huggingface/accelerate v1.4.0
v1.4.0: `torchao` FP8, TP & dataLoader support, fix memory leak

latest releases: v1.12.0, v1.11.0, v1.10.1...
12 months ago

torchao FP8, initial Tensor Parallel support, and memory leak fixes

torchao FP8

This release introduces a new FP8 API and brings in a new backend: torchao. To use, pass in AORecipeKwargs to the Accelerator while setting mixed_precision="fp8". This is initial support, as it matures we will incorporate more into it (such as accelerate config/yaml) in future releases. See our benchmark examples here

TensorParallel

We have intial support for an in-house solution to TP when working with accelerate dataloaders. check out the PR here

Bug fixes

What's Changed

New Contributors

Full Changelog: v1.3.0...v1.4.0

Don't miss a new accelerate release

NewReleases is sending notifications on new releases.