pypi accelerate 0.5.0
v0.5.0 Dispatch batches from main DataLoader

latest releases: 1.0.0rc1, 1.0.0rc0, 0.34.2...
2 years ago

v0.5.0 Dispatch batches from main DataLoader

This release introduces support for iterating through a DataLoader only on the main process, that then dispatches the batches to all processes.

Dispatch batches from main DataLoader

The motivation behind this come from dataset streaming which introduces two difficulties:

  • there might be some timeouts for some elements of the dataset, which might then be different in each process launched, thus it's impossible to make sure the data is iterated though the same way on each process
  • when using IterableDataset, each process goes through the dataset, thus applies the preprocessing on all elements. This can yield to the training being slowed down by this preprocessing.

This new feature is activated by default for all IterableDataset.

Various fixes

Don't miss a new accelerate release

NewReleases is sending notifications on new releases.