pypi datasets 1.6.2

3 years ago

Fix memory issue: record batches are no longer copied in memory during a table deepcopy. #2291 (@lhoestq)
The issue affected methods such as concatenate_datasets, multiprocessed map, and load_from_disk.

Breaking change:

  • When Dataset.map is called with the input_columns parameter, the resulting dataset contains only the columns listed in input_columns plus any columns added by the map function. All other columns are discarded.
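
The column behavior described above can be illustrated with a small pure-Python model. This is only a sketch of the semantics stated in this release note, not the library's implementation: the helper map_with_input_columns, the sample rows, and the add_length function are all hypothetical names introduced for illustration. It assumes, as in Dataset.map, that the selected columns are passed to the function as positional arguments and that the function returns a dict of new columns.

```python
# Toy model of the 1.6.2 column semantics; NOT the datasets implementation.
def map_with_input_columns(rows, function, input_columns):
    """Keep only `input_columns` plus whatever columns `function` returns."""
    result = []
    for row in rows:
        # Columns named in input_columns survive the map.
        kept = {name: row[name] for name in input_columns}
        # The function receives the selected columns as positional arguments.
        added = function(*(row[name] for name in input_columns))
        # Every column not in `kept` or `added` is dropped.
        result.append({**kept, **added})
    return result


# Hypothetical sample data with two columns.
rows = [
    {"text": "hello", "label": 0},
    {"text": "datasets", "label": 1},
]


def add_length(text):
    # Adds one new column derived from the "text" column.
    return {"length": len(text)}


mapped = map_with_input_columns(rows, add_length, input_columns=["text"])
print(mapped)
```

In this model the "label" column is gone from the result because it was not listed in input_columns. Under the behavior described here, a column that must survive the map would need to be listed in input_columns as well.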
