pypi datasets 1.6.2

3 years ago

Fix memory issue: don't copy record batches in memory during a table deepcopy #2291 (@lhoestq)
This affected methods like concatenate_datasets, multiprocessed map and load_from_disk.

Breaking change:

  • When using Dataset.map with the input_columns parameter, the resulting dataset only has the columns listed in input_columns plus the columns added by the mapped function. All other columns are discarded.
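
The column semantics described above can be illustrated with a small plain-Python model. This is only a sketch of the behavior stated in this release note, not the library's implementation: map_with_input_columns and the sample rows are hypothetical names introduced for illustration, and rows are modeled as ordinary dicts rather than a Dataset.

```python
def map_with_input_columns(rows, function, input_columns):
    """Toy model of the column semantics described in the note (hypothetical helper)."""
    result = []
    for row in rows:
        # The function receives only the selected columns, as positional arguments.
        args = [row[name] for name in input_columns]
        added = function(*args)
        # Keep the input columns plus whatever the function returned; drop the rest.
        new_row = {name: row[name] for name in input_columns}
        new_row.update(added)
        result.append(new_row)
    return result


# Hypothetical sample data: "label" is not listed in input_columns.
rows = [
    {"text": "hello", "label": 0},
    {"text": "datasets", "label": 1},
]

mapped = map_with_input_columns(rows, lambda text: {"length": len(text)}, ["text"])
print(mapped)
# [{'text': 'hello', 'length': 5}, {'text': 'datasets', 'length': 8}]
```

In this model the "label" column is gone after the call. Code that relied on untouched columns surviving a map call with input_columns would need to list those columns as well or re-attach them afterwards.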
