github jorgecarleitao/arrow2 v0.5.0

2 years ago

A new release is here! 🎉🎉🎉

This one is marked by further alignment with the Arrow specification. Of special mention:

  • ✅ Added full support for async parquet write (by @GrandChaman)
  • ✅ Added fast extend_*values to MutablePrimitiveArray (by @ritchie46)
  • ✅ Added support for compute to BinaryArray (by @zhyass)
  • ✅ Added support for extension types (IPC, FFI, etc.) (by @jorgecarleitao)
  • ✅ Added support for the brand new MONTH_DAY_NANO interval type (by @jorgecarleitao)
  • 🚀 Improved performance of the calculation of null counts by 5x (by @jorgecarleitao)
  • 🔧 Made cargo features not default (by @jorgecarleitao)

As usual, there are a small number of backward-incompatible changes. See the associated issues below, which include the migration path for each of them.

Full Changelog

Breaking changes:

  • Added Extension to DataType #361
  • MonthDayNano added to enum IntervalUnit #360
  • Make io::parquet::write::write_* return size of file in bytes #354
  • Renamed bitmap::utils::null_count to bitmap::utils::count_zeros #342
  • Made GroupFilter optional in parquet's RecordReader and added method to set it. #386 (jorgecarleitao)
  • Removed PartialOrd and Ord of all enums in datatypes #379 (jorgecarleitao)
  • Made cargo features not default #369 (jorgecarleitao)
  • Prepare APIs for extension types #357 (jorgecarleitao)
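The rename of bitmap::utils::null_count to bitmap::utils::count_zeros reflects what the function computes: in Arrow, a slot is null when its validity bit is unset, so a null count is the number of zero bits in a range of the bitmap. The sketch below illustrates that idea using only the Rust standard library. The name count_zeros_sketch and its (bytes, offset, len) parameters are assumptions made for illustration; this is not arrow2's implementation and its signature is not taken from the crate.

```rust
/// Hypothetical helper (not part of arrow2): counts the unset bits among
/// `len` bits of `bytes`, starting at bit index `offset`.
/// Bits are read least-significant-bit first within each byte, which is
/// the layout Arrow uses for validity bitmaps.
fn count_zeros_sketch(bytes: &[u8], offset: usize, len: usize) -> usize {
    assert!(offset + len <= bytes.len() * 8, "bit range out of bounds");
    let end = offset + len;
    let mut i = offset;
    let mut ones = 0usize;

    // Leading bits, one at a time, until the index is byte-aligned.
    while i < end && i % 8 != 0 {
        ones += ((bytes[i / 8] >> (i % 8)) & 1) as usize;
        i += 1;
    }
    // Whole bytes: let the hardware popcount do the work.
    while i + 8 <= end {
        ones += bytes[i / 8].count_ones() as usize;
        i += 8;
    }
    // Trailing bits that do not fill a whole byte.
    while i < end {
        ones += ((bytes[i / 8] >> (i % 8)) & 1) as usize;
        i += 1;
    }
    // Every bit in the range that is not set is a zero, i.e. a null slot.
    len - ones
}

fn main() {
    // Validity bits for 4 slots: slot 1 is unset (null), slots 0, 2, 3 are set.
    let validity = [0b0000_1101u8];
    let nulls = count_zeros_sketch(&validity, 0, 4);
    println!("null count: {}", nulls);
}
```

Counting whole bytes with count_ones, rather than testing each bit, is one common way to make this kind of calculation fast; whether it matches the optimization behind the 5x improvement listed above is not stated in these notes.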

New features:

Fixed bugs:

  • Parquet read skips a few rows at the end of the page #373
  • parquet_read fails when a column has too many rows with string values #366
  • parquet_read panics with index_out_of_bounds #351
  • Fixed error in MutableBitmap::push_unchecked #384 (jorgecarleitao)
  • Fixed display of timestamp with tz. #375 (jorgecarleitao)

Enhancements:

Documentation updates:

Testing updates:
