github modin-project/modin 0.23.1
Modin 0.23.1

13 months ago

This release contains fixes that improve Modin's performance for both the NumPy and pandas APIs, and it removes the experimental Modin in the Cloud feature. It also upgrades Modin's testing suite, which significantly speeds up CI.

Key Features and Updates Since 0.23.0

  • Stability and Bugfixes
    • FIX-#0000: don't test experimental xgboost with Ray nightly build (#6424)
    • FIX-#0000: fix xgboost tests with ray>2.6.0 (#6425)
    • FIX-#1930: Fix one of the cases of heterogeneous data for read_csv (#5507)
    • FIX-#4580: Fix access by row label in query and eval (#6488)
    • FIX-#5627: Stop checking temp_df.dtype == 'category' (#6360)
    • FIX-#5972: compute correct dtype for Series.str.find/index/rfind/rindex (#6426)
    • FIX-#6219: don't default to pandas for 'copy' on empty DataFrame/Series objects (#6371)
    • FIX-#6299: array method always returns array of vanilla numpy (#6300)
    • FIX-#6334: improve error message if hdk isn't installed in the environment (#6358)
    • FIX-#6347: remove 'modin in the cloud' experimental feature (#6408)
    • FIX-#6364: Make reshuffling work with 'BenchmarkMode.put(True)' (#6365)
    • FIX-#6367: Enable support for 'groupby.size()' in reshuffling groupby (#6370)
    • FIX-#6368: Apply deferred indices before map-reduce groupby (#6369)
    • FIX-#6372: precompute dtypes for 'sum' operation (#6421)
    • FIX-#6375: don't initialize engines at import time (#6374)
    • FIX-#6386: don't make unnecessary 'astype' calls for modin.array.sum op (#6395)
    • FIX-#6396: set '__factory' to 'None' in case of any problems during initialization (#6397)
    • FIX-#6402: Allow datetime and timedelta types in diff (#6403)
    • FIX-#6405: Apply disable_logging to __getattr__ (#6406)
    • FIX-#6410: add a link to @modin_project twitter (#6411)
    • FIX-#6414: fix 'read_feather' with pyarrow<11.0 (#6415)
    • FIX-#6427: make code compatible with flake8==6.1.0 (#6428)
    • FIX-#6429: exclude pymssql==2.2.8 from environments (#6430)
    • FIX-#6436: Support ~ in paths in IO functions correctly (#6448)
    • FIX-#6443: Cast boolean columns before sum|mean|median groupby aggregations (#6444)
    • FIX-#6456: create fake xgboost module for building docs (#6457)
    • FIX-#6459: support fastparquet>=2023.1.0 (#6458)
    • FIX-#6483: Default to pandas for array_ufunc (#6486)
  • Performance enhancements
    • PERF-#6437: preserve dtypes for 'reindex' (#6438)
  • Update testing suite
    • TEST-#2008: Reduce runtime of CI checks a lot (#6356)
    • TEST-#6349: Update minimum versions for test dependencies in general environments (#6350)
    • TEST-#6469: pin numexpr<2.8.5 (#6474)
  • New Features
    • FEAT-#6407: update minimum dependency versions (#6342)
  • Uncategorized improvements
    • Release version 0.23.1 (#6495)
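
To illustrate the kind of behavior FIX-#6436 refers to (handling `~` in paths passed to IO functions), here is a minimal sketch using only the Python standard library. The helper name `normalize_path` is hypothetical and is not part of Modin's API; the actual change in #6448 may be implemented differently.

```python
import io
import os


def normalize_path(path_or_buffer):
    """Hypothetical helper (not Modin's API): expand a leading '~'.

    String and os.PathLike inputs are expanded to the user's home
    directory; anything else (for example an open buffer) is returned
    unchanged so it can be passed straight to a reader.
    """
    if isinstance(path_or_buffer, (str, os.PathLike)):
        return os.path.expanduser(os.fspath(path_or_buffer))
    return path_or_buffer


# A path that starts with '~' is resolved against the home directory.
expanded = normalize_path("~/data.csv")

# Absolute paths and in-memory buffers pass through untouched.
absolute = normalize_path("/abs/data.csv")
buffer = io.StringIO("a,b\n1,2\n")
same_buffer = normalize_path(buffer)
```

Expanding the path once, before it is handed to any worker process, avoids depending on each worker resolving `~` on its own.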

Contributors

@AndreyPavlenko
@RehanSD
@YarShev
@anmyachev
@dchigarev
@mvashishtha
@vnlitvinov
