The main purpose of this release is to port as many fixes as possible to the latest version that supports Python 3.8.
Key Features and Updates Since 0.23.1
- Stability and Bugfixes
- FIX-#0000: Pin `unidist<=0.4.1`
- FIX-#4347: `read_excel`: defaults to pandas for unsupported types of `io` (#6462)
- FIX-#4507: Do not call `ray.get()` inside of the kernel executing call queues (#6633)
- FIX-#4687: Change `Column.null_count` to return a built-in `int` instead of NumPy scalar (#6526)
- FIX-#5164: Fix `unwrap_partitions` for virtual partitions when `axis=None` (#6560)
- FIX-#5536: Remove branch disabling `__getattribute__` for experimental mode (#6529)
- FIX-#6465: Fix `groupby.apply()` for UDFs that change the output's shape (#6506)
- FIX-#6479: HDK CalciteBuilder: Do not call `is_bool_dtype()` for categorical (#6480)
- FIX-#6509: Fix `reshuffling` in case of a string key (#6510)
- FIX-#6514: `test_sort_cols_str` from `test_dataframe.py` crashed on HDK 0.7.0 and python 3.9 (#6515)
- FIX-#6516: HDK: `test_dataframe.py` is crashed if Calcite is disabled (#6517)
- FIX-#6518: Fix interchange protocol for string columns (#6523)
- FIX-#6519: Consider `botocore` as an optional dependency (#6521)
- FIX-#6532: Fix `read_excel` so that it doesn't use `rich_text` param for old `openpyxl` (#6534)
- FIX-#6535: Pin `s3fs<2023.9.0` (#6536)
- FIX-#6537: Unpin `s3fs<2023.9.0` (#6544)
- FIX-#6541: Fix `ValueError: buffer source array is read-only` for `iloc` (#6538)
- FIX-#6553: Fix `read_csv` with `iterator=True` (#6554)
- FIX-#6572: Execute simple queries row-wise in pandas backend (#6575)
- FIX-#6594: Fix usage of Modin objects inside UDFs for `apply` (#6673)
- FIX-#6600: Fix usage of list of UDF functions in `Series.groupby.agg` (#6613)
- FIX-#6601: `sort_values` shouldn't affect source dataframe/series (#6603)
- FIX-#6602: Refactor `join` to avoid `distributing a dict object` warning (#6612)
- FIX-#6607: Fix incorrect cache after `.sort_values()` (#6608)
- FIX-#6628: Allow groupby diff for dates (#6631)
- FIX-#6632: Return Series instead of Dataframe for `groupby.apply` in case of experimental groupby (#6649)
- FIX-#6635: HDK: `read_csv`: treat object dtype as string (#6636)
- FIX-#6637: Fix `skiprows` parameter usage for `read_excel` (#6638)
- FIX-#6642: Fix `modin.numpy.array.sum` on HDK (#6643)
- FIX-#6647: Added init file to make `modin/experimental/sql/hdk/query.py` part of modin package (#6646)
- FIX-#6651: Make sure `Series.between` works correctly (#6656)
- FIX-#6680: Specify `navigation_with_keys=True` to fix docs build (#6681)
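
As a rough illustration of the behaviour that FIX-#6601 describes (`sort_values` shouldn't affect the source dataframe/series), the sketch below checks that sorting returns a new object and leaves the original row order alone. It is an assumption-laden example, not code from the PR: it imports plain pandas so it runs without Modin installed, and the frame contents and variable names are made up for illustration. With Modin available, replacing the import with `import modin.pandas as pd` should exercise the same API.

```python
# Assumption: plain pandas stands in for Modin here; with Modin installed,
# `import modin.pandas as pd` is meant to be a drop-in replacement.
import pandas as pd

# Hypothetical sample data, deliberately out of order on "key".
df = pd.DataFrame({"key": [3, 1, 2], "value": ["c", "a", "b"]})
snapshot = df.copy()

# sort_values returns a new frame by default (inplace=False).
sorted_df = df.sort_values("key")

# The result is sorted ...
assert sorted_df["key"].tolist() == [1, 2, 3]
# ... while the source frame keeps its original order and contents.
assert df.equals(snapshot)
```

A check like this can be run before and after upgrading to confirm that the source frame is left untouched in a given environment.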
Contributors
@AndreyPavlenko
@Egor-Krivov
@Garra1980
@RehanSD
@anmyachev
@dchigarev
@vnlitvinov