🏆 Highlights
- Native implementation of dataframe interchange protocol (#10267)
⚠️ Deprecations
- Deprecate behavior of list/tuple inputs for `lit` (#10461)
🚀 Performance improvements
- optimise retrieval of values from `df.item` (~4-5x speedup) (#10411)
- pre-alloc int_ranges (#10399)
- use hash as CSE Identifier (#10385)
✨ Enhancements
- quote style option for csv writer (#10422)
- add "raise_if_empty" flag to `read_excel`, `read_csv`, `scan_csv`, and `read_csv_batched` (#10409)
- add `use_earliest` to `to_datetime`/`strptime` (#10426)
- add new "header_format" option for `write_excel` (#10392)
- {any/all}_horizontal to expression architecture (#10412)
- Native implementation of dataframe interchange protocol (#10267)
- allow unaligned pointers in arrow FFI (#10403)
- add line_terminator option to write_csv (#10373)
- add explicit `selector` variants for signed/unsigned integers (#10384)
- Add `is_local` and `to_local` to categorical namespace (#10372)
- enhance `selectors` expansion function, so it can operate on a schema as well as a frame (#10341)
- Order percentiles in `describe` (#10378)
- cse for groupby.agg and reduced cse collisions (#10381)
- improve take_every(0) exception (#10352)
- add offset and length to get_ptr (#10361)
🐞 Bug fixes
- fix pyarrow write_to_dataset wrt check_not_directory parameter (#10471)
- fix negative decimal parsing (#10444)
- ensure sorted_sink hash equals the default path (#10464)
- address inconsistency in init from square numpy arrays with/without an explicit schema (#10445)
- ensure last aggregation deals with default chunk (#10453)
- fix cse input schema (#10450)
- Fix by argument handling in join_asof (#10447)
- fix potential `OverflowError` in testing asserts with huge `UInt64` diffs (#10437)
- Create delta compatible schema during writing (#10165)
- fix list groupby of array dtype (#10408)
- correct AnyValue::hash (#10391)
- finalize cast in partitioned groupby (#10359)
🛠️ Other improvements
- add `vertical_relaxed` example for `pl.concat` (#10472)
- Run all streaming tests on the same test runner (#10469)
- Organize OOC tests (#10463)
- add test for 10417 (#10420)
- Clean up some `Sphinx` settings (#10400)
- add test for 10401 (#10405)
- Address Ruff per file ignores (#10258)
- Small improvement for PySeries.get_buffer (#10363)
Thank you to all our contributors for making this release possible!
@MarcoGorelli, @OndrejSlamecka, @alexander-beedie, @c-peters, @cmdlineluser, @drgif, @ion-elgreco, @lfn3, @orlp, @potzenhotz, @rea1bacon, @reswqa, @ritchie46, @stinodego and @zundertj