✨ Enhancements
- ensure ooc sort works ooc with all-constant values (#6235)
- The 1 billion row sort (#6156)
- optionally treat missing UTF8 values as the empty string at CSV parse-time (#6203)
- check file target is not an existing directory (#6187)
- support -ve indexing for DataFrame
head
andtail
methods (#6173) - Implement
DataFrame.unique(keep="none")
(#6169) - support use of explicit
Struct
dtypes on DataFrame/Series init (#6145)
🐞 Bug fixes
- Add list inner dtype when printing Series (#6233)
- strptime now respects pl.Datetime's time_unit (#6231)
- fix when then otherwise with arity and aggregation… (#6224)
- collect now uses the storage_options given to scan_parquet (#6223)
- set_sorted keep schema (#6222)
- pass name to value counts in aggregation (#6221)
- don't set fast_explode on list of structs (#6220)
- address a frame init/construction error, and expose
infer_schema_length
to frame init (#6210) - explode of empty nullable list (#6190)
- fix oob arr.take (#6189)
- Make
with_columns
inwith_columns_kwargs
mode compatible with more data types (#6126) - Update docstring
with_columns
to reflect a new dataframe is being returned (#6122) - fix empty streaming joins (#6149)
- fix streaming joins where the join order has been … (#6143)
- write tz-aware datetimes to csv (#6135)
- add null behavior for oob indices (#6133)
🛠️ Other improvements
- Create
DataFrame
from schema (#6225) - don't set aggregated flag on null propagated aggregation. (#6191)
- undo cargo.toml change (#6219)
- Improve drop_nulls docstrings (#6127)
- Clarify docstrings for
closed
argument (#6198) - minor docs and typing updates (plus additional test coverage for related areas) (#6182)
- explain n_field_strategy (#6158)
Thank you to all our contributors for making this release possible!
@MarceColl, @MarcoGorelli, @alexander-beedie, @gab23r, @ghuls, @jvanbuel, @n8henrie, @rben01, @ritchie46, @ropoctl, @sorhawell, @stinodego, @winding-lines and @zundertj