In this release we focused on improving testing infrastructure and fixing outstanding bugs. One particular longstanding bug with the Dask runtime, related to serialization, was fixed (#1096). This revealed an oppourtinity for more optimization when it comes to serialization within the repository.
Bugfixes + Pandas Concordance (🐛 + 🐼)
- Fix loc when column mask is boolean and no rows are masked (#1039)
- Fix opening a file from s3 when S is capitalized (#1045)
- Fix pd.to_datetime when using a Series object. (#1048)
- Fix issue with Dask engine where users were not able to use Da (#1050)
- Fix
as_index=False
forDataFrame.groupby
(#1041) - Fix ingesting parquet files that are coming from HDFS (#1074)
- fixed bug where .to_frame(name) ignored name; added parity wit (#1075)
- Fix column indexing when 2 or more columns have same name (#1077)
- Fix Series.getitem to accept a callable as key (#1084)
- bugfix - sometimes df.loc[s] applies to columns instead of rows (#1088)
- df.rename works with 'mapper' and 'axis' params (#1057)
- Fix case where apply a Series across the columns threw error (#1092)
- Fix Dask Serialization issue (#1096)
New Functionality ✨
- Add r operators to Series (#1086)
Code Quality + Testing 💯
- Update Test infrastructure to use testmon when possible (#1036)
- Re-enable testmon force selection (#1040)
- Update master to track coverage again (#1052)
- Update test for groupby and clean up groupby edge cases (#1055)
- Change Error to warning for pandas version pin (#1072)
- Upload coverage correctly from merged PRs (#1076)
Documentation 📃
- Update Installation Documentation (#1066)
- Documentation updates (#1097)
- Add documentation for signed-off-by policy (#1098)
Dependencies 🔗
Contributors this release
The following users contributed code to Modin since the last release.
@elonp (First time contributor) ⭐️
@KevOBrien (First time contributor) ⭐️
@devin-petersohn (Maintainer)
🎉🎉 Thank you! 🎉🎉