Features/Enhancements
- Add `s3.read_orc` and `s3.to_orc` by @LeonLuttenberger in #2312
- Apache Spark on Amazon Athena - `wr.athena.create_spark_session` & `wr.athena.run_spark_calculation` by @kukushking in #2314
- EMR Serverless by @kukushking in #2304
- Add `to_sql` for RDS Data API by @LeonLuttenberger in #2287
- Add Timestream `UNLOAD` by @kukushking in #2284
- OpenSearch parallel bulk by @kukushking in #2310
- Allow user groups to be passed in `allowed_to_use` and `allowed_to_manage` when creating QuickSight resources by @LeonLuttenberger in #2278
- Add engine/memory_format OS environment variables and delay engine initialization by @jaidisido in #2285
- Support reading with PyArrow-backed types by @LeonLuttenberger in #2292
- Support additional parameters for Neptune bulk load by @LeonLuttenberger in #2297
- Sync ray 2.4 parquet datasource by @kukushking in #2300
- Timestream: Add multi measure write record example by @mandawat in #2317
- Iceberg `PARTITIONED BY` and additional table properties support by @kukushking in #2322
- Add ability to pass schema to `s3.read_parquet` by @kukushking in #2328
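
The engine and memory-format environment variables mentioned above are meant to be set before the library is used. The following is only a minimal sketch: it assumes the variables are named `WR_ENGINE` and `WR_MEMORY_FORMAT` and accept `python`/`ray` and `pandas`/`modin` respectively (check the library documentation), and the `resolve_setting` helper is hypothetical, written here to mimic a lookup that is delayed until the value is needed.

```python
import os

# Assumed variable names and allowed values; not confirmed by these notes.
ALLOWED = {
    "WR_ENGINE": ("python", "ray"),
    "WR_MEMORY_FORMAT": ("pandas", "modin"),
}


def resolve_setting(name: str, default: str) -> str:
    """Read a setting from the environment at call time, not import time."""
    value = os.environ.get(name, default).lower()
    if value not in ALLOWED[name]:
        raise ValueError(f"{name}={value!r} is not one of {ALLOWED[name]}")
    return value


# Set the variables first, then resolve them lazily when needed.
os.environ["WR_ENGINE"] = "python"
os.environ["WR_MEMORY_FORMAT"] = "pandas"

engine = resolve_setting("WR_ENGINE", default="python")
memory_format = resolve_setting("WR_MEMORY_FORMAT", default="pandas")
print(engine, memory_format)
```

Because the lookup happens inside the function, changing the environment after import but before first use still takes effect, which is the practical benefit of delaying initialization.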
Bug fixes
- Fix recurring issue with `test_spectrum_decimal_cast` by @LeonLuttenberger in #2283
- Fix Redshift unload not escaping SQL query by @LeonLuttenberger in #2286
- Fix KeyError & add lock to athena cache manager by @kukushking in #2299
- Fix Neptune bulk load bad request by @LeonLuttenberger in #2305
- Add AWS_REGION by default to deltalake storage_options by @jaidisido in #2315
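
As background for the Redshift unload fix listed above: `UNLOAD` receives its query as a single-quoted string literal, so any single quote inside the query has to be doubled. The sketch below illustrates that idea only; it is not the library's actual implementation, and `escape_sql_literal`, `build_unload_sql`, the bucket, and the role ARN are all hypothetical.

```python
def escape_sql_literal(text: str) -> str:
    """Double every single quote so the text can sit inside '...'."""
    return text.replace("'", "''")


def build_unload_sql(query: str, s3_path: str, iam_role: str) -> str:
    """Build an UNLOAD statement with the inner query safely quoted."""
    return (
        f"UNLOAD ('{escape_sql_literal(query)}') "
        f"TO '{escape_sql_literal(s3_path)}' "
        f"IAM_ROLE '{escape_sql_literal(iam_role)}' "
        "FORMAT AS PARQUET"
    )


# Placeholder values for illustration only.
statement = build_unload_sql(
    query="SELECT * FROM venue WHERE venuestate = 'NV'",
    s3_path="s3://example-bucket/unload/",
    iam_role="arn:aws:iam::123456789012:role/ExampleRole",
)
print(statement)
```

Without the escaping step, the quote before `NV` would terminate the outer literal early and the statement would fail to parse.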
Documentation
- Add page for data_api.rds.to_sql by @LeonLuttenberger in #2291
Tests
- Add unit test for `dtype_backend` use in `read_parquet_table` by @LeonLuttenberger in #2307
- Adapt benchmark tests to Glue for Ray GA breaking changes by @jaidisido in #2316
Refactoring
- Refactor SQL formatter by @LeonLuttenberger in #2288
- Refactor engine `register_func` to handle type checking by @LeonLuttenberger in #2309
New Contributors
Full Changelog: 3.1.1...3.2.0