github aws/aws-sdk-pandas 2.13.0
AWS Data Wrangler 2.13.0

Caveats

⚠️ For platforms without PyArrow 6 support (e.g. MWAA, EMR, Glue PySpark Job):

➡️ pip install pyarrow==2 awswrangler

Breaking changes

  • Fix sanitize methods to align with Glue/Hive naming conventions #579
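
Because this change can alter the names that end up in the Glue Catalog, it may be worth previewing how existing column names are sanitized before upgrading. The following is a minimal sketch, assuming `awswrangler` is installed and using `wr.catalog.sanitize_column_name`; the helper name `preview_sanitized` and the sample names are hypothetical, and the exact output depends on the installed version, so none is shown.

```python
from typing import Dict, Iterable


def preview_sanitized(column_names: Iterable[str]) -> Dict[str, str]:
    """Map each original column name to its sanitized form."""
    # Imported lazily so this sketch can be loaded without awswrangler installed.
    import awswrangler as wr

    return {name: wr.catalog.sanitize_column_name(name) for name in column_names}


# Example call (requires awswrangler); sample names are illustrative only:
# preview_sanitized(["CamelCaseColumn", "with space"])
```

Running the helper under both the old and the new version and comparing the two mappings is one way to spot columns whose names would change.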

New Functionalities

  • AWS Lake Formation Governed Tables 🚀 #570
  • Support for Python 3.10 🔥 #973
  • Add partitioning to JSON datasets #962
  • Add ability to use unbuffered cursor for large MySQL datasets #928
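
As an illustration of the JSON partitioning item above, a partitioned write might look like the sketch below. It assumes `wr.s3.to_json` accepts `dataset` and `partition_cols` in the same way as the other dataset writers; the wrapper name `write_partitioned_json` and the S3 prefix in the comment are hypothetical, and `orient` and `lines` are simply forwarded to `pandas.DataFrame.to_json`.

```python
from typing import Any, List


def write_partitioned_json(df: Any, path: str, partition_cols: List[str]) -> Any:
    """Write a DataFrame as a partitioned JSON dataset under an S3 prefix.

    Returns whatever wr.s3.to_json returns for the installed version.
    """
    # Imported lazily so this sketch can be loaded without awswrangler installed.
    import awswrangler as wr

    return wr.s3.to_json(
        df=df,
        path=path,                      # hypothetical prefix, e.g. "s3://my-bucket/events/"
        dataset=True,                   # dataset mode is what enables partitioning
        partition_cols=partition_cols,  # one "col=value/" prefix per distinct value
        orient="records",               # forwarded to pandas.DataFrame.to_json
        lines=True,                     # one JSON object per line
    )
```

Writing one JSON object per line is a choice made here for compatibility with line-oriented readers; it is not something the release notes prescribe.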

Enhancements

  • Add awswrangler.s3.list_buckets #997
  • Add partitions_parameters to catalog partitions methods #1035
  • Refactor pagination config in list objects #955
  • Add error message to EmptyDataframe exception #991
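
The new `awswrangler.s3.list_buckets` function can be combined with ordinary Python filtering. The sketch below assumes it returns a list of bucket names and accepts an optional `boto3_session`; the helper `buckets_with_prefix` is hypothetical and valid AWS credentials are needed for a real call.

```python
from typing import Any, List, Optional


def buckets_with_prefix(prefix: str, boto3_session: Optional[Any] = None) -> List[str]:
    """Return the names of visible buckets that start with `prefix`, sorted."""
    # Imported lazily so this sketch can be loaded without awswrangler installed.
    import awswrangler as wr

    names = wr.s3.list_buckets(boto3_session=boto3_session)
    return sorted(name for name in names if name.startswith(prefix))


# Example call (requires awswrangler and AWS credentials):
# buckets_with_prefix("data-lake-")
```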

Documentation

  • Clarify docs & add tutorial on schema evolution for CSV datasets #964

Bug Fix

  • Fix exception raised by catalog.add_column() when column_comment is omitted #1017
  • Fix catalog.create_parquet_table failing when an expected dictionary key does not exist #998
  • Fix Catalog StorageDescriptor get #969

Thanks

We thank the following contributors/users for their work on this release:

@csabz09, @Falydoor, @moritzkoerber, @maxispeicher, @kukushking, @jaidisido


P.S. The AWS Lambda Layer file (.zip) and the AWS Glue file (.whl) are available below. Just upload them and run, or use them from our public S3 bucket!
