What's Changed
- PARQUET-2462: Bump to Thrift 0.20.0 by @Fokko in #1327
- Bump org.apache:apache from 31 to 32 by @dependabot in #1333
- PARQUET-2465: Fall back to HadoopConfig by @Fokko in #1339
- Update changes for the master branch by @Fokko in #1348
- PARQUET-2466: Simplify logic when calling ParquetWriter by @Fokko in #1338
- Bump com.google.api.grpc:proto-google-common-protos from 2.38.0 to 2.39.0 by @dependabot in #1346
- Bump com.google.guava:guava from 33.1.0-jre to 33.2.0-jre by @dependabot in #1345
- PARQUET-2476: Remove the maven-compiler override by @Fokko in #1341
- Bump com.fasterxml.jackson.core:jackson-databind from 2.17.0 to 2.17.1 by @dependabot in #1344
- PARQUET-2472: Close in finally block in ParquetFileWriter#end by @amogh-jahagirdar in #1350
- PARQUET-2478: Update README with link to parquet website by @alamb in #1355
- Update readme to point to 1.14.0 release by @vinooganesh in #1356
- PARQUET-2468: ParquetMetadata must convert to json by @RustedBones in #1349
- Bump org.codehaus.mojo:exec-maven-plugin from 3.2.0 to 3.3.0 by @dependabot in #1359
- Bump net.alchim31.maven:scala-maven-plugin from 4.9.0 to 4.9.1 by @dependabot in #1352
- Bump org.apache.arrow:arrow-vector from 16.0.0 to 16.1.0 by @dependabot in #1354
- PARQUET-2487: Bump io.airlift:aircompressor from 0.26 to 0.27 by @utkuaydn in #1363
- PARQUET-34: Add #contains FilterPredicate for Array columns by @clairemcginty in #1328
- Chore: Update NOTICE of parquet-avro by @yaooqinn in #1371
- PARQUET-2494: Update git attributes to mark png as binary by @emkornfield in #1372
- PARQUET-2498: Vector IO to handle empty range list by @steveloughran in #1374
- PARQUET-34: Extend Contains support to all ColumnFilterPredicate types by @clairemcginty in #1370
- Bump commons-logging:commons-logging from 1.3.1 to 1.3.2 by @dependabot in #1366
- Bump com.google.guava:guava from 33.2.0-jre to 33.2.1-jre by @dependabot in #1367
- PARQUET-2499: Update NOTICE to Apache Parquet Java by @caicancai in #1375
- Add github section in asf.yaml to enable issues by @wgtmac in #1362
- GH-2932: Adjust PR template to reference GitHub issue tracker by @rok in #2933
- Bump org.apache.maven.plugins:maven-shade-plugin from 3.5.3 to 3.6.0 by @dependabot in #2922
- Bump org.easymock:easymock from 5.2.0 to 5.3.0 by @dependabot in #2924
- GH-2930: Add issue templates by @rok in #2931
- GH-2939: Update PULL_REQUEST_TEMPLATE by @wgtmac in #2940
- MINOR: Make ColumnPath immutable by @findepi in #1380
- Bump com.google.truth.extensions:truth-proto-extension from 1.4.2 to 1.4.3 by @dependabot in #2936
- Update readme to point to 1.14.1 release by @wayneguow in #2947
- Bump com.google.api.grpc:proto-google-common-protos from 2.39.0 to 2.41.0 by @dependabot in #2937
- Bump commons-logging:commons-logging from 1.3.2 to 1.3.3 by @dependabot in #2954
- GH-2948: Fix NPE when using the AvroParquetReader.Builder with LocalInputFile by @joyCurry30 in #2949
- PARQUET-2139: fix file_offset field in ColumnChunk metadata by @etseidl in #1369
- Bump org.codehaus.mojo:build-helper-maven-plugin from 3.5.0 to 3.6.0 by @dependabot in #1365
- GH-2935: Avoid double close of ParquetFileWriter by @hellishfire in #2951
- GH-2959: Optimize the test case of parquet rewriter. by @joyCurry30 in #2960
- GH-2956: Use avro SchemaBuilder API to convert record by @RustedBones in #2957
- PARQUET-1126: Write unencrypted Parquet files without Hadoop by @dlvenable in #1376
- PARQUET-34: implement not() for Contains predicate by @clairemcginty in #2941
- Bump com.h2database:h2 from 2.2.224 to 2.3.230 by @dependabot in #2963
- Bump zstd-jni.version from 1.5.6-2 to 1.5.6-3 by @utkuaydn in #2968
- GH-2921: Remove Jackson JDK8 module by @Fokko in #1378
- Bump com.fasterxml.jackson.core:jackson-databind from 2.17.1 to 2.17.2 by @dependabot in #2973
- Bump jackson.version from 2.17.1 to 2.17.2 by @dependabot in #2964
- GH-2976: Parquet CLI compression commands should accept lowercase compression name by @pan3793 in #2977
- Bump org.apache.commons:commons-lang3 from 3.14.0 to 3.15.0 by @dependabot in #2979
- Bump org.easymock:easymock from 5.3.0 to 5.4.0 by @dependabot in #2978
- GH-2992: Gate LocalTimestamp references in AvroSchemaConverter by @clairemcginty in #2993
- GH-2952: Add maven wrapper by @nastra in #2953
- GH-2994: Optimize string to binary conversion in AvroWriteSupport by @sschepens in #2995
- GH-2988: Supports disabling statistics for specific columns by @ConeyLiu in #2989
- GH-2986: Fails the file writing when footer size exceeds int max value by @ConeyLiu in #2987
- MINOR: ParquetRewriter must close opened reader/stream by @cetindogu in #3002
- PARQUET-2430: Add parquet joiner v2 by @MaxNevermind in #1335
- GH-3009: Bump zstd-jni.version from 1.5.6-3 to 1.5.6-6 by @olegbonar in #3010
- GH-2962: Set dictionary_page_offset even when encoding_stats are missing by @mothukur in #3012
- GH-3007: Ensure version specific Jackson classes are shaded by @z-anderson in #3017
- GH-3016: Do not use dependency-reduced pom for attached shaded artifact by @RustedBones in #3015
- MINOR: Don't run all the tests on a release by @Fokko in #2999
- Bump org.tukaani:xz from 1.9 to 1.10 by @dependabot in #2985
- GH-3013: Fix potential ClassCastException at reading DELTA_BYTE_ARRAY encoding by @gszadovszky in #3019
- Bump com.google.protobuf:protobuf-java from 3.25.3 to 3.25.5 in /parquet-protobuf by @dependabot in #3018
- GH-3021: Upgrade Avro dependency by @gszadovszky in #3022
- GH-3026: ParquetRewriter fails when you try to nullify and encrypt 2 different columns by @MaxNevermind in #3027
- Bump io.airlift:aircompressor from 0.27 to 2.0.2 by @dependabot in #3023
- PARQUET-3031: Support to transfer input stream when building ParquetFileReader by @turboFei in #3030
- GH-3029: Fix EncryptionPropertiesHelper not to use java.nio.file.Path by @wgtmac in #3038
- Bump net.openhft:zero-allocation-hashing from 0.16 to 0.26ea0 by @dependabot in #3024
- Bump org.apache.arrow:arrow-vector from 16.1.0 to 17.0.0 by @dependabot in #2980
- GH-3040: DictionaryFilter.canDrop may return false positive result when dict size exceeds 8k by @pan3793 in #3041
- MINOR: Throw exception in BytesInput by @Fokko in #3042
- MINOR: Update version in README.md by @Fokko in #3045
- MINOR: Update issueManagement in pom.xml by @Fokko in #3046
- MINOR: Bump Maven parent pom by @Fokko in #3048
- MINOR: Bump Thrift to 0.21.0 by @Fokko in #3049
- MINOR: Support int value for timeOfDay in ProtoTimeConverter by @mwong38 in #3034
- MINOR: Redirect to releases by @Fokko in #3058
- MINOR: Bump latest version to 1.14.4 by @Fokko in #3057
- GH-3055: Disable column statistics for all columns by configuration by @wgtmac in #3056
- GH-3035: ParquetRewriter: Add a column renaming feature by @MaxNevermind in #3036
- Bump org.xerial.snappy:snappy-java from 1.1.10.5 to 1.1.10.7 by @dependabot in #3053
- Bump org.codehaus.mojo:buildnumber-maven-plugin from 3.2.0 to 3.2.1 by @dependabot in #3054
- Bump jackson.version from 2.17.2 to 2.18.1 by @dependabot in #3052
- GH-3059: Add configuration to disable size statistics by @wgtmac in #3060
- GH-3068: Remove parquet-scala by @Fokko in #3063
- Bump org.apache.maven.plugins:maven-dependency-plugin from 3.6.1 to 3.8.1 by @dependabot in #3064
- Bump org.apache.commons:commons-lang3 from 3.15.0 to 3.17.0 by @dependabot in #3065
- Bump commons-io:commons-io from 2.16.1 to 2.18.0 by @dependabot in #3071
New Contributors
- @amogh-jahagirdar made their first contribution in #1350
- @alamb made their first contribution in #1355
- @utkuaydn made their first contribution in #1363
- @yaooqinn made their first contribution in #1371
- @emkornfield made their first contribution in #1372
- @caicancai made their first contribution in #1375
- @rok made their first contribution in #2933
- @wayneguow made their first contribution in #2947
- @joyCurry30 made their first contribution in #2949
- @etseidl made their first contribution in #1369
- @hellishfire made their first contribution in #2951
- @dlvenable made their first contribution in #1376
- @nastra made their first contribution in #2953
- @sschepens made their first contribution in #2995
- @cetindogu made their first contribution in #3002
- @MaxNevermind made their first contribution in #1335
- @olegbonar made their first contribution in #3010
- @mothukur made their first contribution in #3012
- @z-anderson made their first contribution in #3017
- @turboFei made their first contribution in #3030
Full Changelog: apache-parquet-1.14.4...apache-parquet-1.15.0-rc1