github apache/gravitino v1.1.0
Apache Gravitino 1.1.0

22 hours ago

We are glad to announce the release of Apache Gravitino 1.1.0! This release builds upon the solid foundation laid by Apache Gravitino 1.0.0, introducing a range of new features, improvements, and bug fixes that enhance the platform's capabilities, performance, and security.

Highlights

  • Broader catalog support (initial Lance REST service, a reusable lakehouse-generic catalog, and Hive3) to simplify integration with diverse lakehouse deployments.
  • Stronger metadata-level authorization and security hardening for the Iceberg REST surface.
  • Multi-cluster fileset support and Python client improvements for real-world multi-region and migration workflows.
  • Stability, performance and observability work across the entity-store, caches, scan planning, connectors and CI — reducing operational friction and test flakiness.

New Features

  1. Built for the Future of AI Data: Lance REST service. #8889

As AI and ML workflows become central to data platforms, efficient access to vector data is crucial. The new Lance REST service exposes Lance datasets through a managed HTTP interface. This allows remote clients—such as inference services or notebooks—to access vector data with the high performance of the Lance format, all while adhering to Apache Gravitino's centralized security and governance policies.

  1. Generic lakehouse catalog. #8828

The lakehouse ecosystem is diverse and rapidly evolving, with new table formats and engines emerging frequently. To keep pace, we introduced a generic lakehouse catalog framework. This abstraction reduces the boilerplate code required to integrate new engines, standardizing how capabilities are negotiated and how namespaces are handled. This means faster support for new formats and a more consistent experience for developers and users alike.

  1. Access control for Iceberg REST service. #4290

The Iceberg REST catalog is becoming the standard for open table access, but production use demands robust security. We have hardened the Iceberg REST service with comprehensive authentication and authorization checks. This ensures that data accessed via standard Iceberg clients is fully protected, making Apache Gravitino a secure choice for multi-tenant and public-facing data lake deployments.

  1. Hive 3 catalog support. #5912

Many enterprises still rely on Hive 3 for their core data infrastructure, making migration a risky and complex endeavor. This feature allows users to register existing Hive 3 metastores directly as Apache Gravitino catalogs. By doing so, organizations can instantly bring their legacy data under Apache Gravitino's unified governance and management umbrella without moving data or disrupting existing workloads, paving the way for a smoother transition to modern lakehouse architectures.

  1. Multiple HDFS clusters support. #9117, #9288

In large-scale production environments, data is often distributed across multiple HDFS clusters to ensure isolation and disaster recovery. Previously, Apache Gravitino was limited in how it handled these complex topologies. With this release, users can manage filesets across multiple HDFS clusters within a single Apache Gravitino instance. This capability simplifies cross-cluster data management, improves resource isolation, and provides greater flexibility for multi-tenant architectures.

  1. Metadata authorization for IRC, statistics, tags, jobs, and policies. #4361, #8752, #8944, #8943

True governance requires securing every aspect of the metadata platform. We have expanded fine-grained authorization to cover auxiliary resources like tags, statistics, and background jobs. This enhancement closes previous security gaps, ensuring that all user interactions with the system—whether viewing statistics or managing tags—are strictly governed by least-privilege policies.

  1. New Iceberg REST endpoints. #6336

To support the full range of capabilities expected by modern analytics tools, we have implemented additional endpoints from the Iceberg REST specification. This improves compatibility with the latest query engines and clients, ensuring that users can leverage advanced planning and catalog operations without running into compatibility issues.

Improvements

Core & Server

  • Entity store and Cache: Fixed several performance and logic issues to improve stability and speed. #8697, #8743, #8815, #8817, #8710, #9148, #7916, #8546
  • Metrics: Expose more metrics for server and catalogs to enhance observability. #8594
  • Authorization: Refined permission checks. #7942.
  • Resource management: Improved resource release and closure mechanisms to prevent leaks. #8981, #9002, #8999
  • JDBC metric store: Support storing Iceberg metrics in JDBC. #8899
  • Job system enhancement: Support job alteration. #8638, #8814

Catalogs & Connectors

  • Iceberg catalog: Support metadata cache. #8314
  • Upgrade Iceberg to 1.10.0 to support scan planning. #9046
  • Improve dynamic config provider for better usability. #8970
  • Fileset catalog: Prevented filesystem instances from hanging for a long time. #9280
  • Trino connector: Support SQL UPDATE/DELETE/MERGE. #8241
  • Fix getTableStatistics in GravitinoMetadata. #9100

Clients

  • GVFS client: Improved stability and error handling. #8752, #8882, #8948, #9853.
  • Fileset bundle JARs: Refactored for a more detailed delivery strategy. #9106
  • Python client: Added support for relational catalog. #5198

Developer Experience & Operations

  • Helm chart: Enhanced configuration options and stability. #8747, #8174
  • GitHub templates: Added templates to support AI coding. #9227.
  • Tests: Refactoring and enhancement of test suites. #9223, #9107
  • Docker: Changed Apache Gravitino Docker base image. #8817
  • Code Style: Upgrade Google Java Format to support JDK 17. #8792.

Frontend Updates

  • Added pagination for files list. #8987
  • Displayed the index type in UI. #6997
  • Upgraded dependabot affected versions. #9357
  • Fixed routing issue where path '/' may not route to 'metalakes'. #9354

Bug Fixes

  • Create topic encounters NoSuchTopicException when Kafka is deployed with 3 brokers on EKS. #4158
  • Apache Gravitino IRC server returns java.lang.NoSuchMethodError: void org.apache.hadoop.security.HadoopKerberosName.setRuleMechanism. #8754
  • Several bugs in SQL provider. #8659, #9166
  • Unknown error when using fsspec through JNI. #8858

Still, there are many bugs that have not listed due to limited space. Please refer to the full list of issues and pull requests merged since the 1.0.0 release for more details.

Acknowledgements

Thanks to everyone who contributed to the 1.1.0 work — code, reviews, tests, issue triage, design, and feedback. Below is a consolidated list of contributor GitHub IDs extracted from issue and PR activity.

@10000JI, @Abyss-lord, @bharos, @chanwonlee, @chenwyi2, @danhuawang, @diqiu50, @egvr2002, @FANNG1, @HaJunYoo, @HarshMehta112, @hdygxsj, @HugoSalaDev, @hqbhoho, @Jackeyzhe, @jerqi, @jerryshao, @JoelVR2k, @joeyutong, @juiceyang, @justinmclean @KayMas2808, @keepConcentration, @khawaja-abdullah, @Kook-s, @LauraXia123, @LiuQhahah, @mchades, @namest504, @pandeysambhi, @PCloud63514, @Pikady, @pratyushkumar211, @raveendra11, @ruanwenjun, @SalmaneKhalili, @shaofengshi, @shunki-fujita, @SteveArseneLee, @sunxiaojian, @tsungchih, @yangyuxia, @youngyjd, @yuqi1129, @zhan7236

Don't miss a new gravitino release

NewReleases is sending notifications on new releases.