And rightly, because the more we all work in data management and analytics, the more we need a mix of flexible and extensible technologies that can work across our environments with a common objective, common APIs, interoperability and common needs. Snowflake recently announced the public preview of its Polaris Catalog, a move that will accelerate the path to a vendor-independent, open catalog for Apache Iceberg behind the scene, making your enterprise data all the more open.
In this development, Snowflake announced the initial release of Polaris Catalog, an open data catalog that can be used to index and manage data following the Apache Iceberg table format. Amazingly, Snowflake built a new data catalog that is open-sourced. Polaris will be gradually rolled out over the coming 90 days. The ability of query engines to communicate and cooperate with one another enables enterprises to get the most out of their data.
Greater use of open table formats such as Delta Lake and Apache Iceberg point to a one-step change to open enterprise data management. Going forward, the challenge will be achieving interoperability in a world of such platforms. Snowflake’s Polaris Catalog represents the first open standard for developers to connect via a user-facing catalog to fetch and retrieve against any engine, providing sample metadata and ‘get out of jail free’ enabling truly interoperable connections, even without a basic ability to connect and query across many leading data engines.
At the most basic level, Polaris Catalog adheres to the principles of vendor neutrality and flexibility. The open-source REST protocol used to access and retrieve data is built on Iceberg. This means that any engine can be used, and all can talk to each other, as long as they are compliant with the Iceberg file format and REST API. That ethos extends to the use of the tool by enterprises in the real world. Instead of being chained to a proprietary vendor who controls data access, it removes fear of vendor lock-in since an enterprise can switch out hosting infrastructure as it sees fit. Furthermore, they can run it on their own infrastructure, in a data cloud such as Snowflake, or in container technologies such as Docker or Kubernetes, in ways never before possible.
Recognising the vital importance of security, Snowflake continues to improve Polaris security capabilities. Among other things, Snowflake plans to develop a capability to ensure that security permissions are consistent across various engines in the multiple-engine environment that Polaris envisions. Snowflake’s goal is to ensure that data integrity and authorised access are preserved in that open ecosystem.
Meanwhile, Snowflake has scheduled the first Polaris enterprise preview for June, during which time AWS, Confluent, Dremio, Google Cloud, Microsoft Azure, Salesforce and others – the very same companies that some commentators had accused of stifling competition during Snowflake’s IPO – have announced plans to integrate with Polaris. The reaction from the broader industry is a sign that many enterprises are hungry for a greater number of open, democratic data-management options.
The excitement around the release of Polaris Catalog has spread to other industry constituencies as well. Tomer Shiran, the founder of Dremio, made the case that ‘Open ecosystems must thrive, customers deserve more control over storage, data and metadata in a vendor neutral way that fosters interoperability – no vendor lock-in … Open standards such as Apache Iceberg, teamed up with open catalog projects such as Nessie and Polaris Catalog will help enable this ecosystem interoperability, giving customers the freedom of choice they deserve.’
The term ‘open’ in the context of Polaris Catalog and, by extension, data management, embodies the ideas of transparency, interoperability and freedom from proprietary constraints. The open regime fosters a data ecosystem where enterprises are unshackled by the tools they have chosen, and by contrast encouraged to be more open to pick the best-suited solution that meets their specific data management requirements. The open initiative undertaken by Snowflake with Polaris Catalog shows us the path to the future of data when data is not just open but also operable across platforms for operational efficiency and analytical insights.
When viewed in this light, it’s clear that Snowflake’s Polaris Catalog is a big step toward an open, connected future of data management. An open ecosystem that embraces flexibility, neutrality, and security is one that will meet the needs of tomorrow’s enterprises and set a new standard for how we can use and govern data. As we await the rollout of Polaris, the enthusiasm shown by members of our industry community is an encouraging sign that there’s an appetite for a more open, connected way of doing data analytics and management. By opening up this world, Snowflake is also opening the door to a more open future.
© 2024 UC Technology Inc . All Rights Reserved.