Databricks Follows Cloudera by Adopting Iceberg, Whereas Snowflake Mulls Open Supply Strategy


A continuing stream of breaking information from the information lakehouse house is making notable tech headlines this week.

On Tuesday, Databricks introduced that it’s going to purchase Tabular, a knowledge administration firm based by the creators of Apache Iceberg, Ryan Blue, Daniel Weeks, and Jason Reidfor. The deal was for an unconfirmed sum, however some stories counsel that quantity to be between $1B and $2B (and allegedly outbidding Snowflake). The transfer goals to unify the 2 hottest open-souce lakehouse codecs – Apache Iceberg and Linux Basis Delta Lake – to reinforce knowledge compatibility throughout totally different codecs.

The prior day, Snowflake – nonetheless coping with the aftermath of final week’s knowledge breach – introduced Polaris Catalog, a vendor-neutral, open catalog for Apache Iceberg. The corporate additionally introduced at its annual consumer convention that Polaris Catalog will likely be open sourced within the subsequent 90 days.

So, how do you make sense of all these bulletins and what does this imply to you? 

Iceberg is the Champion within the Desk Format Struggle

Databricks placing this a lot worth in Iceberg is proof that Delta Lake has misplaced the desk format warfare, and Iceberg is the clear winner. Iceberg will additional turn into, and can stay, the de facto commonplace for large-scale knowledge and analytics deployments for the long term. 

Cloudera was a first mover in adopting Iceberg as central and native to our knowledge, analytics, and AI platform – reinforcing our credibility as the most effective vendor to work with once you need managed Iceberg knowledge estates, at scale, throughout all clouds and on-premises. 

How Open is Your Open Supply?

Regardless of its claims because the open knowledge lakehouse firm, Databricks is NOT well-known for being true to open supply. Not like Tabular, Databricks has made business variations as proprietary implementations of open supply know-how in a bid to retain buyer lock-in, and it’ll stay to be seen if this transfer adjustments that strategy. 

Cloudera is a impartial social gathering that manages Iceberg with out vendor lock-in and at scale – in all clouds and on-premises. Cloudera additionally counts as prospects most of the different giant organizations that immediately contribute to the mission.  That’s actually open supply.

Tabular Does Not Personal Iceberg

Tabular was based by the originators of the Iceberg mission. The corporate has about 20% of the Iceberg contributors and committers on workers (corporations like AWS, Google, Dremio, Starburst, Adobe, Apple, Netflix, and extra), which make up the majority of the contributions. It has a wholesome neighborhood, not like Delta Lake, and quite a lot of massive tech corporations who’re invested in maintaining it open supply and vendor impartial.

It is a dangerous and expensive acquisition by Databricks, notably if the 80% of the committers resolve that different committer affiliations weaken the mission to stay open supply for all.

Welcome to the Occasion

Cloudera has been forward of this sport for years. Our 2022 open lakehouse place weblog publish was basically the blueprint for the Databricks acquisition announcement

Iceberg has, and continues to be, central to Cloudera’s open knowledge lakehouse structure throughout hybrid clouds – not simply one thing for use on the aspect. Databricks failed to realize adoption for Delta Lake from communities and third-party distributors, and now should make this BIG and expensive wager. On the similar time, Snowflake’s Polaris catalog timing reveals that they’ve been compelled into this house because the market and prospects have moved Iceberg because the central desk format for his or her knowledge two years after Cloudera.

They’re each not solely late to affix the social gathering, however will miss the enjoyable–and alternative–as they play catch as much as these of us who’ve been right here from the beginning. 

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *