Apache Ozone + Iceberg Meetup: Bangalore Chapter

by Tanvi Penumudy, Mohammad Arafat Khan, Krishna Asawa

Engineering@Cloudera
Apr 5, 2023

Discover how Apache Iceberg and Apache Ozone can help your organization deliver timely business insights in the era of exabyte-scale data analytics

Ozone? Iceberg?

Are we talking about climate change?

While the words Iceberg and Ozone may conjure up images of climate change, they are actually two of the hottest Apache projects in the tech world today.

The in-person meetup featured a session by Karthik Krishnamoorthy, Senior Director, Product Management, on how Cloudera brings these cutting-edge technologies together to deliver an industry-leading, on-premises open data lakehouse that solves your biggest challenges.

Cloudera organized the Ozone & Iceberg Meetup at its Bangalore office on 28th March for its customers and partner teams, with the aim of introducing them to Ozone and Iceberg. A total of 23 attendees joined, including Cloudera customers such as Wipro, Reliance, Mobelium, Canara Bank, Kyndryl, Ujjivan Bank and more, highlighting the widespread interest in our products.

Cloudera manages a colossal amount of data, estimated to be over 1 ZB on-premises and in the cloud, and is focusing on implementing Iceberg as the new table format and Ozone as the new storage system across the board. The Meetup was an excellent opportunity for customers to learn how adopting Ozone and Iceberg can help them transform their businesses, optimize expenses, minimize risks, and grow revenues, among many other benefits.

The Meetup was led by Karthik Krishnamoorthy, Senior Director, Product Management. Avijeet and Rampradeep from the SE team, along with Nandakumar and Dharmik from Engineering, helped answer attendees' queries. Together they provided an in-depth understanding of the features, benefits, and capabilities of our solutions.

Apache Ozone is a secure, scalable, high-performance object store designed to hold structured, unstructured, and binary data, enabling enterprises to read, write, and run applications at scale. It natively supports both the Amazon S3 interface and the Hadoop Compatible File System interface. Ozone’s architecture is designed to meet the high performance requirements of diverse workloads while scaling to billions of objects and hundreds of petabytes across dense, distributed storage nodes.
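To make the dual-interface idea concrete, here is a minimal sketch of how one stored object could be addressed from each side. It assumes the Ozone default in which buckets created through the S3 gateway live under a volume named `s3v`; the bucket, key, and the OM service ID `ozone1` are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass

# Assumption: S3-created buckets sit under the "s3v" volume (Ozone's default).
S3_VOLUME = "s3v"


@dataclass(frozen=True)
class OzoneKey:
    """One object in Ozone, identified by its bucket and key."""
    bucket: str
    key: str

    def s3_uri(self) -> str:
        # Address as seen by an S3 client talking to the S3 gateway.
        return f"s3://{self.bucket}/{self.key}"

    def ofs_uri(self, om_service: str) -> str:
        # Address as seen by a Hadoop-compatible client using the ofs:// scheme.
        return f"ofs://{om_service}/{S3_VOLUME}/{self.bucket}/{self.key}"


# Hypothetical object and OM service ID, for illustration only.
obj = OzoneKey(bucket="sales", key="2023/03/orders.parquet")
print(obj.s3_uri())
print(obj.ofs_uri("ozone1"))
```

Both addresses point at the same stored data, so an S3-based application and a Hadoop-based job can share it without a copy.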

Apache Iceberg is an open table format for large-scale analytic tables. It provides ACID-compliant tables with atomic and isolated transactions, high-throughput reads, efficient querying of data from various sources, and time travel queries. Its partitioning techniques and multiple layers of metadata files further enhance performance. By leveraging these capabilities, organizations can unlock the full potential of their data.
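The idea behind time travel can be illustrated with a toy model. This is not Iceberg's API or metadata layout, only a simplified sketch under the assumption that every commit publishes a new immutable snapshot and that commits arrive in increasing timestamp order; the class and file names are hypothetical.

```python
from bisect import bisect_right
from dataclasses import dataclass, field


@dataclass
class ToyTable:
    """Toy model: each commit appends an immutable snapshot of the file list."""
    _times: list = field(default_factory=list)
    _snapshots: list = field(default_factory=list)

    def commit(self, ts: int, added_files: list) -> None:
        # A commit never mutates earlier snapshots; it publishes a new one.
        # Assumes ts is larger than every previously committed timestamp.
        current = self._snapshots[-1] if self._snapshots else ()
        self._times.append(ts)
        self._snapshots.append(current + tuple(added_files))

    def files_as_of(self, ts: int) -> tuple:
        # Time travel: pick the latest snapshot committed at or before ts.
        i = bisect_right(self._times, ts)
        return self._snapshots[i - 1] if i else ()


table = ToyTable()
table.commit(100, ["a.parquet"])
table.commit(200, ["b.parquet"])
print(table.files_as_of(150))  # table state before the second commit
print(table.files_as_of(250))  # table state after both commits
```

Because older snapshots are never rewritten, a reader pinned to an earlier point in time sees a consistent table even while new commits land.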

The session was very well received with plenty of interaction with our customers and partner teams. Our customers showed curiosity about the projects and asked challenging questions.

The attendees showed strong interest in adopting the new storage solutions and were especially keen on some of the differentiating features: for Ozone, Erasure Coding and Snapshot support for both the S3 and filesystem interfaces; for Iceberg, ACID-compliant tables with atomic and isolated transactions, partitioning techniques, and time travel queries.

In conclusion, the Meetup provided a valuable opportunity for Cloudera’s customers and partners to gain insights into how Apache Ozone and Apache Iceberg offer better scalability, performance, and features compared to HDFS and Hive, and how these solutions can benefit their businesses. The widespread interest and enthusiasm displayed during the Meetup indicate that Ozone and Iceberg have the potential to revolutionize the big data storage and processing landscape.
