Data lakehouses (in under 3 minutes)

Mark Needham

What is a data lakehouse, and why is everyone talking about them? This quick explainer covers everything you need to know.

We'll start by discussing data lakes and their common challenges—schema evolution, data integrity, query performance, data discovery, and access control—and then show you how the four-layer lakehouse architecture solves these problems.

You'll learn about open table formats like Iceberg, Delta Lake, and Hudi, plus how data catalogs like AWS Glue and Unity Catalog complete the picture. By the end, you'll understand why lakehouses are becoming the go-to solution for modern data architecture.

https://clickhouse.com/engineering-resources/data-lakehouse

Recent videos

YouTube Video: GwCRcRa8f3A

Open House

Open House 2026: Day 1 Keynote

The latest ClickHouse announcements, featuring real-world use cases from Shopify, Zoox, Visa, and Cisco.

YouTube Video: ZtvlCz7Ukg4

Open House

Fireside Chat: The state of data and AI with Bret Taylor (Sierra) and Aaron Katz (ClickHouse)

Aaron Katz (CEO, ClickHouse) and Bret Taylor (Co-Founder Sierra, Chairman of the Board, OpenAI) have an open conversation on the state of AI.

YouTube Video: FmS7VopaqNg

Open House, ClickHouse

How to build a great database (Alexey Milovidov)

The principles behind building a great database, and the new frontiers shaping the field.

Follow us

XBlueskySlackGithubTelegramMeetupRSS