Data lakehouses (in under 3 minutes)
Mark Needham
What is a data lakehouse, and why is everyone talking about them? This quick explainer covers everything you need to know.
We'll start by discussing data lakes and their common challenges—schema evolution, data integrity, query performance, data discovery, and access control—and then show you how the four-layer lakehouse architecture solves these problems.
You'll learn about open table formats like Iceberg, Delta Lake, and Hudi, plus how data catalogs like AWS Glue and Unity Catalog complete the picture. By the end, you'll understand why lakehouses are becoming the go-to solution for modern data architecture.
Recent videos
View all Videos
Open House
Open House 2026: Day 1 Keynote
The latest ClickHouse announcements, featuring real-world use cases from Shopify, Zoox, Visa, and Cisco.

Open House
Fireside Chat: The state of data and AI with Bret Taylor (Sierra) and Aaron Katz (ClickHouse)
Aaron Katz (CEO, ClickHouse) and Bret Taylor (Co-Founder Sierra, Chairman of the Board, OpenAI) have an open conversation on the state of AI.

Open House, ClickHouse
How to build a great database (Alexey Milovidov)
The principles behind building a great database, and the new frontiers shaping the field.

Open House
Fireside Chat: Ecosystem and technology trends (Vercel, dbt Labs, CoreWeave)
Aaron Katz (CEO, ClickHouse), Guillermo Rauch (CEO, Vercel), Tristan Handy (CEO, dbt Labs), and Lukas Biewald (SVP of AI, CoreWeave) discuss how AI is changing the data landscape.