An intro to column-based storage

Mark Needham

In this video, we explore column storage, the backbone of column stores or column databases like ClickHouse. We compare the concept to the more familiar row-based storage used in many relational databases, using a practical example of weather data to illustrate the differences.
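To make the comparison concrete, here is a minimal sketch of the two layouts in plain Python. The weather readings, field names (`city`, `date`, `temp_c`), and helper functions are hypothetical, chosen only for illustration; they are not taken from the video, and this is not how ClickHouse stores data internally.

```python
# Hypothetical weather readings; the values are made up for illustration.
# Row layout: each record keeps all of its fields together.
rows = [
    {"city": "London", "date": "2024-01-01", "temp_c": 7},
    {"city": "London", "date": "2024-01-02", "temp_c": 9},
    {"city": "Paris", "date": "2024-01-01", "temp_c": 6},
    {"city": "Paris", "date": "2024-01-02", "temp_c": 8},
]

# Column layout: each field's values are stored together in one list.
columns = {name: [row[name] for row in rows] for name in rows[0]}


def avg_temp_rows(rows):
    """Average temperature from the row layout: every record is visited."""
    return sum(row["temp_c"] for row in rows) / len(rows)


def avg_temp_columns(columns):
    """Average temperature from the column layout: only one column is read."""
    temps = columns["temp_c"]
    return sum(temps) / len(temps)


print(columns["temp_c"])
print(avg_temp_rows(rows), avg_temp_columns(columns))
```

Both functions return the same average, but the column version touches only the `temp_c` list and never looks at `city` or `date`, which is the property analytical queries benefit from.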

Column storage is more than a different way to organize data: because each column's values are stored together, analytical queries can read only the columns they need, and runs of similar values compress well. We'll explore why this approach is particularly well-suited for modern data analysis needs and how it aligns with current CPU architectures.

Key points covered:

  • Comparison of row-based vs. column-based storage layouts
  • Advantages of column storage for data compression and efficient querying
  • Examples of compression techniques like dictionary encoding and delta encoding
  • How column storage enables faster analytical queries and aggregations
  • The benefits of column storage for CPU cache usage and SIMD operations
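The two compression techniques named above can be sketched in a few lines of Python. This is an illustrative sketch only: the function names and sample values are assumptions, not code from the video, and real column stores implement these encodings at the byte level.

```python
def dictionary_encode(values):
    """Replace each value with a small integer code into a list of distinct values."""
    dictionary = []   # distinct values, in order of first appearance
    index = {}        # value -> position in `dictionary`
    codes = []
    for value in values:
        if value not in index:
            index[value] = len(dictionary)
            dictionary.append(value)
        codes.append(index[value])
    return dictionary, codes


def dictionary_decode(dictionary, codes):
    """Rebuild the original values from the dictionary and the codes."""
    return [dictionary[code] for code in codes]


def delta_encode(values):
    """Keep the first value, then store each value's difference from the previous one."""
    if not values:
        return []
    return [values[0]] + [b - a for a, b in zip(values, values[1:])]


def delta_decode(deltas):
    """Rebuild the original values by taking a running sum of the deltas."""
    values = []
    total = 0
    for delta in deltas:
        total += delta
        values.append(total)
    return values


# Hypothetical column contents, made up for illustration.
cities = ["London", "London", "Paris", "Paris", "London"]
timestamps = [1704067200, 1704067260, 1704067320, 1704067380]

city_dictionary, city_codes = dictionary_encode(cities)
timestamp_deltas = delta_encode(timestamps)

print(city_dictionary, city_codes)
print(timestamp_deltas)
```

Dictionary encoding pays off when a column has few distinct values, such as city names, while delta encoding suits values that change slowly or at a regular interval, such as timestamps. Both depend on a column's values being stored next to each other.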
