Hello, and welcome to the September 2025 ClickHouse newsletter!
This month, Tom Schreiber explains how GROUP BY
works at scale, we have a new Parquet reader, Airbyte has built a first-class ClickHouse connector, we learn about Laminar’s "flight recorder for AI", and more!
Featured community member: Gaurav Sen #
This month's featured community member is Gaurav Sen, Founder at InterviewReady.

Gaurav Sen is a software engineer and educator with over 650,000 YouTube subscribers, known for making complex system design concepts accessible to developers worldwide. With engineering experience at Uber, DirectI, and Morgan Stanley, he brings real-world expertise in building large-scale distributed systems and real-time analytics platforms.
Gaurav recently created popular videos walking through ClickHouse’s OpenAI and Tesla use cases. His videos help developers understand how ClickHouse can solve complex analytical problems, making advanced data engineering concepts approachable for his broad audience of software engineers and system designers.
Upcoming events #
Open House Roadshow #
In case you missed the ClickHouse Open House conference back in May, we’re taking it on tour! We’ll visit four cities in October and hope to see some of you there.
Each event will include keynotes, deep-dive talks, live demos, and AMAs with ClickHouse creators, builders, and users, as well as the opportunity to network with the ClickHouse community.
- Sydney User Conference - October 2
- NYC User Conference - October 6
- Bangalore User Conference - October 7
- Amsterdam User Conference - October 27
Global events #
- v25.9 Community Call - September 25
- Introducing ClickStack: The Future of Observability on ClickHouse - September 23
Virtual training #
- ClickHouse Deep Dive Part 1 - September 24
- Observability at Scale with ClickStack - October 1
- Observability at Scale with ClickStack - October 14
Events in AMER #
- Boston ClickHouse Meetup - September 19
- Denver Meetup - September 22
- Iceberg Bay Area Meetup - October 1
- Seattle AI Meetup - October 2
- ClickHouse Deep Dive Part 1 - In-Person Training (New York) - October 7
- Atlanta In-Person Training - Observability at Scale with ClickStack - November 10
Events in EMEA #
- BigData London - September 24-25
- PyData Amsterdam - September 24-25
- AWS Cloud Day Riyadh, September 29
- ClickHouse Meetup in Madrid - September 30
- Meet The ClickHouse Team: "Real-Time Data & AI: Best Practices with AWS & ClickHouse" (Riyadh, Kingdom of Saudi Arabia) - September 30
- ClickHouse Meetup in Madrid - September 30
- ClickHouse Meetup in Barcelona - October 1
- BigData Paris - October 1-2
- AI in ClimateTech Panel - Amsterdam C-Level Meetup - October 7
- SRE Con Dublin - October 7-9
- ClickHouse Meetup in Zürich - October 9
- Gitex Dubai - October 13-17
- ClickHouse Meetup in London - October 15
- AWS Community Day Budapest - October 16
- Amsterdam User Conference - October 28
- ClickHouse Deep Dive Part 1 In-Person Training (Amsterdam) - October 28
- BigData & AI World Madrid - October 29-30
- Gartner IT Barcelona - November 10-13
- ClickHouse Meetup in Cyprus - November 20
Events in APAC #
- ClickHouse Beijing Meetup - September 20
- ClickHouse Pune Meetup - September 20
- Data & AI Summit Singapore - September 24
- ClickHouse Singapore Meetup - September 25
- ClickHouse Melbourne Meetup - September 30
- ClickHouse (クリックハウス) Tokyo Meetup - September 30
- ClickHouse Fundamentals (Tokyo, Japan) - September 30
- Sydney In-person Training - ClickHouse Deep Dive Part 1 - October 2
- Bangalore In-person Training - ClickHouse Deep Dive Part 1 - October 7
25.8 release #

The most exciting feature in the 25.8 release is the new native Parquet reader, which has page-level parallelism and PREWHERE
pushdown. It delivers 1.8x faster performance while scanning 99.98% less data.
ClickHouse can also write data with Hive-style partitioning, has Arrow Flight integration, and even better Data Lake support.
Clickhouse Spotlight: How Airbyte built a first-class destination connector #

Airbyte recently released a new first-class ClickHouse destination connector that delivers over 3× improved performance, supports single-sync loads of datasets larger than 1 TB, and preserves correct data types for a cleaner, more structured load.
4 common ClickHouse mistakes (and how to fix them) #

Nimrod Kir'on highlights four common mistakes that can trip up new ClickHouse users - from mis-tuned setups to schema design choices - and offers practical advice to get started on the right foot.
How Laminar is using ClickHouse to reimagine observability for AI browser agents #

Laminar built a "flight recorder for AI" that captures what browser agents see and do - tackling observability for one of the hottest areas in AI development today. It uses ClickHouse Cloud to transform billions of DOM events into instant video-like replays, making debugging AI agents as simple as watching YouTube.
Scaling GROUP BY to 8,900+ cores: The engineering behind ultra-fast aggregations in ClickHouse #

It's time for another Tom Schreiber deep dive! This time, Tom explores ClickHouse's parallel replicas feature and how it scales GROUP BY operations across thousands of cores.
In his latest technical breakdown, Tom demonstrates how ClickHouse processes 100 billion rows in just 414 milliseconds—about the time it takes to snap your fingers, by leveraging mergeable partial aggregation states that enable elastic scaling across thousands of cores
Quick reads #
- Igor Gorbenko shows how you can combine CatBoost, open accident datasets, and ClickHouse to power a “route safety engine” that evaluates risk across a journey and suggests safer paths, not just faster ones.
- Himnish Hunma explores how Klaviyo uses ClickHouse under the hood, and why adding “one more abstraction” unlocked the flexibility needed to scale their real-time analytics.
- Benjamin Wootton argues that ClickHouse Cloud lets teams collapse entire modern data stacks into one powerful engine, eliminating multiple systems, cutting down on ETL, reducing redundancy, and delivering fresh analytics from raw data without sacrificing performance.
- Our very own Mike Shi was invited to the OpenObservability Talks podcast to talk about ClickStack, the open-source observability stack we introduced just a few months ago that unifies logs, metrics, traces, and session replay.
- Liza Katz dives into the subtleties of using polygon dictionaries in ClickHouse, unpacking how overlapping shapes, key layouts, and lookup semantics can catch you off guard unless you're aware of the gotchas.