Hello, and welcome to the August 2025 ClickHouse newsletter!
This month, we have lightweight updates introduced in ClickHouse 25.7, LLM observability for LibreChat with ClickStack, a Gemini vs Claude SQL bake-off, and more!
Featured community member: Gene Makarov #
This month's featured community member is Gene Makarov, CTO at SWOT Mobility.

As CTO of SWAT Mobility, he leads data science teams creating algorithms for demand-responsive public transit systems, dealing with real-time GPS data processing, traffic analysis, and logistics optimization for millions of commuters.
Gene is a long-time user of ClickHouse, picking it up for the first time almost a decade ago, just after it became open-source! Recently, he tackled an interesting challenge: rendering vector tiles directly from ClickHouse for mapping engines like MapboxGL and MapLibre.
Building on Alexey Milovidov's bitmap tile example, Gene created an elegant solution using an intermediate Golang server to generate MVT format tiles. His weekend experiment demonstrates how ClickHouse can efficiently serve hundreds of millions of geospatial points for real-time mapping applications.
Upcoming events #
Global events #
- v25.8 Community Call - August 28
Virtual training #
- Observability at Scale with ClickStack (Virtual) - August 27
- ClickHouse Query Optimization (Virtual) - August 27
- ClickHouse Deep Dive Part 1 - September 3
- ClickHouse Query Optimization Workshop - September 11
- ClickHouse Deep Dive Part 1 - September 24
Events in AMER #
- San Francisco ClickHouse Meetup - August 26
- Menlo Park In-Person Training - Observability at Scale with ClickStack: Logs, Metrics, Traces and RUM on ClickHouse - August 27
- San Francisco In-Person Training - Observability at Scale with ClickStack: Logs, Metrics, Traces and RUM on ClickHouse - August 28
- Toronto ClickHouse Meetup - September 3
- Raleigh ClickHouse Meetup - September 4
- AWS Summit Toronto - September 4
- ClickHouse Deep Dive Training - Toronto - September 5
- New York ClickHouse + Docker Meetup - September 8
- AWS Summit Los Angeles - September 17
- ClickHouse Deep Dive Part 1 - In-Person Training (New York) - 7 October
Events in EMEA #
- Tech BBQ Copenhagen - August 27-28
- ClickHouse Meetup in Tel Aviv - September 9
- AWS Summit Zurich - September 11
- HayaData Tel Aviv, September 16
- ClickHouse Meetup in Dubai - September 16
- Roundtable: "Real-Time Data & AI: Best Practices with AWS & ClickHouse" - September 17
- BigData London - September 24-25
- PyData Amsterdam - September 24-25
- AWS Cloud Day Riyadh, September 29
- ClickHouse Meetup in Madrid - September 30
- ClickHouse Meetup in Barcelona - October 1
- AI in ClimateTech Panel - Amsterdam C-Level Meetup - October 7
- ClickHouse Meetup in Zürich - October 9
- ClickHouse Meetup in London - October 15
- Amsterdam User Conference - October 27
- ClickHouse Deep Dive Part 1 In-Person Training (Amsterdam) - October 28
- ClickHouse Meetup in Cyprus - November 20
Events in APAC #
- Database Technology Conference China - August 21
- AWS Community Day Pune - August 23
- Delhi/Gurgaon Migration to ClickHouse Workshop (in-person) - September 4
- CloudCon Sydney - September 9-10
- Asia Data Analytics Conference Vietnam - September 11
- AWS Community Day Vadodara - September 13
- AWS Startup Dev Day Sydney - September 18
- Data & AI Summit Singapore - September 24
- Sydney Open House User Conference - October 2
- Bangalore Open House User Conference - October 7
25.7 release #

In ClickHouse 25.7, we've delivered lightweight SQL UPDATE and DELETE operations that are up to 1,000× times faster thanks to Anton Popov's new patch-part mechanism.
We've also added AI-powered SQL generation from Kaushik Iska (just type ??
and ask your question in plain English!), Amos Bird's clever optimization that makes count()
aggregations 20-30% faster by skipping memory allocation, and Nikita Taranov's continued JOIN improvements delivering up to 1.8× speedups.
Using Gemini and Claude For SQL Analytics - A Bake Off #

Benjamin Wootton benchmarked Claude Opus and Gemini 2.5 Pro for SQL analytics with ClickHouse, using Danny's Diner SQL. Both LLMs achieved near-perfect accuracy, generating complex SQL queries from plain English prompts via the MCP (Model Context Protocol) standard. You'll have to read the blog to find out which model solved the questions more quickly!
How we built fast UPDATEs for the ClickHouse column store #

In the first part of Tom Schreiber’s deep dive into ClickHouse updates, he explains how ClickHouse solves the performance challenges of row-level updates by treating updates as inserts through purpose-built engines like ReplacingMergeTree, CoalescingMergeTree, and CollapsingMergeTree that leverage ClickHouse's insert throughput and background merge process.
Querying ~86 Million rows/s for high-performance dashboard analytics #

Edouard Kombo shares how he abandoned his initial Python + PostgreSQL stack when faced with 4-second query times on just 50 million rows, discovering that ClickHouse's columnar storage and vectorized execution could scan 400 million rows at ~86 million rows per second - making sub-second analytics on billions of rows achievable.
LLM observability with ClickStack, OpenTelemetry, and MCP #

Dale McDiarmid and Lionel Palacin demonstrate how to build comprehensive LLM observability using ClickStack, our open-source observability stack, to instrument LibreChat - an AI chat platform with MCP support.
Quick reads #
- Mohamed Hussain S explores generating a Parquet file using Python and loading it into ClickHouse using Go.
- Mohamed Hussain S also shares some lessons he learnt about writing better ClickHouse queries.
- Divyanshu Raj explores the AggregatingMergeTree table engine, explains how it works, where it shines, and how it compares with other engines like ReplacingMergeTree.
- Bayu Setiawan builds a medallion architecture using Kafka, Debezium, Flink, Airflow, ClickHouse, and MinIO.
- Alireza Mousavizade explains how to build a real-time session analytics pipeline designed for high-throughput, resilience, and superior query performance, capable of transforming raw event streams into interactive, live dashboards.