Hello, and welcome to the November 2025 ClickHouse newsletter!
This month, we have big news with our LibreChat acquisition, a data analyst's AI-powered warehouse build, 170x log compression techniques, and auto-exporting traces to OpenTelemetry.
Featured community member: Kiyose Ryu #
This month's featured community member is Kiyose Ryu, Engineering Manager at Smartnews, Inc.
Kiyose Ryu has worked at Smartnews for the past five years, where he's helped build a real-time aggregation system using ClickHouse.
Smartnews began introducing ClickHouse in 2020, and by 2021, had converted all advertiser reports to real-time. Advertisers need up-to-the-minute data to manage their campaigns effectively, which is why Smartnews prioritized delivering accurate, real-time reports. Kiyose Ryu shared the details of their implementation at the ClickHouse Tokyo meetup.
Upcoming events #
Global virtual events #
- How F45 Turns 1B+ Data Points into 70% Higher Member Satisfaction - November 20
- v25.11 Community Call - November 25
Virtual training #
Real-time Analytics
- Real-time Analytics with ClickHouse: Level 1 - November 19
- Real-time Analytics with ClickHouse: Level 1 (APJ-friendly) - November 25
- Real-time Analytics with ClickHouse: Level 2 (APJ-friendly) - November 26
- Real-time Analytics with ClickHouse: Level 3 (APJ-friendly) - November 27
- Real-time Analytics with ClickHouse: Level 1 - December 2
- Real-time Analytics with ClickHouse: Level 2 - December 3
- Real-time Analytics with ClickHouse: Level 3 - December 4
- Real-time Analytics with ClickHouse: Level 1 - December 9
- Real-time Analytics with ClickHouse: Level 2 - December 10
- Real-time Analytics with ClickHouse: Level 3 - December 11
Observability
- Observability at Scale with ClickStack - November 26
- Observability at Scale with ClickStack - December 3
- Observability with ClickStack: Level 2 - December 17
Events in AMER #
- House Party, The SQL, Las Vegas - December 3
Events in EMEA #
- ForwardData Paris (Data&AI) - November 24
- Paris Monitoring Day - November 25
- ClickHouse Meetup Warsaw - November 26
- ClickHouse Meetup in Tel Aviv - December 29
Events in APAC #
- Mumbai In-Person Training - Real-time Analytics with ClickHouse: Level 1 - November 21
- ClickHouse + AWS Bangkok Meetup - November 25
- Open Source Summit Japan - December 8
- Jakarta In-Person Training - Real-time Analytics with ClickHouse: Level 1 - December 9
- ClickHouse Jakarta Meetup - December 9
- ClickHouse Tokyo Meetup & Year-End Party - December 15
- AI Engineering Summit Tokyo - December 16
25.10 release #
ClickHouse 25.10 sees a collection of JOIN performance optimizations. These include lazy column replication, bloom filter optimizations, and smarter push-down of complex conditions, delivering up to 24 times faster queries in some cases.
There are also several additions to the SQL syntax, including the <=> (IS NOT DISTINCT FROM) operator, negative limit and offset (perfect for getting the n most recent records but returning them in ascending order), and LIMIT BY ALL.
And finally, my personal favorite - as of ClickHouse 25.10, we can query the ClickHouse Arrow Flight server using the ClickHouse Arrow Flight client.
ClickHouse welcomes LibreChat: Introducing the open-source Agentic Data Stack #
We acquired LibreChat, the leading open-source AI chat platform, to create the "Agentic Data Stack" - combining LibreChat's multi-LLM interface with ClickHouse's analytical speed so users can query data in plain English.
Companies like Shopify, Daimler Truck, and cBioPortal are already using this stack to democratize data access, and our own AI-powered data warehouse now handles 70% of queries for 200+ users.
LibreChat remains 100% open-source under its MIT license, and we plan to implement deeper native integrations while preserving the platform's flexibility.
Streaming asynchronous inserts monitoring in ClickHouse #
AB Tasty's William Attache explains how his team built a comprehensive monitoring system to track ClickHouse asynchronous inserts end-to-end, from initial requests through flush queries to final part merges. The team created a custom view that joins data from multiple system tables to visualize the complete lifecycle of streaming inserts.
Along the way, they achieved significant cost savings by switching from JSON to RowBinary format and optimizing their batch sizes based on how ClickHouse actually creates and merges parts.
We built a vector search engine that lets you choose precision at query time #
Raufs Dunamalijevs added QBit to ClickHouse, a column type that stores floats as bit planes. It allows you to choose how many bits to read during vector search, thereby tuning recall and performance without altering the data.
From 0–1: Building our data warehouse with ClickHouse to enable self-serve analytics and observability at scale #
Viralo hit 10M users, and their PostgreSQL analytics setup collapsed, with dashboards taking over 30 minutes to load, CPUs maxing out at over 80%, and queries failing. They migrated to ClickHouse Cloud and implemented a medallion architecture, using SQL for all data transformations instead of external ETL tools, which reduced query time to under 30 seconds.
Notably, Shubham Bhardwaj's team constructed the entire warehouse using a low-code approach, leveraging Claude Sonnet for code assistance, thereby eliminating orchestration tool costs and demonstrating that innovative architecture and AI tooling can effectively replace complex data stacks.
Tracing the invisible: Building end-to-end observability in a real-time streaming pipeline #
When building end-to-end observability for their real-time metrics pipeline, Pranav Mehta's team faced a unique challenge: ClickHouse stores query spans internally, but it cannot use an SDK to push those traces to an OpenTelemetry collector, as is done with application services.
They came up with a clever idea of using an incremental materialized view that writes to a table backed by the URL engine, which points at the OTel collector API.
Now they can trace a request all the way from the application, through NATS JetStream, into ClickHouse query execution - and see it all in one view.
ClickHouse partners with Japan Cloud to establish ClickHouse K.K. and accelerate growth in Japan #
Earlier this month, ClickHouse announced the establishment of ClickHouse K.K., its Japanese subsidiary, through a strategic partnership with Japan Cloud.
Quick reads #
- Alexey Milovidov derives and visualizes real-time weather data (wind direction, speed, and air pressure) from airplane telemetry stored in ClickHouse using trigonometry and color mapping.
- Nisarg Pipaliy has written a beginner-friendly explainer that breaks down ClickHouse's table engines, showing how different engines, such as MergeTree, ReplacingMergeTree, and SummingMergeTree, give tables distinct "personalities" for handling deduplication, aggregation, and replication. This makes it clear why ClickHouse is fundamentally different from traditional row-based databases, such as PostgreSQL.
- Julian Virguez wired ClickHouse directly into Salesforce using a Lightning Web Component that queries 150M rows of Amazon review data in under a second.
- Lionel Palacin demonstrated how to achieve 170x+ compression on Nginx logs by transforming raw text into structured columns with optimized data types and strategic ordering keys. He then followed this up with another post showing how to automatically group similar log messages and extract variable fields into columns, achieving nearly 50x compression while keeping logs fully queryable and reconstructable.
- Andrii K walks through setting up a complete CDC pipeline using Debezium to capture PostgreSQL changes, stream them through Kafka with Avro serialization, and sink into ClickHouse.
Video corner #
- Mark Needham demonstrates the ClickHouse MCP server's new chDB mode, which enables in-process execution of ClickHouse queries without a server.
- At Web Summit 2025, Alexey Milovidov demonstrated loading massive public datasets into ClickHouse and then running real-time analytics, such as comparing writing styles across domains, tracking trends on Reddit, and creating an interactive map that showed the "best photo" for every location on Earth.
- At OpenHouse Bangalore 2025, Kiran Raparti and Pragnesh Bhavsar from HighLevel explained how they migrated four critical use cases from MySQL, Elasticsearch, and MongoDB to ClickHouse.



