February 2025 Newsletter

Mark Needham
Feb 19, 2025 - 8 minutes read

Well, January went by quickly, didn’t it?! That means it must be time for our second newsletter of 2025.

This month's big news is the launch of JSONBench, a benchmark suite for JSON analytics. Ryadh Dahimene tells us about agent-facing analytics, Shahar Gvirtz explains why he likes ClickHouse, Tom Schreiber dives into the join improvements in 25.1, and more.

This month's featured community member is Chris Lawrence, Dev Lead and Senior Software Engineer at AMP.

Chris previously co-founded ReSync Digital, successfully launching over 30 products for early-stage startups, and has experience in machine vision and IoT solutions through his work with Skip-Line, LLC.

Chris Lawrence spoke at the ClickHouse meetup in Melbourne in August 2024. He shared how AMP’s implementation of ClickHouse Cloud has helped them transform their data pipeline from batch processing to real-time streaming, improving their analytics platform's speed and reliability. Chris also elaborated on his talk in a recent blog post.

➡️ Follow Chris on LinkedIn

Upcoming events

Global events

Free training

Events in AMER

Events in EMEA

Events in APAC

Introducing JSONBench: The billion docs JSON Challenge vs MongoDB, Elasticsearch, and more

The November newsletter mentioned the new JSON data type and explained its performance benefits. To test these claims, we developed JSONBench, a benchmark suite for JSON analytics.

Tom Schreiber has published a comprehensive blog post comparing how different databases handle JSON data. The analysis covers performance benchmarks and storage approaches across multiple systems, including ClickHouse, MongoDB, and Elasticsearch.

His findings detail how each database performs with analytical queries on JSON data and explore their underlying JSON storage mechanisms.

➡️ Read the blog post

Shahar Gvirtz: 7 Reasons why I like ClickHouse

It’s always fun to come across a blog post by a community member enjoying their time with ClickHouse!

I won’t go through all of Shahar’s reasons for liking ClickHouse, but I did want to highlight one of the things that he likes, which is an underrated feature of ClickHouse - its ability to compress data. In Shahar’s words:

Logs stored in ClickHouse take up only 28% of the space they occupy in Elasticsearch.

If you ever need to tell a friend or colleague why you like ClickHouse, you could do worse than point them to this blog post!

➡️ Read the blog post

Agent-Facing Analytics

Ryadh Dahimene has written a (IMHO) brilliant blog post explaining a new user persona for real-time analytics databases - AI agents!

Ryadh first takes us on a brief tour of AI developments since the launch of ChatGPT in 2022, including the "sense-think-act" loop, the introduction of support for tools by LLMs, and the recent evolution of reasoning models like OpenAI o1 and DeepSeek-R1.

He then explores the role of real-time analytics databases in agentic workflows and introduces the ClickHouse MCP Server. This is our implementation of the server side of Anthropic’s Model Context Protocol, which means you can easily converse with a ClickHouse database from Claude Desktop.

➡️ Read the blog post

ClickHouse and Cribl: A Powerful Data Ingestion and Analysis Duo

Cribl Stream is a data processing platform that works with various data sources, including telemetry data, like logs, metrics, and trace data. It can preprocess, filter, and transform events before forwarding them to destinations, helping optimize storage utilization and query efficiency. Support for ClickHouse was recently added to its list of supported outputs.

David Maislin has written a detailed guide showing how to set up and use this integration. The guide includes step-by-step instructions for creating ClickHouse tables, configuring Cribl Stream destinations, and using Cribl Search to query the data. It also demonstrates how to use ClickHouse alongside Cribl's data processing features, complete with examples using Cribl's Datagen feature to generate test data.

➡️ Read the blog post

ClickHouse Cloud evolution: compute-compute separation, improved autoscaling, and more!

ClickHouse Cloud was built in record time and brought to market in December 2022. Since then, over a thousand companies have onboarded their workloads into our managed service, and every day, they now collectively run 5.5 billion queries, scanning 3.5 quadrillion records on top of 100PB of data!

Over the past two years, we've gained valuable insights from working closely with our users and have significantly evolved our cloud architecture. This blog describes the latest improvements, including compute-compute separation, high-performance machine types (moving to Graviton in AWS), single-replica services, and more reactive and seamless automatic scaling.

➡️ Read the blog post

25.1 release

In the 25.1 release blog post, Tom Schreiber did a deep dive into the improvements made to the probe phase of the parallel hash join algorithm. If you’re interested in database internals, that’s worth a read.
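
If you haven't come across the terms before, a hash join runs in two steps: a build phase that loads one table into a hash table, and a probe phase that looks up each row of the other table in it. Here's a minimal, single-threaded sketch in Python. It only illustrates the general technique and is not ClickHouse's implementation; the `hash_join` function and the sample rows are made up for this example.

```python
from collections import defaultdict


def hash_join(left_rows, right_rows, key):
    """Inner-join two lists of dicts on `key` using a generic hash join."""
    # Build phase: index the right-hand rows by their join key.
    table = defaultdict(list)
    for row in right_rows:
        table[row[key]].append(row)

    # Probe phase: look up each left-hand row's key in the hash table
    # and emit one combined row per match.
    joined = []
    for row in left_rows:
        for match in table.get(row[key], []):
            joined.append({**row, **match})
    return joined


# Hypothetical sample data, for illustration only.
users = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}]
orders = [{"id": 1, "total": 10}, {"id": 1, "total": 5}, {"id": 3, "total": 7}]
result = hash_join(users, orders, "id")
```

In this sketch, the probe loop is the part that touches every left-hand row, which is why it's the natural place to look for speedups.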

This release also introduced MinMax indices at the table level, improved the Merge table engine and table function, added auto-increment functionality, and made some nice CLI usability improvements.
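
As a rough idea of what a MinMax index buys you, here's a small Python sketch of the general technique: store the minimum and maximum value for each block of rows, then skip any block whose range can't contain the values a query is looking for. This is an illustration under my own assumptions, not how ClickHouse stores or evaluates its indices; `build_minmax_index`, `blocks_to_scan`, and the sample values are hypothetical.

```python
def build_minmax_index(values, block_size):
    """Record (start_offset, min, max) for each block of `block_size` values."""
    index = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        index.append((start, min(block), max(block)))
    return index


def blocks_to_scan(index, low, high):
    """Return start offsets of blocks whose [min, max] overlaps [low, high]."""
    return [
        start
        for start, block_min, block_max in index
        if block_max >= low and block_min <= high
    ]


# Hypothetical column values, split into blocks of three rows.
values = [1, 2, 3, 10, 11, 12, 20, 21, 22]
index = build_minmax_index(values, block_size=3)
candidates = blocks_to_scan(index, low=11, high=15)
```

Only the blocks returned by `blocks_to_scan` need to be read to answer the range filter; the rest are skipped without looking at their rows.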

➡️ Read the release post

Interesting projects

While compiling the newsletter each month, I come across many ClickHouse-based projects, so I thought I’d share some of them this month.

  • apitally.io - An API monitoring and analytics tool for Python / Node.js apps. It helps users understand API usage and performance, spot issues early, and troubleshoot effectively when something goes wrong. The founder mentioned on a Hacker News thread that it uses ClickHouse to store data.
  • Openpanel - An open-source alternative to Mixpanel for capturing user behavior across web, mobile apps, and backend services. It uses ClickHouse to store events.
  • Vigilant - A lightweight tool for managing structured logs. It lets you centralize your logs, search them, and create alerts. It uses ClickHouse under the hood.
  • CH-UI - A user interface for interacting with the ClickHouse Server. It has syntax highlighting for queries and lets you see visual metrics about your instance.

Video Corner

Post of the month

My favorite post this month was by Jacob Wolf, who’s ingesting lots of data into ClickHouse.

➡️ Read the post
