ClickHouse Engineering Resources


Best Open Source Observability Solutions (2025 Guide)

Explore the top open source observability stacks for 2025. Compare ELK, LGTM, and unified observability solutions like ClickStack for cost, scale, and high-cardinality data.
Manveer Chawla • November 3, 2025

OLTP vs OLAP

Explore the differences between OLTP (Online Transaction Processing) and OLAP (Online Analytical Processing) systems, examining how they serve unique purposes, the challenges of implementing OLAP, and guidelines for selecting the right system for various use cases. Real-world examples will illustrate when and why each system excels, providing practical insights for application.
The ClickHouse Team • April 11, 2025

Columnar databases explained

In this guide, we’ll explore columnar databases. How do they differ from row-based databases? What are they good at? What are the advantages of using a column store?
The ClickHouse Team • September 16, 2025

What is real-time analytics?

In this guide, we'll learn all about real-time analytics - how does it compare to batch analytics, what are its main characteristics, use cases, and more.
The ClickHouse Team • October 2, 2025

MCP and Data Warehouses: everything you need to know

This article explores the suitability of MCP with Data Warehouses, and discusses the business and technical details you need to know to succeed
The ClickHouse Team • September 29, 2025

Real-time data visualization

This guide is all about real-time data visualization. We'll explore how it differs from normal visualization, see some examples, and learn about the tools we can use.
The ClickHouse Team • April 11, 2025

An intro to time-series databases

In this guide, we’ll learn all about time-series data and its use cases, time-series databases, and how to query time-series data.
The ClickHouse Team • September 11, 2025

What is a JSON database?

In this guide, we'll learn about JSON, the types of databases that can store JSON, and how to work with JSON data in ClickHouse.
The ClickHouse Team • April 11, 2025

What is a data application?

In this guide, we'll learn all about data applications - what are they, what are the main components, and why would you want to create one?
The ClickHouse Team • April 11, 2025

Avro vs Parquet

In this guide, we'll learn all about the Apache Avro and Apache Parquet big data formats.
The ClickHouse Team • April 11, 2025

Structured, unstructured, and semi-structured data

In this guide, we explore the three main forms of data: structured data with rigid schemas like database tables, unstructured data like text and images with no predefined format, and semi-structured data like JSON that combines elements of both while maintaining flexibility.
The ClickHouse Team • April 11, 2025

Build a dashboard in Python with ClickHouse and Streamlit

In this guide, you'll learn how to build a Python dashboard using ClickHouse and Streamlit. We'll create a real-world example that visualizes Bluesky social media data, walking through everything from basic setup to interactive visualizations. Perfect for data scientists and analysts who want to share their insights through custom dashboards.
The ClickHouse Team • June 2, 2025

Log monitoring

Discover the fundamentals of log monitoring systems, exploring different log types, monitoring techniques, and modern tools, with practical insights into how organizations leverage solutions like ClickHouse to manage massive log volumes efficiently and cost-effectively.
The ClickHouse Team • April 11, 2025

An intro to OpenTelemetry (OTel)

In this guide, we’ll explore OpenTelemetry (OTel), a framework for collecting and standardizing telemetry data—metrics, logs, and traces—enhancing observability and performance monitoring in modern software systems.
The ClickHouse Team • April 11, 2025

Telemetry data explained

In this guide, we'll explore telemetry data - the vital information that helps us understand, monitor, and improve our software systems through the collection of metrics, logs, and traces.
The ClickHouse Team • April 11, 2025

Observability

In this guide, we'll explore observability - the practice of understanding a system's internal state through its outputs, and how modern approaches are helping organizations gain deeper insights into their systems' behavior.
The ClickHouse Team • June 25, 2025

Security Information and Event Management (SIEM)

In this guide, we'll explore SIEM (Security Information and Event Management) - the central security system that collects, analyzes, and responds to security threats across your organization's entire infrastructure.
The ClickHouse Team • April 11, 2025

Application Performance Monitoring (APM)

In this guide, we'll explore Application Performance Monitoring (APM) - the practice of tracking and analyzing application behavior in real-time to ensure optimal performance and user experience.
The ClickHouse Team • April 11, 2025

Understanding LLM Observability

In this guide, we'll explore how teams monitor and debug their LLM applications, helping them understand everything from response accuracy and token usage to the complex reasoning chains of AI agents.
The ClickHouse Team • October 28, 2025

Network monitoring

In this guide, we'll explore how organizations implement network monitoring to gain visibility, troubleshoot issues, and ensure optimal performance across distributed infrastructures.
The ClickHouse Team • April 11, 2025

Structured logging

In this guide, we'll explore how structured logging transforms traditional text-based logs into queryable data, enabling organizations to build powerful monitoring, analysis, and automation capabilities at scale.
The ClickHouse Team • April 11, 2025

Open table formats

In this guide, we'll explore the Iceberg, Delta Lake, and Hudi open table formats.
The ClickHouse Team • May 9, 2025

Data catalog

In this guide, we'll explore data catalogs for open table formats like Iceberg, Delta Lake, and Hudi, explaining how these metadata systems make modern data lakes more powerful and accessible.
The ClickHouse Team • May 9, 2025

Data lakehouse

The data lakehouse combines the best of data warehouses and data lakes into a unified architecture. We'll explore its key components, advantages, and how ClickHouse fits into this modern analytics platform.
The ClickHouse Team • October 28, 2025

Apache Iceberg

Apache Iceberg transforms data lakes into robust lakehouse architectures with its high-performance table format. This article explores Iceberg's origins, key features like ACID transactions and schema evolution, and demonstrates how to query Iceberg tables in ClickHouse using both direct and catalog-based approaches.
The ClickHouse Team • May 21, 2025

Instrumenting OpenAI with OpenTelemetry (OTel)

In this guide, we’ll learn how to instrument the OpenAI client with OpenTelemetry (OTel) so that we can generate and collect observability data about our LLM calls.
The ClickHouse Team • August 1, 2025

Top 5 Splunk Alternatives in 2025

What are the best alternatives to Splunk?
The ClickHouse Team • October 29, 2025

Setting up Apache Iceberg locally using PySpark

In this guide, we'll learn how to set up Apache Iceberg locally using PySpark.
Mark Needham • August 5, 2025

Tracing LangChain apps with OpenLLMetry

In this guide, we'll learn about OpenLLMetry - the open-source observability framework for large language models.
Mark Needham • August 29, 2025

Database compression

Learn how databases compress data, the trade-offs between row and column storage, and why ClickHouse delivers industry-leading compression ratios.
The ClickHouse Team • September 17, 2025