ClickHouse engineering resources


  • The definitive guide to ClickHouse query optimization

    Master ClickHouse query optimization through architectural understanding. Learn why ORDER BY design can improve performance 100×, plus proven techniques for trillion-row millisecond queries.

    Alasdair Brown • Last updated: Nov 7, 2025

  • Top 10 OpenTelemetry Compatible Platforms for 2025

    Explore the top 10 OpenTelemetry-compatible platforms for 2025. Learn how to choose the right backend based on performance, cost at scale, and analytics.

    Aditya Somani • Last updated: Nov 6, 2025

  • Best Open Source Observability Solutions (2025 Guide)

    Explore the top open source observability stacks for 2025. Compare ELK, LGTM, and unified observability solutions like ClickStack for cost, scale, and high-cardinality data.

    Manveer Chawla • Last updated: Nov 3, 2025

  • OLTP vs OLAP

    Explore the differences between OLTP (Online Transaction Processing) and OLAP (Online Analytical Processing) systems: the distinct purposes they serve, the challenges of implementing OLAP, and guidelines for selecting the right system for a given use case. Real-world examples illustrate when and why each system excels.

    Last updated: Apr 11, 2025

  • Columnar databases explained

    In this guide, we’ll explore columnar databases. How do they differ from row-based databases? What are they good at? What are the advantages of using a column store?

    Last updated: Sep 16, 2025

  • What is real-time analytics?

    In this guide, we'll learn all about real-time analytics: how it compares to batch analytics, its main characteristics, its use cases, and more.

    Last updated: Oct 2, 2025

  • MCP and Data Warehouses: everything you need to know

    This article explores how well MCP suits data warehouses and covers the business and technical details you need to know to succeed.

    Last updated: Sep 29, 2025

  • An intro to time-series databases

    In this guide, we’ll learn all about time-series data and its use cases, time-series databases, and how to query time-series data.

    Last updated: Sep 11, 2025

  • Real-time data visualization

    This guide is all about real-time data visualization. We'll explore how it differs from normal visualization, see some examples, and learn about the tools we can use.

    Last updated: Apr 11, 2025

  • What is a JSON database?

    In this guide, we'll learn about JSON, the types of databases that can store JSON, and how to work with JSON data in ClickHouse.

    Last updated: Apr 11, 2025

  • What is a data application?

    In this guide, we'll learn all about data applications: what they are, what their main components are, and why you would want to create one.

    Last updated: Apr 11, 2025

  • Avro vs Parquet

    In this guide, we'll learn all about the Apache Avro and Apache Parquet big data formats.

    Last updated: Apr 11, 2025

  • Structured, unstructured, and semi-structured data

    In this guide, we explore the three main forms of data: structured data with rigid schemas like database tables, unstructured data like text and images with no predefined format, and semi-structured data like JSON that combines elements of both while maintaining flexibility.

    Last updated: Apr 11, 2025

  • Build a dashboard in Python with ClickHouse and Streamlit

    In this guide, you'll learn how to build a Python dashboard using ClickHouse and Streamlit. We'll create a real-world example that visualizes Bluesky social media data, walking through everything from basic setup to interactive visualizations. Perfect for data scientists and analysts who want to share their insights through custom dashboards.

    Last updated: Jun 2, 2025

  • Log monitoring

    Discover the fundamentals of log monitoring systems, exploring different log types, monitoring techniques, and modern tools, with practical insights into how organizations leverage solutions like ClickHouse to manage massive log volumes efficiently and cost-effectively.

    Last updated: Apr 11, 2025

  • An intro to OpenTelemetry (OTel)

    In this guide, we’ll explore OpenTelemetry (OTel), a framework for collecting and standardizing telemetry data (metrics, logs, and traces) to enhance observability and performance monitoring in modern software systems.

    Last updated: Apr 11, 2025

  • Telemetry data explained

    In this guide, we'll explore telemetry data - the vital information that helps us understand, monitor, and improve our software systems through the collection of metrics, logs, and traces.

    Last updated: Apr 11, 2025

  • Observability

    In this guide, we'll explore observability - the practice of understanding a system's internal state through its outputs, and how modern approaches are helping organizations gain deeper insights into their systems' behavior.

    Last updated: Jun 25, 2025

  • Security Information and Event Management (SIEM)

    In this guide, we'll explore SIEM (Security Information and Event Management) - the central security system that collects, analyzes, and responds to security threats across your organization's entire infrastructure.

    Last updated: Apr 11, 2025

  • Understanding LLM Observability

    In this guide, we'll explore how teams monitor and debug their LLM applications, helping them understand everything from response accuracy and token usage to the complex reasoning chains of AI agents.

    Last updated: Oct 28, 2025

  • Application Performance Monitoring (APM)

    In this guide, we'll explore Application Performance Monitoring (APM): the practice of tracking and analyzing application behavior in real time to ensure optimal performance and user experience.

    Last updated: Apr 11, 2025

  • Network monitoring

    In this guide, we'll explore how organizations implement network monitoring to gain visibility, troubleshoot issues, and ensure optimal performance across distributed infrastructures.

    Last updated: Apr 11, 2025

  • Structured logging

    In this guide, we'll explore how structured logging transforms traditional text-based logs into queryable data, enabling organizations to build powerful monitoring, analysis, and automation capabilities at scale.

    Last updated: Apr 11, 2025

  • Open table formats

    In this guide, we'll explore the Iceberg, Delta Lake, and Hudi open table formats.

    Last updated: May 9, 2025

  • Data catalog

    In this guide, we'll explore data catalogs for open table formats like Iceberg, Delta Lake, and Hudi, explaining how these metadata systems make modern data lakes more powerful and accessible.

    Last updated: May 9, 2025

  • Data lakehouse

    The data lakehouse combines the best of data warehouses and data lakes into a unified architecture. We'll explore its key components, advantages, and how ClickHouse fits into this modern analytics platform.

    Last updated: Oct 28, 2025

  • Apache Iceberg

    Apache Iceberg transforms data lakes into robust lakehouse architectures with its high-performance table format. This article explores Iceberg's origins, key features like ACID transactions and schema evolution, and demonstrates how to query Iceberg tables in ClickHouse using both direct and catalog-based approaches.

    Last updated: May 21, 2025

  • Top 5 Splunk Alternatives in 2025

    What are the best alternatives to Splunk?

    Last updated: Oct 29, 2025

  • Instrumenting OpenAI with OpenTelemetry (OTel)

    In this guide, we’ll learn how to instrument the OpenAI client with OpenTelemetry (OTel) so that we can generate and collect observability data about our LLM calls.

    Last updated: Aug 1, 2025

  • Setting up Apache Iceberg locally using PySpark

    In this guide, we'll learn how to set up Apache Iceberg locally using PySpark.

    Mark Needham • Last updated: Aug 5, 2025

  • Tracing LangChain apps with OpenLLMetry

    In this guide, we'll learn about OpenLLMetry - the open-source observability framework for large language models.

    Mark Needham • Last updated: Aug 29, 2025

  • Database compression

    Learn how databases compress data, the trade-offs between row and column storage, and why ClickHouse delivers industry-leading compression ratios.

    Last updated: Sep 17, 2025