ClickHouse engineering resources

Mastering Kubernetes observability: a guide to monitoring modern architectures
Learn why traditional Kubernetes monitoring fails at scale and how to master observability with a modern cost-effective database-first approach.
Aditya Somani • Last updated: Jan 21, 2026
How to engineer cost-efficient open source observability with ClickHouse (ClickStack) - 2026 technical playbook
A technical guide for engineers to build a cost-efficient observability stack. Learn how to use ZSTD codecs, Materialized Views, and Tiered Storage with ClickHouse to reduce observability data footprint by 10x.
Manveer Chawla • Last updated: Jan 7, 2026
What is observability in 2026? Why it's an analytics problem and why your database matters.
Observability in 2026 is an analytics problem. Learn why the traditional "three pillars" model fails and how a unified ClickHouse database solves cost and latency issues.
Aditya Somani • Last updated: Jan 7, 2026
8 top New Relic alternatives to break the pricing trap (2026 guide)
Escape New Relic's pricing trap and vendor lock-in. Our 2026 guide compares the top 8 observability alternatives—including Datadog, Grafana, and ClickHouse—for pricing, data ownership, and SQL support.
Aditya Somani • Last updated: Dec 15, 2025
The 8 best Datadog alternatives for observability at scale in 2026
Exploring Datadog alternatives? We compare the top competitors for cost-efficiency, performance at scale, and handling high-cardinality data. Find the right fit.
Aditya Somani • Last updated: Dec 8, 2025
Beyond sampling: managing petabyte-scale logs without losing data
Stop compromising on visibility. Learn how to manage petabyte-scale logs without sampling using ClickHouse’s cost-efficient, columnar architecture.
Aditya Somani • Last updated: Nov 27, 2025
A practical guide to observability TCO and cost reduction
Tired of exploding observability bills? Learn how to calculate your true TCO and see how to cut architecture costs by 70-90% vs. ingest-based pricing.
Manveer Chawla • Last updated: Nov 26, 2025
Top 5 cloud data warehouses in 2026: Architecture, cost, and open-source
Compare the top cloud data warehouses of 2025: Snowflake, BigQuery, Redshift, Databricks, and ClickHouse. Analyze cost efficiency, real-time performance, and open-source architecture.
Alasdair Brown • Last updated: Nov 22, 2025
The high-cardinality trap: why your observability platform is failing (and what to do about it)
Tired of slow queries & dropped fields? Learn why traditional observability tools fail with high-cardinality data & how a columnar database like ClickHouse solves it.
Aditya Somani • Last updated: Nov 21, 2025
7 things to consider when choosing a cloud data warehouse
This guide examines seven critical factors that actually matter when evaluating data warehouses for production use. We'll compare how ClickHouse, Snowflake, BigQuery, and Redshift handle each consideration, drawing from documented capabilities, architectural designs, and real-world customer experiences.
Last updated: Nov 18, 2025
Top 15 infrastructure monitoring tools in 2026: a performance and cost-based comparison
A complete guide to the best infrastructure monitoring tools. We compare 15 top solutions on features, performance, high-cardinality handling, and cost at scale.
Aditya Somani • Last updated: Nov 17, 2025
Best practices for storing OpenTelemetry Collector data
What are the best practices for storing your OTel Collector data?
Last updated: Nov 10, 2025
The definitive guide to ClickHouse query optimization (2026)
Master ClickHouse query optimization through architectural understanding. Learn why ORDER BY design can improve performance 100×, plus proven techniques for trillion-row millisecond queries.
Al Brown, Tom Schreiber, Lionel Palacin • Last updated: Jan 26, 2026
Top 10 OpenTelemetry Compatible Platforms for 2025
Explore the top 10 OpenTelemetry compatible platforms for 2025. Learn how to choose the right backend based on performance, cost at scale, and analytics.
Aditya Somani • Last updated: Nov 6, 2025
The ultimate guide to Open Source Observability in 2026: From silos to stacks
Explore the top open source observability stacks for 2025. Compare ELK, LGTM, and unified observability solutions like ClickStack for cost, scale, and high-cardinality data.
Manveer Chawla • Last updated: Nov 10, 2025
OLTP vs OLAP in 2026: Key differences, definitions & examples
Understand the critical differences between OLTP (Postgres, MySQL) and OLAP (ClickHouse, Snowflake) with simple definitions, examples, and use cases. Includes a full comparison table and guidance for a modern data stack.
Last updated: Dec 10, 2025
Columnar databases explained
In this guide, we’ll explore columnar databases. How do they differ from row-based databases? What are they good at? What are the advantages of using a column store?
Last updated: Sep 16, 2025
What is Real-Time Analytics? A Complete Guide (2026)
Discover what real-time analytics is, its key benefits, and its use cases, such as fraud detection. Learn how it differs from batch and what to look for in a database.
Last updated: Jan 9, 2026
MCP and Data Warehouses: everything you need to know
This article explores the suitability of MCP with Data Warehouses, and discusses the business and technical details you need to know to succeed.
Al Brown • Last updated: Nov 29, 2025
What Is a Time-Series Database? Examples, Use Cases & ClickHouse Guide
A time-series database (TSDB) stores and queries data indexed by time, including metrics, events, and logs. Learn real-world examples, common architectures, and when ClickHouse fits time-series workloads.
Last updated: Nov 18, 2025
Real-time data visualization
This guide is all about real-time data visualization. We'll explore how it differs from normal visualization, see some examples, and learn about the tools we can use.
Last updated: Apr 11, 2025
What is a JSON database?
In this guide, we'll learn about JSON, the types of databases that can store JSON, and how to work with JSON data in ClickHouse.
Last updated: Apr 11, 2025
What is a data application?
In this guide, we'll learn all about data applications: what they are, what their main components are, and why you would want to create one.
Last updated: Apr 11, 2025
Avro vs Parquet
In this guide, we'll learn all about the Apache Avro and Apache Parquet big data formats.
Last updated: Apr 11, 2025
Structured, unstructured, and semi-structured data
In this guide, we explore the three main forms of data: structured data with rigid schemas like database tables, unstructured data like text and images with no predefined format, and semi-structured data like JSON that combines elements of both while maintaining flexibility.
Last updated: Apr 11, 2025
Build a dashboard in Python with ClickHouse and Streamlit
In this guide, you'll learn how to build a Python dashboard using ClickHouse and Streamlit. We'll create a real-world example that visualizes Bluesky social media data, walking through everything from basic setup to interactive visualizations. Perfect for data scientists and analysts who want to share their insights through custom dashboards.
Last updated: Jun 2, 2025
Log monitoring
Discover the fundamentals of log monitoring systems, exploring different log types, monitoring techniques, and modern tools, with practical insights into how organizations leverage solutions like ClickHouse to manage massive log volumes efficiently and cost-effectively.
Last updated: Apr 11, 2025
An intro to OpenTelemetry (OTel)
In this guide, we’ll explore OpenTelemetry (OTel), a framework for collecting and standardizing telemetry data—metrics, logs, and traces—enhancing observability and performance monitoring in modern software systems.
Last updated: Apr 11, 2025
Telemetry data explained
In this guide, we'll explore telemetry data - the vital information that helps us understand, monitor, and improve our software systems through the collection of metrics, logs, and traces.
Last updated: Apr 11, 2025
Observability
In this guide, we'll explore observability - the practice of understanding a system's internal state through its outputs, and how modern approaches are helping organizations gain deeper insights into their systems' behavior.
Last updated: Jun 25, 2025
Security Information and Event Management (SIEM)
In this guide, we'll explore SIEM (Security Information and Event Management) - the central security system that collects, analyzes, and responds to security threats across your organization's entire infrastructure.
Last updated: Apr 11, 2025
Understanding LLM Observability
In this guide, we'll explore how teams monitor and debug their LLM applications, helping them understand everything from response accuracy and token usage to the complex reasoning chains of AI agents.
Last updated: Oct 28, 2025
Application Performance Monitoring (APM)
In this guide, we'll explore Application Performance Monitoring (APM) - the practice of tracking and analyzing application behavior in real-time to ensure optimal performance and user experience.
Last updated: Apr 11, 2025
Network monitoring
In this guide, we'll explore how organizations implement network monitoring to gain visibility, troubleshoot issues, and ensure optimal performance across distributed infrastructures.
Last updated: Apr 11, 2025
Structured logging
In this guide, we'll explore how structured logging transforms traditional text-based logs into queryable data, enabling organizations to build powerful monitoring, analysis, and automation capabilities at scale.
Last updated: Apr 11, 2025
Open table formats
In this guide, we'll explore the Iceberg, Delta Lake, and Hudi open table formats.
Last updated: May 9, 2025
Data catalog
In this guide, we'll explore data catalogs for open table formats like Iceberg, Delta Lake, and Hudi, explaining how these metadata systems make modern data lakes more powerful and accessible.
Last updated: May 9, 2025
Data lakehouse
The data lakehouse combines the best of data warehouses and data lakes into a unified architecture. We'll explore its key components, advantages, and how ClickHouse fits into this modern analytics platform.
Last updated: Oct 28, 2025
Apache Iceberg
Apache Iceberg transforms data lakes into robust lakehouse architectures with its high-performance table format. This article explores Iceberg's origins, key features like ACID transactions and schema evolution, and demonstrates how to query Iceberg tables in ClickHouse using both direct and catalog-based approaches.
Last updated: May 21, 2025
Top 5 Splunk Alternatives in 2025
What are the best alternatives to Splunk?
Last updated: Oct 29, 2025
Instrumenting OpenAI with OpenTelemetry (OTel)
In this guide, we’ll learn how to instrument the OpenAI client with OpenTelemetry (OTel) so that we can generate and collect observability data about our LLM calls.
Last updated: Aug 1, 2025
Setting up Apache Iceberg locally using PySpark
In this guide, we'll learn how to set up Apache Iceberg locally using PySpark.
Mark Needham • Last updated: Aug 5, 2025
Tracing LangChain apps with OpenLLMetry
In this guide, we'll learn about OpenLLMetry - the open-source observability framework for large language models.
Mark Needham • Last updated: Aug 29, 2025
Database compression
Learn how databases compress data, the trade-offs between row and column storage, and why ClickHouse delivers industry-leading compression ratios.
Last updated: Sep 17, 2025