Download full logo Download logomark

Videos

Intro to Full-Text Search in ClickHouse

Previous videoClickhouse Meetups Toronto Five One Four Next videoGetting Started with Managed ClickStack

Mark Needham

ClickHouse 26.2 brings the text index to production status, and in this video we take a proper look at how it works. It's implemented as an inverted index - similar to what you'd find in Lucene - so it stores a mapping from tokens to row numbers, letting the query engine skip most of the data when searching text columns.

We run through the whole thing using a 20 million row subset of the GitHub events dataset:

How the text index works under the hood (inverted index, tokenizers, preprocessors)
Creating tables with and without the index and comparing insert performance and disk usage
Writing queries using hasToken, hasAllTokens, and hasAnyTokens
Using EXPLAIN to see granule skipping in action — from 2504 granules down to 28
A real-world example: GitTrends, built on the full 10 billion row dataset

Recent videos

View all Videos

YouTube Video: GwCRcRa8f3A

Open House

Open House 2026: Day 1 Keynote

The latest ClickHouse announcements, featuring real-world use cases from Shopify, Zoox, Visa, and Cisco.

YouTube Video: ZtvlCz7Ukg4

Open House

Fireside Chat: The state of data and AI with Bret Taylor (Sierra) and Aaron Katz (ClickHouse)

Aaron Katz (CEO, ClickHouse) and Bret Taylor (Co-Founder Sierra, Chairman of the Board, OpenAI) have an open conversation on the state of AI.

YouTube Video: FmS7VopaqNg

Open House, ClickHouse

How to build a great database (Alexey Milovidov)

The principles behind building a great database, and the new frontiers shaping the field.

YouTube Video: _8DUJK8xgq8

Open House

Fireside Chat: Ecosystem and technology trends (Vercel, dbt Labs, CoreWeave)

Aaron Katz (CEO, ClickHouse), Guillermo Rauch (CEO, Vercel), Tristan Handy (CEO, dbt Labs), and Lukas Biewald (SVP of AI, CoreWeave) discuss how AI is changing the data landscape.

View all Videos

Follow us