Querying Google BigLake with ClickHouse

Mark Needham

BigLake is Google Cloud's managed data lake service, similar in spirit to Unity Catalog or AWS Glue. This short video walks through connecting ClickHouse to BigLake, browsing Google's public warehouse, and running queries against a 1.3-billion-row dataset.

  • Step-by-step credentials setup using a permissions script (BigLake API, GCS roles, quota project)
  • ClickHouse database config pointing at the BigLake Iceberg REST catalog
  • Explore NYC TaxiCab data: schema, row counts, and Iceberg snapshot history
  • Trade-off in action: remote queries are slow over the network, but a local MergeTree copy brings it down to seconds

Recent videos

YouTube Video: GwCRcRa8f3A

Open House

Open House 2026: Day 1 Keynote

The latest ClickHouse announcements, featuring real-world use cases from Shopify, Zoox, Visa, and Cisco.

YouTube Video: ZtvlCz7Ukg4

Open House

Fireside Chat: The state of data and AI with Bret Taylor (Sierra) and Aaron Katz (ClickHouse)

Aaron Katz (CEO, ClickHouse) and Bret Taylor (Co-Founder Sierra, Chairman of the Board, OpenAI) have an open conversation on the state of AI.

YouTube Video: FmS7VopaqNg

Open House, ClickHouse

How to build a great database (Alexey Milovidov)

The principles behind building a great database, and the new frontiers shaping the field.

Follow us

XBlueskySlackGithubTelegramMeetupRSS