AWS Glue Catalog

Experimental feature.

ClickHouse supports integration with multiple catalogs (Unity, Glue, Polaris, etc.). In this guide, we will walk you through the steps to query your data in S3 buckets using ClickHouse and the Glue Data Catalog.

Note: Glue supports many different table formats, but this integration only supports Iceberg tables.

Configuring Glue in AWS

To connect to the Glue catalog, you will need to identify the region of your catalog and provide an access key and a secret key.

Note: Currently, the Glue catalog integration supports authentication with access and secret keys only, but we will support additional authentication approaches in the future.

Creating a connection between the Glue Data Catalog and ClickHouse

With your Glue Data Catalog configured and authentication in place, establish a connection between ClickHouse and the Glue Data Catalog.
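The connection can be sketched as a database backed by the `DataLakeCatalog` engine. This is a minimal sketch under stated assumptions: the database name `glue` is a hypothetical choice, the `<region>`, `<access-key>`, and `<secret-key>` placeholders must be replaced with your own values, and the experimental setting name may differ between ClickHouse versions, so check it against the documentation for your release.

```sql
-- Assumption: experimental Glue catalog support must be enabled first.
-- Verify the setting name for your ClickHouse version.
SET allow_experimental_database_glue_catalog = 1;

-- `glue` is a hypothetical database name used for illustration.
CREATE DATABASE glue
ENGINE = DataLakeCatalog
SETTINGS
    catalog_type = 'glue',
    region = '<region>',                      -- region of your Glue catalog
    aws_access_key_id = '<access-key>',       -- placeholder
    aws_secret_access_key = '<secret-key>';   -- placeholder
```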

Query the Glue Data Catalog using ClickHouse

Now that the connection is in place, you can start querying Glue:
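For example, assuming the connection was created as a database named `glue` (a hypothetical name), listing the tables exposed by the catalog might look like this:

```sql
-- Switch to the catalog-backed database (name is an assumption).
USE glue;

-- List every table the Glue catalog exposes, Iceberg or not.
SHOW TABLES;
```

Each entry is typically shown as the Glue namespace and the table name joined by a dot, such as iceberg-benchmark.hitsparquet.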

Some of the listed tables may not be Iceberg tables, for instance iceberg-benchmark.hitsparquet. You won’t be able to query these, as only Iceberg is currently supported.

To query a table:
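A minimal sketch, assuming the catalog contains an Iceberg table named `iceberg-benchmark.hitsiceberg` (a hypothetical name used for illustration) and that `glue` is the current database:

```sql
-- The whole `namespace.table` name is quoted as a single identifier.
SELECT count(*)
FROM `iceberg-benchmark.hitsiceberg`;
```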

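Note: Backticks are required because ClickHouse doesn’t support more than one namespace. The Glue namespace and the table name therefore form a single table identifier that contains a dot, and it must be quoted.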

To inspect the table DDL, run the following query:
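As a sketch, again assuming the hypothetical Iceberg table `iceberg-benchmark.hitsiceberg` in a database named `glue`:

```sql
-- Show the column names and types ClickHouse derived from the Iceberg schema.
SHOW CREATE TABLE glue.`iceberg-benchmark.hitsiceberg`;
```

The output is a useful starting point for defining a matching local table.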

Loading data from your Data Lake into ClickHouse

If you need to load data from an Iceberg table in your Glue catalog into ClickHouse, start by creating a local ClickHouse table:
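The schema below is purely illustrative: the table name `default.hits`, the six columns, and the sorting key are assumptions, not the actual layout of any table in your catalog. In practice, copy the columns and types from the DDL of the source Iceberg table.

```sql
-- Hypothetical local table; adjust columns and types to match your source table.
CREATE TABLE default.hits
(
    WatchID   Int64,
    EventTime DateTime,
    EventDate Date,
    CounterID Int32,
    UserID    Int64,
    URL       String
)
ENGINE = MergeTree
ORDER BY (CounterID, EventDate, UserID, EventTime, WatchID);
```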

Then load the data from your Iceberg table:
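A minimal sketch of the load, assuming a local table `default.hits` with the columns listed, a catalog database named `glue`, and a source Iceberg table named `iceberg-benchmark.hitsiceberg` (all hypothetical names):

```sql
-- Naming the columns explicitly avoids relying on column order
-- matching between the Iceberg table and the local table.
INSERT INTO default.hits (WatchID, EventTime, EventDate, CounterID, UserID, URL)
SELECT WatchID, EventTime, EventDate, CounterID, UserID, URL
FROM glue.`iceberg-benchmark.hitsiceberg`;
```

If the local table mirrors the source schema exactly, `SELECT *` can replace the explicit column list.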