Catalog configuration for Apache Iceberg
Configuration parameters for Iceberg catalogs used by RisingWave sources, sinks, and connections.
These configuration parameters are used when creating Iceberg sources, sinks, or connections to specify the Iceberg catalog. The catalog is responsible for managing table metadata (schema, partitioning, location). RisingWave supports several catalog types. These parameters are used in the WITH
clause of CREATE SOURCE
, CREATE SINK
, and CREATE CONNECTION
statements.
Catalog parameters
Parameter | Description | Required (Conditional) |
---|---|---|
catalog.name | The name of the Iceberg catalog. This is a user-defined identifier. Optional for the storage catalog, but required for all other catalog types. | Conditional |
catalog.type | The type of Iceberg catalog to use. Supported values:
'storage' . | No (defaults to storage ) |
catalog.uri | The URI of the catalog. The required format depends on the catalog.type :
| Conditional |
catalog.credential | Credential for accessing the Iceberg catalog, used to exchange for a token in the OAuth2 client credentials flow. Applicable only in the rest catalog. | No |
catalog.token | A Bearer token for accessing the Iceberg catalog, used for interaction with the server. Applicable only in the rest catalog. | No |
catalog.oauth2_server_uri | The oauth2_server_uri for accessing the Iceberg catalog, serving as the token endpoint URI to fetch a token if the rest catalog is not the authorization server. Applicable only in the rest catalog. | No |
catalog.scope | Scope for accessing the Iceberg catalog, providing additional scope for OAuth2. Applicable only in the rest catalog. | No |
catalog.jdbc.user | Username for JDBC catalog. | No |
catalog.jdbc.password | Password for JDBC catalog. | No |
Catalog types
Iceberg supports these types of catalogs:
Storage catalog
The Storage catalog stores all metadata in the underlying file system, such as Hadoop or S3. Currently, we only support S3 as the underlying file system.
REST catalog
The REST catalog uses a RESTful interface to manage table metadata.
JDBC catalog
The JDBC catalog stores metadata in a relational database like PostgreSQL.
Glue catalog
AWS Glue Data Catalog provides a centralized metadata store for table definitions.
Snowflake catalog
The Snowflake catalog uses Snowflake’s built-in Iceberg catalog functionality.
Hive Metastore catalog
The Hive Metastore catalog stores metadata using a Hive Metastore backend.
Use with RisingWave Managed Iceberg
While these external catalogs are primarily used with the Bring Your Own Iceberg approach, they can also be used with RisingWave Managed Iceberg when you need to integrate with existing catalog infrastructure:
For the simplest managed setup, use the Hosted Iceberg Catalog instead.
Choosing the right catalog
For Bring Your Own Iceberg:
- AWS Glue: If you’re on AWS and have existing Glue Data Catalog infrastructure
- JDBC: If you have a PostgreSQL/MySQL database for metadata storage
- REST: For custom catalog services or AWS S3 Tables
- Storage: For simple setups without external catalog dependencies
- Hive: For integration with existing Hadoop ecosystem
- Snowflake: If using Snowflake as your data warehouse
For RisingWave Managed Iceberg:
- Hosted catalog: Recommended for new projects and simple setups
- External catalogs: When you need integration with existing enterprise catalog infrastructure
Best practices
- Choose based on your ecosystem: Use catalogs that integrate with your existing infrastructure.
- Consider security: Some catalogs offer better authentication and authorization features.
- Plan for scale: Some catalogs handle large numbers of tables better than others.
- Network connectivity: Ensure RisingWave can reach your catalog service.
- Backup considerations: Understand how your catalog metadata is backed up and recovered.
Next steps
- Object storage setup: Configure your storage backend with Object storage configuration.
- Read from tables: Set up sources with Read from Iceberg.
- Write to tables: Set up sinks with Write to Iceberg.
- Managed alternative: Consider RisingWave Managed Iceberg for simpler setups.