- Ingestion can be implemented as ETL or ELT pipelines that continuously write data into Iceberg tables.
- Transformation is powered by RisingWave’s incremental materialized views, enabling efficient, cascading updates across multiple layers of derived data.

- Use internal Iceberg tables as the Bronze layer for raw data storage.
- Use materialized views as the Silver layer to filter and transform data.
- Use cascading materialized views (built on top of the Silver layer MVs) as the Gold layer to aggregate and enrich data for analytics.
Hosted catalog services
RisingWave provides two hosted catalog options for managing Iceberg metadata, schema versions, and table state:- JDBC hosted catalog — backed by RisingWave’s internal PostgreSQL-compatible metastore. See JDBC hosted catalog.
- REST hosted catalog — powered by Lakekeeper and compatible with the Iceberg REST catalog API. See REST hosted catalog.
Compaction service
RisingWave provides a managed compaction service that helps maintain table health by performing compaction and snapshot expiration.- Compaction: Merges small data files into larger, optimized files to improve read performance.
- Snapshot Expiration: Removes old, unneeded snapshots and their associated data files to reclaim storage space.
VACUUM
command. Using RisingWave’s service is optional, and you can also connect an external compactor from providers like Databricks, Tabular, or AWS EMR, or use a self-hosted Spark job.
For complete details on configuration, see the Iceberg table maintenance.
Catalog and compaction summary
Component | RisingWave Native Options | Alternative Options | Description |
---|---|---|---|
Catalog service | RisingWave hosted JDBC and REST catalogs | Glue, Hive, Nessie, or custom REST catalogs | Stores metadata and schema information |
Compaction service | RisingWave’s built-in compaction service | External services (Databricks, Tabular, EMR) or self-hosted Spark | Merges small files and expires old snapshots |