
High latency

High latency is a prevalent performance problem in streaming systems, and it can have multiple underlying causes. Excessive latency not only affects the freshness of data but also poses a risk to the overall stability of the system, potentially leading to out-of-memory (OOM) errors.

This guide aims to help you identify the root causes of high latency and provide effective solutions to address these issues.

Symptoms

To identify barrier latency, navigate to the Grafana dashboard (dev), access the Streaming section, and locate the Barrier Latency panel. The figure below shows an example: the latency curve is extremely high, indicating that the barrier is getting stuck.

Figure: An example of extremely high barrier latency

Diagnosis

Some tools can be helpful in troubleshooting this issue:

  • Observe backpressure: Go to Grafana dashboard (dev) > Streaming Actors > Actor Output Blocking Time Ratio (Backpressure) panel and analyze the backpressure between fragments (actors). High backpressure between two fragments indicates that the downstream fragment cannot process data promptly, slowing down the entire streaming job.

  • Check the Await Tree Dump: Inspect the Await Tree Dump of all compute nodes in the RisingWave Dashboard hosted on http://meta-node:5691. If a barrier is stuck, the Await Tree Dump will show it waiting for a specific operation to finish; the fragment running that operation is likely the bottleneck of the streaming job.

With these tools, you can identify the bottleneck fragments (actors) and the materialized views they belong to. Additionally, refer to the RisingWave Dashboard for detailed information about those materialized views, such as their streaming execution graphs or SQL queries.
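If you prefer a SQL client to the dashboard, you can also retrieve the definition of a suspected materialized view directly. This is a minimal sketch; the view name mv_orders_by_nation is hypothetical, and it assumes your client is connected to the RisingWave frontend.

-- Print the SQL definition of a suspected materialized view
-- (replace mv_orders_by_nation with the name of your own view).
SHOW CREATE MATERIALIZED VIEW mv_orders_by_nation;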

Solutions

Once you've pinpointed the bottleneck fragment, consider the following actions to resolve the issue:

  • Enhance the streaming query performance by removing or rewriting the bottleneck part of the SQL query.

  • Increase the parallelism of the streaming job, for example by adding more compute nodes to the cluster and raising the job's parallelism (see the sketch below).
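For instance, after scaling out, you can raise the parallelism of the affected job. The statements below are a minimal sketch; they assume a hypothetical materialized view named mv_orders_by_nation and a RisingWave version that supports ALTER MATERIALIZED VIEW ... SET PARALLELISM.

-- Pin the parallelism of the bottlenecked materialized view to 8
-- after adding compute nodes to the cluster.
ALTER MATERIALIZED VIEW mv_orders_by_nation SET PARALLELISM = 8;

-- Or let RisingWave adjust the parallelism automatically as the
-- cluster scales in and out.
ALTER MATERIALIZED VIEW mv_orders_by_nation SET PARALLELISM = ADAPTIVE;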

Example

High latency can be caused by high join amplification.

Identify high join amplification

Using low-cardinality columns as equal conditions in joins can result in high join amplification, leading to increased latency.

Info: The term "cardinality" describes how many distinct values exist in a column. For example, "nation" often has a lower cardinality, while "user_id" often has a higher cardinality.

When a low-cardinality column is used as the equal condition of a join, such as A JOIN B ON A.nation = B.nation, any operation on a row in A or B is amplified by the join into potentially thousands or millions of rows, depending on how many rows share that nation. This can lead to extremely high latency.
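To gauge whether a candidate join key has low cardinality, a quick ad-hoc query can help. This is only an illustration, reusing the table A and column nation from the example above.

-- Compare the number of distinct join-key values with the total row count.
-- A small distinct count relative to the total signals a low-cardinality key
-- that may cause high join amplification.
SELECT COUNT(DISTINCT nation) AS distinct_nations,
       COUNT(*) AS total_rows
FROM A;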

For example, the following figure shows a materialized view with extremely high latency caused by high join amplification. You can find this panel through Grafana dashboard (dev) > Streaming > Join Executor Matched Rows, which indicates the number of matched rows from the opposite side in the streaming join executors.

Figure: An example of extremely high latency caused by high join amplification

When this happens, a high_join_amplification log with the problematic join keys will also be printed, such as:

hash_join_amplification: large rows matched for join key matched_rows_len=200000 update_table_id=31 match_table_id=33 join_key=OwnedRow([Some(Int32(1)), Some(Utf8("abc"))]) actor_id=119 fragment_id=30

To address the issue, consider rewriting the SQL query to reduce join amplification, for example by using more selective equal conditions in the problematic join so that fewer rows are matched. See Maintain wide table with table sinks for details.

Workaround for high join amplification

Suppose a few join keys trigger high join amplification. This congests the entire job, because a single hot key produces a large number of results and all of them must be processed by a single actor in the join operator. Since rows with the same join key are always routed to the same actor, simply scaling the cluster doesn't solve the problem.

Therefore, we can split the original MV into multiple MVs by adding filtering conditions. If the data pattern and workload allow such partitioning, this approach distributes data for the same key across multiple actors.

For example, let's consider the original join query:

SELECT * FROM orders
INNER JOIN product_description
ON orders.product_id = product_description.product_id

Suppose product_id = 1 is a hot-selling product; an update from the product_description stream with product_id = 1 can then match 100K rows from orders.

We can split the MV into multiple MVs:

  1. Create one MV with the filtering condition orders.user_id % 7 = 0 and perform the join.

  2. Create a second MV with the filtering condition orders.user_id % 7 = 1.

  3. Repeat this process up to the 7th MV, which has the filtering condition orders.user_id % 7 = 6. In total, we will have 7 MVs.

If the modulo operation cannot be applied to user_id directly (for example, because it is not an integer), it needs to be processed by a hash function first. If the hash function can return a negative value, note that in both PostgreSQL and RisingWave, -1 % 7 = -1 instead of 6. Therefore, we need to include additional MVs where hash_function(orders.user_id) % 7 = -1/-2/-3/-4/-5/-6.
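You can verify this sign behavior directly:

-- The remainder takes the sign of the dividend, so this returns -1, not 6.
SELECT -1 % 7;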

Finally, we can union the results from all 7 MVs. If user_id is uniformly distributed given product_id = 1, the join amplification in each actor is reduced by a factor of 7.
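The sketch below puts the workaround together. It is only an illustration: it assumes a hypothetical schema orders(order_id, user_id, product_id) and product_description(product_id, description), and that user_id is a non-negative integer so user_id % 7 never yields a negative remainder; otherwise, add the extra MVs for the negative remainders as described above.

-- Partition the join by orders.user_id % 7 so that rows for a hot
-- product_id are spread across 7 materialized views and their actors.
CREATE MATERIALIZED VIEW mv_orders_part_0 AS
SELECT o.order_id, o.user_id, o.product_id, d.description
FROM orders AS o INNER JOIN product_description AS d ON o.product_id = d.product_id
WHERE o.user_id % 7 = 0;

CREATE MATERIALIZED VIEW mv_orders_part_1 AS
SELECT o.order_id, o.user_id, o.product_id, d.description
FROM orders AS o INNER JOIN product_description AS d ON o.product_id = d.product_id
WHERE o.user_id % 7 = 1;

CREATE MATERIALIZED VIEW mv_orders_part_2 AS
SELECT o.order_id, o.user_id, o.product_id, d.description
FROM orders AS o INNER JOIN product_description AS d ON o.product_id = d.product_id
WHERE o.user_id % 7 = 2;

CREATE MATERIALIZED VIEW mv_orders_part_3 AS
SELECT o.order_id, o.user_id, o.product_id, d.description
FROM orders AS o INNER JOIN product_description AS d ON o.product_id = d.product_id
WHERE o.user_id % 7 = 3;

CREATE MATERIALIZED VIEW mv_orders_part_4 AS
SELECT o.order_id, o.user_id, o.product_id, d.description
FROM orders AS o INNER JOIN product_description AS d ON o.product_id = d.product_id
WHERE o.user_id % 7 = 4;

CREATE MATERIALIZED VIEW mv_orders_part_5 AS
SELECT o.order_id, o.user_id, o.product_id, d.description
FROM orders AS o INNER JOIN product_description AS d ON o.product_id = d.product_id
WHERE o.user_id % 7 = 5;

CREATE MATERIALIZED VIEW mv_orders_part_6 AS
SELECT o.order_id, o.user_id, o.product_id, d.description
FROM orders AS o INNER JOIN product_description AS d ON o.product_id = d.product_id
WHERE o.user_id % 7 = 6;

-- A non-materialized view is enough to union the partitions for querying.
CREATE VIEW orders_with_description AS
SELECT * FROM mv_orders_part_0
UNION ALL SELECT * FROM mv_orders_part_1
UNION ALL SELECT * FROM mv_orders_part_2
UNION ALL SELECT * FROM mv_orders_part_3
UNION ALL SELECT * FROM mv_orders_part_4
UNION ALL SELECT * FROM mv_orders_part_5
UNION ALL SELECT * FROM mv_orders_part_6;

Queries against orders_with_description see the combined result, while updates for a hot product_id are spread across the 7 partition MVs, provided user_id is reasonably uniformly distributed for that product.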

Please note that this workaround depends on finding a column like user_id that has high cardinality and is distributed as uniformly as possible.
