Learn about the different ways to ingest data into RisingWave, including continuous streaming, one-time batch loads, and periodic ingestion.
The `postgres-cdc` connector can be used to perform a one-time snapshot of a PostgreSQL table. For other databases, such as MySQL, you can use the corresponding CDC connector and set `snapshot.mode` to `initial_only`.
`INSERT` statement.
`datagen` connector to generate mock data streams.
Data Source | Continuous Streaming | One-Time Batch | Periodic | Notes |
---|---|---|---|---|
Apache Kafka | ✅ | ❌ | ⚠️ | Streaming only; periodic via external tools |
Redpanda | ✅ | ❌ | ⚠️ | Streaming only; periodic via external tools |
Apache Pulsar | ✅ | ❌ | ⚠️ | Streaming only; periodic via external tools |
AWS Kinesis | ✅ | ❌ | ⚠️ | Streaming only; periodic via external tools |
Google Pub/Sub | ✅ | ❌ | ⚠️ | Streaming only; periodic via external tools |
NATS JetStream | ✅ | ❌ | ⚠️ | Streaming only; periodic via external tools |
MQTT | ✅ | ❌ | ⚠️ | Streaming only; periodic via external tools |
PostgreSQL CDC | ✅ | ✅ | ⚠️ | CDC for streaming; direct connection for batch |
MySQL CDC | ✅ | ✅ | ⚠️ | CDC for streaming; direct connection for batch |
SQL Server CDC | ✅ | ✅ | ⚠️ | CDC for streaming; direct connection for batch |
MongoDB CDC | ✅ | ✅ | ⚠️ | CDC for streaming; direct connection for batch |
AWS S3 | ❌ | ✅ | ⚠️ | Batch only; periodic via external tools |
Google Cloud Storage | ❌ | ✅ | ⚠️ | Batch only; periodic via external tools |
Azure Blob | ❌ | ✅ | ⚠️ | Batch only; periodic via external tools |
Apache Iceberg | ❌ | ✅ | ⚠️ | Batch only; periodic via external tools |
Datagen | ✅ | ❌ | ❌ | Test data generation only |
Direct INSERT | ❌ | ✅ | ⚠️ | Manual insertion; periodic via external tools |
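
The table lists periodic ingestion for direct `INSERT` as something handled by external tools. As a rough illustration of what such a tool might do on each run, the sketch below builds one multi-row, parameterized `INSERT` statement from a batch of rows. The helper name `build_batch_insert`, the `orders` table, and its columns are hypothetical, and the `%s` placeholder style assumes a PostgreSQL-compatible Python driver. It is a minimal sketch under those assumptions, not a recommended loader.

```python
from typing import Iterable, List, Sequence, Tuple


def build_batch_insert(
    table: str,
    columns: Sequence[str],
    rows: Iterable[Sequence[object]],
) -> Tuple[str, List[object]]:
    """Build one multi-row INSERT with %s placeholders and a flat parameter list.

    Hypothetical helper for illustration. `table` and `columns` are
    interpolated directly into the SQL text, so they must come from
    trusted configuration, never from user input.
    """
    materialized = [tuple(row) for row in rows]
    if not materialized:
        raise ValueError("no rows to insert")

    width = len(columns)
    for row in materialized:
        if len(row) != width:
            raise ValueError("row length does not match column count")

    # One "(%s, %s, ...)" group per row; values travel separately as parameters.
    row_placeholder = "(" + ", ".join(["%s"] * width) + ")"
    sql = "INSERT INTO {} ({}) VALUES {}".format(
        table,
        ", ".join(columns),
        ", ".join([row_placeholder] * len(materialized)),
    )
    params = [value for row in materialized for value in row]
    return sql, params


# Example batch for a hypothetical `orders` table.
sql, params = build_batch_insert("orders", ["id", "amount"], [(1, 9.5), (2, 3.0)])
```

An external scheduler such as cron could run a script that collects the rows for the current interval, calls this helper, and passes `sql` and `params` to the driver's `cursor.execute(sql, params)`. Keeping values in the parameter list instead of formatting them into the statement text avoids quoting errors.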