Dedicated compute node
By default, a compute node can process both streaming jobs and ad-hoc queries (i.e. serving). Alternatively, you can process streaming and serving by different compute nodes respectively, in order to mitigate resource contention, or to isolate streaming or serving failure. This guides introduces how to set up such dedicated compute nodes to decouple streaming and serving.
To decouple streaming and serving, you need at least one compute node with the streaming
role and one compute node with the serving
role.
When launching a compute node, its role can be specified via either the --role
command line argument, or RW_COMPUTE_NODE_ROLE
environment variable.
You need to restart the node to update the role. A role can be one of:
both
: The default role if not specified. Indicates that the compute node is available for both streaming and serving.serving
: Indicates that the compute node is only available for serving.streaming
: Indicates that the compute node is only available for streaming.