Developing stream applications with user-defined parallelism
The @parallel
annotation allows developers to easily
take advantage of data-parallelism. In the streaming context, data-parallelism means
replicating copies of operators, and splitting streams so that different tuples go to each
set of replicas. The process of replicating operators, and creating all of the new streams
to connect them, is called the parallel transformation. When developers add
@parallel
to a primitive or composite operator invocation, Teracloud
Streams will perform the parallel transformation at submission time. The goal is to improve
overall application throughput by executing the replicas in parallel.