Home
Developing
Learn about different Teracloud Streams tools and features to write, compile, run, and test streams applications.
Developing stream applications
Try out tutorials and explore details about stream application development, SPL features, and best practices.
Performance recommendations
You can use these performance options and recommendations to improve your stream application.
Load Splitting
You can split the computation load to improve performance if more processing power is available.
Merging or joining the streams after splitting
After you split the computation load within or across processing elements, you must merge the data streams. If you split the load with user-defined parallelism, this step is done for you by Teracloud® Streams.

Welcome
Learn about the core capabilities of Teracloud® Streams, its architecture, and key concepts.
Installing
Use this information to install, upgrade, and uninstall the Teracloud® Streams product.
Configuring
Create a basic or an enterprise domain which is a single point for configuring and managing common resources, security, and instances.
Administering
Administer the product by using the Teracloud® Streams graphical user interface, APIs, or the streamtool command-line interface.
Developing
Learn about different Teracloud Streams tools and features to write, compile, run, and test streams applications.
- Development concepts
  Development of stream applications consists of several components such as operators, streams, tuples, Streams Processing Language, toolkits, and more.
- Developing stream applications
  Try out tutorials and explore details about stream application development, SPL features, and best practices.
  - Tutorials
    Learn how to create simple stream applications by completing the tutorials.
  - Compiling stream applications
    The Streams Processing Language (SPL) Compiler is called sc and is included with the Teracloud® Streams installation package. You compile SPL to create files to run on your Teracloud Streams instance.
  - Application bundle files
    An application bundle file is a single, relocatable file that contains all the artifacts that are needed to run your application. You submit a application bundle file to run on an instance.
  - SPL features
    Teracloud® Streams provides features and functions to help you write your applications to fulfill your business needs.
  - Best practices
    Use these tips to write operators that perform effectively in their application, and in other applications.
  - Performance recommendations
    You can use these performance options and recommendations to improve your stream application.
    - Attribute on streams
      A common operation on a stream is to add an attribute.
    - Operations on Collective Types
      Temporary objects are sometimes introduced when you use mapped operators. If those temporary objects affect performance, you can write the code in a way that does not introduce temporary objects.
    - Loop Invariants
      When you initialize a tuple in a loop, some attributes might be invariant.
    - Runtime invariants
      Some computations are runtime invariant. They can be computed on startup and remain constant throughout the execution of the program.
    - Operator Merging
      Going from one operator to the next is not free, even if the operators are in the same PE. You can merge two or more operators to improve performance.
    - Load Splitting
      You can split the computation load to improve performance if more processing power is available.
      - Intra-PE load splitting
        You can split the computation load within a processing element (PE) by using user-defined parallelism or multiple threads within a PE.
      - Inter-host load splitting
        Inter-host load splitting divides the computation load across hosts. You can use methods similar to intra-processing element (PE) splitting with the Split operator except that there is no need for threaded ports on the downstream operators. You can alternatively use the parallel annotation and user-defined parallelism.
      - Merging or joining the streams after splitting
        After you split the computation load within or across processing elements, you must merge the data streams. If you split the load with user-defined parallelism, this step is done for you by Teracloud® Streams.
- Developing native functions
  Extend SPL's computational capabilities by creating native functions written in C++ or Java.
- Developing custom operators
  Create custom operators if shipped toolkits do not provide the necessary logic or behavior needed for your stream applications.
- Developing custom toolkits
  Bundle and reuse custom functions and operators across several stream applications by creating custom toolkits.
- Enabling Streams data exchange
  Teracloud® Streams provides a data exchange REST API for inserting and retrieving tuples within a job to easily integrate with other data services and external applications. Stream applications can enable the data exchange feature by using one or more Endpoint operators.
- Debugging stream applications
  Debug stream applications using the interactive, command line-based Streams Debugger (sdb).
Troubleshooting
Resolve problems with Teracloud® Streams using the troubleshooting tools provided with the product as well as the resources offered by Teracloud Support.
Reference
Find details on the SPL language, toolkits, APIs, commands, and more.
Glossary
Use this glossary to find terms and definitions for Teracloud® Streams.

Merging or joining the streams after splitting

After you split the computation load within or across processing elements, you must merge the data streams. If you split the load with user-defined parallelism, this step is done for you by Teracloud® Streams.

The most effective way to merge streams after splitting is by feeding them into the downstream operator. However, there is a case for which you cannot do this. Consider a slightly modified version of the SplitParallelizer composite that has an output stream (instead of Sink operators) :

namespace sample;
composite SplitParallelizer(output out; input in) {
  param    
    operator $source; 
    operator $body;   
    type     $typeIn;
    type     $typeOut;
  graph

    stream<$typeIn> Src = $source(in) {} 

    (stream<$typeIn> Splitted0   
     <%for(my $i=1; $i<4; ++$i) {%>
       ,stream<$typeIn> Splitted<%=$i%> 
    <%}%>) = ThreadedSplit(Src) {} 
 
    <%for(my $i=0; $i<4; ++$i) {%>   
      stream<$typeOut> Processed<%=$i%>
      = $body(Splitted<%=$i%>)   
    <%}%>

    stream<$typeOut> Processed
     = Filter(Processed0
    <%for(my $i=1; $i<4; ++$i) {%>   
      , Processed<%=$i%>  
    <%}%>) {}   
}

The preferred operator to merge or join the split streams is a Filter operator. Unlike the Functor and Union operators, the Filter operator does not copy the input tuples; it forwards them to the output stream.