Home
Developing
Learn about different Teracloud Streams tools and features to write, compile, run, and test streams applications.
Developing stream applications
Try out tutorials and explore details about stream application development, SPL features, and best practices.
Performance recommendations
You can use these performance options and recommendations to improve your stream application.
Load Splitting
You can split the computation load to improve performance if more processing power is available.
Intra-PE load splitting
You can split the computation load within a processing element (PE) by using user-defined parallelism or multiple threads within a PE.

Welcome
Learn about the core capabilities of Teracloud® Streams, its architecture, and key concepts.
Installing
Use this information to install, upgrade, and uninstall the Teracloud® Streams product.
Configuring
Create a basic or an enterprise domain which is a single point for configuring and managing common resources, security, and instances.
Administering
Administer the product by using the Teracloud® Streams graphical user interface, APIs, or the streamtool command-line interface.
Developing
Learn about different Teracloud Streams tools and features to write, compile, run, and test streams applications.
- Development concepts
  Development of stream applications consists of several components such as operators, streams, tuples, Streams Processing Language, toolkits, and more.
- Developing stream applications
  Try out tutorials and explore details about stream application development, SPL features, and best practices.
  - Tutorials
    Learn how to create simple stream applications by completing the tutorials.
  - Application directory files and structure
    Learn about the different files and common directories that make up an SPL application. View the recommended structure so the SPL compiler and other Teracloud® Streams development tools work best.
  - Compiling stream applications
    Stream applications must be compiled before they can be run. Teracloud® Streams offers the SPL compiler (sc program) to compile SPL applications into a Streams application bundle file (SAB) that contains artifacts needed to run the application.
  - Application bundle files
    A Streams application bundle file (SAB) is a single, relocatable file that contains artifacts needed to run your application. The file can be invoked directly to run the application on a single host or submitted to a Streams instance.
  - SPL features
    SPL provides features and functions to help you write your applications to fulfill your business needs.
  - Best practices
    Use these tips to write operators that perform effectively in their application, and in other applications.
  - Performance recommendations
    You can use these performance options and recommendations to improve your stream application.
    - Attribute on streams
      A common operation on a stream is to add an attribute.
    - Operations on Collective Types
      Temporary objects are sometimes introduced when you use mapped operators. If those temporary objects affect performance, you can write the code in a way that does not introduce temporary objects.
    - Loop Invariants
      When you initialize a tuple in a loop, some attributes might be invariant.
    - Runtime invariants
      Some computations are runtime invariant. They can be computed on startup and remain constant throughout the execution of the program.
    - Operator Merging
      Going from one operator to the next is not free, even if the operators are in the same PE. You can merge two or more operators to improve performance.
    - Load Splitting
      You can split the computation load to improve performance if more processing power is available.
      - Intra-PE load splitting
        You can split the computation load within a processing element (PE) by using user-defined parallelism or multiple threads within a PE.
      - Inter-host load splitting
        Inter-host load splitting divides the computation load across hosts. You can use methods similar to intra-processing element (PE) splitting with the Split operator except that there is no need for threaded ports on the downstream operators. You can alternatively use the parallel annotation and user-defined parallelism.
      - Merging or joining the streams after splitting
        After you split the computation load within or across processing elements, you must merge the data streams. If you split the load with user-defined parallelism, this step is done for you by Teracloud® Streams.
- Developing native functions
  Extend SPL's computational capabilities by creating native functions written in C++ or Java.
- Developing custom operators
  Create custom operators if shipped toolkits do not provide the necessary logic or behavior needed for your stream applications.
- Developing custom toolkits
  Bundle and reuse custom functions and operators across several stream applications by creating custom toolkits.
- Working with files
  Applications and operators can read and write to files in several ways. If relative paths are used, the data directory is used as the root of relative paths. To enable logic to access the runtime application or toolkit directory hierarchies, language-specific functions can be used.
- Enabling Streams data exchange
  Teracloud® Streams provides a data exchange REST API for inserting and retrieving tuples within a job to easily integrate with other data services and external applications. Stream applications can enable the data exchange feature by using one or more Endpoint operators.
- Debugging stream applications
  Debug stream applications using the interactive, command line-based Streams Debugger (sdb).
Troubleshooting
Resolve problems with Teracloud® Streams using the troubleshooting tools provided with the product as well as the resources offered by Teracloud Support.
Reference
Find details on the SPL language, toolkits, APIs, commands, and more.
Glossary
Use this glossary to find terms and definitions for Teracloud® Streams.

Intra-PE load splitting

You can split the computation load within a processing element (PE) by using user-defined parallelism or multiple threads within a PE.

Using the parallel annotation

Use the @parallel annotation and user-defined parallelism if you want to split the load by automatically replicating specific subgraphs of an application. When operators from outside of the parallel region are fused with operators inside of the parallel region, the operators are replicated inside the PE and threaded ports are inserted. For more information, see User-defined parallelism.

Using the ThreadedSplit operator

Use the ThreadedSplit operator if you do not need to control the split output.

This graphic shows a ThreadedSplit operator for non-content based, cyclic assignment.

Using the Split operator

Use the Split operator to feed operators with threaded input ports if you want to control the split output.

This graphic shows a Split operator for operator-configured, potentially content-based assignment.

Generate the replicated parallel code by using mixed-mode code in a composite operator. For example:

namespace sample;

composite SplitParallelizer() {
  param    
    operator $source;  
    operator $body;   
    type $type;
  graph
    stream<$type> Src = $source() {} 

  (stream<$type> Splitted0   
    <%for(my $i=1; $i<4; ++$i) {%>      // example of 4 way parallelization
      ,stream<$type> Splitted<%=$i%> 
  <%}%>) = ThreadedSplit(Src) {}  

  <%for(my $i=0; $i<4; ++$i) {%>   
    () as Sink<%=$i%> = $body(Splitted<%=$i%>)   
  <%}%>
}

namespace sample;
 
composite MySource(output stream<MyType> Out) {
  graph
    // source operator here 
}

composite MyLoopBody (input In) {
  graph
    // series of operators here 
}

composite Main {
  () as Sink = SplitParallelizer() {  
  param   
    source : sample::MySource;
    body   : sample::MyLoopBody; 
    type   : MyType; 
   } 
}

You can ensure that operators are placed in the same PE by specifying partition collocation constraints. For example:

config placement : partitionColocation("myPartColocationId");

When multiple hosts are available, you can parallelize across hosts by using a Split operator.