BatchStage

Type Parameters:
T - the type of items coming out of this stage

public interface BatchStage<T> extends GeneralStage<T>

A stage in a distributed computation pipeline that will observe a finite amount of data (a batch). It accepts input from its upstream stages (if any) and passes its output to its downstream stages.

Method Summary

Modifier and Type | Method and Description
---|---
<A,R> BatchStage<R> | aggregate(AggregateOperation1<? super T,A,? extends R> aggrOp) - Attaches to this stage a stage that performs the given aggregate operation over all the items it receives.
<T1,A,R> BatchStage<R> | aggregate2(BatchStage<T1> stage1, AggregateOperation2<? super T,? super T1,A,? extends R> aggrOp) - Attaches to this stage a stage that performs the given aggregate operation over all the items it receives from both this stage and stage1 you supply.
<T1,T2,A,R> BatchStage<R> | aggregate3(BatchStage<T1> stage1, BatchStage<T2> stage2, AggregateOperation3<? super T,? super T1,? super T2,A,? extends R> aggrOp) - Attaches to this stage a stage that performs the given aggregate operation over all the items it receives from this stage as well as stage1 and stage2 you supply.
default AggregateBuilder<T> | aggregateBuilder() - Returns a fluent API builder object to construct an aggregating stage with any number of contributing stages.
<R> BatchStage<R> | customTransform(String stageName, DistributedSupplier<Processor> procSupplier) - Attaches to this stage a stage with a custom transform based on the provided supplier of Core API Processors.
BatchStage<T> | filter(DistributedPredicate<T> filterFn) - Attaches to this stage a filtering stage, one which applies the provided predicate function to each input item to decide whether to pass the item to the output or to discard it.
<C> BatchStage<T> | filterUsingContext(ContextFactory<C> contextFactory, DistributedBiPredicate<? super C,? super T> filterFn) - Attaches to this stage a filtering stage, one which applies the provided predicate function to each input item to decide whether to pass the item to the output or to discard it.
<R> BatchStage<R> | flatMap(DistributedFunction<? super T,? extends Traverser<? extends R>> flatMapFn) - Attaches to this stage a flat-mapping stage, one which applies the supplied function to each input item independently and emits all the items from the Traverser it returns.
<C,R> BatchStage<R> | flatMapUsingContext(ContextFactory<C> contextFactory, DistributedBiFunction<? super C,? super T,? extends Traverser<? extends R>> flatMapFn) - Attaches to this stage a flat-mapping stage, one which applies the supplied function to each input item independently and emits all items from the Traverser it returns as the output items.
<K> StageWithGrouping<T,K> | groupingKey(DistributedFunction<? super T,? extends K> keyFn) - Specifies the function that will extract the grouping key from the items in the associated pipeline stage, as the first step in the construction of a group-and-aggregate stage.
<K,T1_IN,T1,R> BatchStage<R> | hashJoin(BatchStage<T1_IN> stage1, JoinClause<K,? super T,? super T1_IN,? extends T1> joinClause1, DistributedBiFunction<T,T1,R> mapToOutputFn) - Attaches to both this and the supplied stage a hash-joining stage and returns it.
<K1,T1_IN,T1,K2,T2_IN,T2,R> BatchStage<R> | hashJoin2(BatchStage<T1_IN> stage1, JoinClause<K1,? super T,? super T1_IN,? extends T1> joinClause1, BatchStage<T2_IN> stage2, JoinClause<K2,? super T,? super T2_IN,? extends T2> joinClause2, DistributedTriFunction<T,T1,T2,R> mapToOutputFn) - Attaches to this and the two supplied stages a hash-joining stage and returns it.
default HashJoinBuilder<T> | hashJoinBuilder() - Returns a fluent API builder object to construct a hash join operation with any number of contributing stages.
<R> BatchStage<R> | map(DistributedFunction<? super T,? extends R> mapFn) - Attaches to this stage a mapping stage, one which applies the supplied function to each input item independently and emits the function's result as the output item.
<C,R> BatchStage<R> | mapUsingContext(ContextFactory<C> contextFactory, DistributedBiFunction<? super C,? super T,? extends R> mapFn) - Attaches to this stage a mapping stage, one which applies the supplied function to each input item independently and emits the function's result as the output item.
default BatchStage<T> | peek() - Adds a peeking layer to this compute stage which logs its output.
default BatchStage<T> | peek(DistributedFunction<? super T,? extends CharSequence> toStringFn) - Adds a peeking layer to this compute stage which logs its output.
BatchStage<T> | peek(DistributedPredicate<? super T> shouldLogFn, DistributedFunction<? super T,? extends CharSequence> toStringFn) - Attaches a peeking stage which logs this stage's output and passes it through without transformation.
BatchStage<T> | setLocalParallelism(int localParallelism) - Sets the preferred local parallelism (number of workers per Jet cluster member) this stage will configure its DAG vertices with.
BatchStage<T> | setName(String name) - Overrides the default name of the stage with the name you choose.
Methods inherited from interface GeneralStage: addTimestamps, addTimestamps, drainTo

Methods inherited from interface Stage: getPipeline, name
Method Detail

groupingKey

@Nonnull <K> StageWithGrouping<T,K> groupingKey(@Nonnull DistributedFunction<? super T,? extends K> keyFn)

Description copied from interface: GeneralStage
Specifies the function that will extract the grouping key from the items in the associated pipeline stage, as the first step in the construction of a group-and-aggregate stage.

Specified by: groupingKey in interface GeneralStage<T>
Type Parameters:
K - type of the key
Parameters:
keyFn - function that extracts the grouping key
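A group-and-aggregate built on groupingKey might look like the following minimal sketch (not part of the original Javadoc; it assumes Jet 0.7-era pipeline APIs, and the IList/IMap names are hypothetical):

```java
import com.hazelcast.jet.aggregate.AggregateOperations;
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class GroupingKeyExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        p.drawFrom(Sources.<String>list("words"))      // hypothetical IList source
         .groupingKey(word -> word.substring(0, 1))    // group by first letter
         .aggregate(AggregateOperations.counting())    // one Entry<String, Long> per group
         .drainTo(Sinks.map("countsByFirstLetter"));   // hypothetical IMap sink
        // Submit with jetInstance.newJob(p).join() on a running cluster.
    }
}
```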
map

@Nonnull <R> BatchStage<R> map(@Nonnull DistributedFunction<? super T,? extends R> mapFn)
Description copied from interface: GeneralStage
Attaches to this stage a mapping stage, one which applies the supplied function to each input item independently and emits the function's result as the output item. If the result is null, it emits nothing. Therefore this stage can be used to implement filtering semantics as well.

Specified by: map in interface GeneralStage<T>
Type Parameters:
R - the result type of the mapping function
Parameters:
mapFn - a stateless mapping function
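A minimal map sketch (not from the original Javadoc; Jet 0.7-era APIs assumed, list names hypothetical):

```java
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class MapExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        p.drawFrom(Sources.<String>list("lines"))   // hypothetical IList source
         .map(String::toUpperCase)                  // a null result would emit nothing
         .drainTo(Sinks.list("upperCased"));        // hypothetical IList sink
    }
}
```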
filter

@Nonnull BatchStage<T> filter(@Nonnull DistributedPredicate<T> filterFn)
Description copied from interface: GeneralStage
Attaches to this stage a filtering stage, one which applies the provided predicate function to each input item to decide whether to pass the item to the output or to discard it.

Specified by: filter in interface GeneralStage<T>
Parameters:
filterFn - a stateless filter predicate function
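A minimal filter sketch (not from the original Javadoc; Jet 0.7-era APIs assumed, list names hypothetical):

```java
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class FilterExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        p.drawFrom(Sources.<Integer>list("numbers"))  // hypothetical IList source
         .filter(n -> n % 2 == 0)                     // keep only even numbers
         .drainTo(Sinks.list("evenNumbers"));         // hypothetical IList sink
    }
}
```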
flatMap

@Nonnull <R> BatchStage<R> flatMap(@Nonnull DistributedFunction<? super T,? extends Traverser<? extends R>> flatMapFn)
Description copied from interface: GeneralStage
Attaches to this stage a flat-mapping stage, one which applies the supplied function to each input item independently and emits all the items from the Traverser it returns. The traverser must be null-terminated.

Specified by: flatMap in interface GeneralStage<T>
Type Parameters:
R - the type of items in the result's traversers
Parameters:
flatMapFn - a stateless flatmapping function, whose result type is Jet's Traverser
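A minimal flatMap sketch splitting lines into words (not from the original Javadoc; Jet 0.7-era APIs assumed, list names hypothetical):

```java
import com.hazelcast.jet.Traversers;
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class FlatMapExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        p.drawFrom(Sources.<String>list("lines"))                        // hypothetical IList source
         // One output item per word; traverseArray yields a null-terminated Traverser.
         .flatMap(line -> Traversers.traverseArray(line.split("\\W+")))
         .drainTo(Sinks.list("words"));                                  // hypothetical IList sink
    }
}
```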
mapUsingContext

@Nonnull <C,R> BatchStage<R> mapUsingContext(@Nonnull ContextFactory<C> contextFactory, @Nonnull DistributedBiFunction<? super C,? super T,? extends R> mapFn)
Description copied from interface: GeneralStage
Attaches to this stage a mapping stage, one which applies the supplied function to each input item independently and emits the function's result as the output item. The mapping function receives another parameter, the context object, which Jet will create using the supplied contextFactory. If the mapping result is null, it emits nothing. Therefore this stage can be used to implement filtering semantics as well.

NOTE: Any state you maintain in the context object does not automatically become part of a fault-tolerant snapshot. If Jet must restore from a snapshot, your state will either be lost (if it was just local state) or not rewound to the checkpoint (if it was stored in some durable storage).

Specified by: mapUsingContext in interface GeneralStage<T>
Type Parameters:
C - type of context object
R - the result type of the mapping function
Parameters:
contextFactory - the context factory
mapFn - a stateless mapping function
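A minimal mapUsingContext sketch, where the context object is a read-only dictionary created once per processor (not from the original Javadoc; Jet 0.7-era APIs assumed, and the list names and dictionary contents are hypothetical):

```java
import java.util.Collections;
import java.util.Map;
import com.hazelcast.jet.pipeline.ContextFactory;
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class MapUsingContextExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        ContextFactory<Map<String, String>> dictionary = ContextFactory.withCreateFn(
                jet -> Collections.singletonMap("hello", "bonjour"));
        p.drawFrom(Sources.<String>list("words"))                         // hypothetical IList source
         .mapUsingContext(dictionary, (dict, word) -> dict.get(word))     // null lookups emit nothing
         .drainTo(Sinks.list("translated"));                              // hypothetical IList sink
    }
}
```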
filterUsingContext

@Nonnull <C> BatchStage<T> filterUsingContext(@Nonnull ContextFactory<C> contextFactory, @Nonnull DistributedBiPredicate<? super C,? super T> filterFn)
Description copied from interface: GeneralStage
Attaches to this stage a filtering stage, one which applies the provided predicate function to each input item to decide whether to pass the item to the output or to discard it. The predicate function receives another parameter, the context object, which Jet will create using the supplied contextFactory.

NOTE: Any state you maintain in the context object does not automatically become part of a fault-tolerant snapshot. If Jet must restore from a snapshot, your state will either be lost (if it was just local state) or not rewound to the checkpoint (if it was stored in some durable storage).

Specified by: filterUsingContext in interface GeneralStage<T>
Type Parameters:
C - type of context object
Parameters:
contextFactory - the context factory
filterFn - a stateless filter predicate function
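A minimal filterUsingContext sketch, using a shared stop-word set as the context object (not from the original Javadoc; Jet 0.7-era APIs assumed, list names and stop words hypothetical):

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;
import com.hazelcast.jet.pipeline.ContextFactory;
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class FilterUsingContextExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        ContextFactory<Set<String>> stopWords = ContextFactory.withCreateFn(
                jet -> new HashSet<>(Arrays.asList("a", "an", "the")));
        p.drawFrom(Sources.<String>list("words"))                          // hypothetical IList source
         .filterUsingContext(stopWords, (stop, word) -> !stop.contains(word))
         .drainTo(Sinks.list("significantWords"));                         // hypothetical IList sink
    }
}
```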
flatMapUsingContext

@Nonnull <C,R> BatchStage<R> flatMapUsingContext(@Nonnull ContextFactory<C> contextFactory, @Nonnull DistributedBiFunction<? super C,? super T,? extends Traverser<? extends R>> flatMapFn)
Description copied from interface: GeneralStage
Attaches to this stage a flat-mapping stage, one which applies the supplied function to each input item independently and emits all items from the Traverser it returns as the output items. The traverser must be null-terminated. The mapping function receives another parameter, the context object, which Jet will create using the supplied contextFactory.

NOTE: Any state you maintain in the context object does not automatically become part of a fault-tolerant snapshot. If Jet must restore from a snapshot, your state will either be lost (if it was just local state) or not rewound to the checkpoint (if it was stored in some durable storage).

Specified by: flatMapUsingContext in interface GeneralStage<T>
Type Parameters:
C - type of context object
R - the type of items in the result's traversers
Parameters:
contextFactory - the context factory
flatMapFn - a stateless flatmapping function, whose result type is Jet's Traverser
hashJoin

@Nonnull <K,T1_IN,T1,R> BatchStage<R> hashJoin(@Nonnull BatchStage<T1_IN> stage1, @Nonnull JoinClause<K,? super T,? super T1_IN,? extends T1> joinClause1, @Nonnull DistributedBiFunction<T,T1,R> mapToOutputFn)
Description copied from interface: GeneralStage
Attaches to both this and the supplied stage a hash-joining stage and returns it. Please refer to the package Javadoc for a detailed description of the hash-join transform.

Specified by: hashJoin in interface GeneralStage<T>
Type Parameters:
K - the type of the join key
T1_IN - the type of stage1 items
T1 - the result type of projection on stage1 items
R - the resulting output type
Parameters:
stage1 - the stage to hash-join with this one
joinClause1 - specifies how to join the two streams
mapToOutputFn - function to map the joined items to the output value
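A minimal hashJoin sketch, enriching a primary stage with entries from an IMap (not from the original Javadoc; Jet 0.7-era APIs assumed, and the Trade class and source/sink names are hypothetical):

```java
import java.io.Serializable;
import java.util.Map.Entry;
import com.hazelcast.jet.pipeline.BatchStage;
import com.hazelcast.jet.pipeline.JoinClause;
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class HashJoinExample {
    // Hypothetical item type on the primary stage.
    static class Trade implements Serializable {
        final int brokerId;
        final String ticker;
        Trade(int brokerId, String ticker) { this.brokerId = brokerId; this.ticker = ticker; }
        int brokerId() { return brokerId; }
    }

    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        BatchStage<Trade> trades = p.drawFrom(Sources.<Trade>list("trades"));
        // Enriching stage: an IMap of brokerId -> brokerName, drawn as Map entries.
        BatchStage<Entry<Integer, String>> brokers =
                p.drawFrom(Sources.<Integer, String>map("brokers"));
        trades.hashJoin(brokers,
                JoinClause.joinMapEntries(Trade::brokerId),   // join key: Trade.brokerId == entry key
                (trade, brokerName) -> trade.ticker + " via " + brokerName)
              .drainTo(Sinks.list("enrichedTrades"));
    }
}
```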
hashJoin2

@Nonnull <K1,T1_IN,T1,K2,T2_IN,T2,R> BatchStage<R> hashJoin2(@Nonnull BatchStage<T1_IN> stage1, @Nonnull JoinClause<K1,? super T,? super T1_IN,? extends T1> joinClause1, @Nonnull BatchStage<T2_IN> stage2, @Nonnull JoinClause<K2,? super T,? super T2_IN,? extends T2> joinClause2, @Nonnull DistributedTriFunction<T,T1,T2,R> mapToOutputFn)
Description copied from interface: GeneralStage
Attaches to this and the two supplied stages a hash-joining stage and returns it. Please refer to the package Javadoc for a detailed description of the hash-join transform.

Specified by: hashJoin2 in interface GeneralStage<T>
Type Parameters:
K1 - the type of key for stage1
T1_IN - the type of stage1 items
T1 - the result type of projection of stage1 items
K2 - the type of key for stage2
T2_IN - the type of stage2 items
T2 - the result type of projection of stage2 items
R - the resulting output type
Parameters:
stage1 - the first stage to join
joinClause1 - specifies how to join with stage1
stage2 - the second stage to join
joinClause2 - specifies how to join with stage2
mapToOutputFn - function to map the joined items to the output value
hashJoinBuilder

@Nonnull default HashJoinBuilder<T> hashJoinBuilder()
Description copied from interface: GeneralStage
Returns a fluent API builder object to construct a hash join operation with any number of contributing stages. For one or two stages prefer the direct stage.hashJoinN(...) calls because they offer more static type safety.

Specified by: hashJoinBuilder in interface GeneralStage<T>
peek

@Nonnull default BatchStage<T> peek()
Description copied from interface: GeneralStage
Adds a peeking layer to this compute stage which logs its output. For each item the stage emits, it logs the result of its toString() method at the INFO level to the log category com.hazelcast.jet.impl.processor.PeekWrappedP.<vertexName>#<processorIndex>. The stage logs each item on whichever cluster member it happens to receive it. Its primary purpose is for development use, when running Jet on a local machine.

Specified by: peek in interface GeneralStage<T>
See Also: GeneralStage.peek(DistributedPredicate, DistributedFunction), GeneralStage.peek(DistributedFunction)
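During development, peek() can be dropped between any two stages to trace what flows through, as in this minimal sketch (not from the original Javadoc; Jet 0.7-era APIs assumed, list names hypothetical):

```java
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class PeekExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        p.drawFrom(Sources.<String>list("lines"))  // hypothetical IList source
         .map(String::trim)
         .peek()                                   // logs each trimmed line at INFO level
         .drainTo(Sinks.list("trimmed"));          // hypothetical IList sink
    }
}
```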
peek

@Nonnull BatchStage<T> peek(@Nonnull DistributedPredicate<? super T> shouldLogFn, @Nonnull DistributedFunction<? super T,? extends CharSequence> toStringFn)
Description copied from interface: GeneralStage
Attaches a peeking stage which logs this stage's output and passes it through without transformation. For each item the stage emits, it:
1. uses the shouldLogFn predicate to see whether to log the item
2. if yes, uses toStringFn to get the item's string representation
3. logs the string at the INFO level to the log category com.hazelcast.jet.impl.processor.PeekWrappedP.<vertexName>#<processorIndex>

Specified by: peek in interface GeneralStage<T>
Parameters:
shouldLogFn - a function to filter the logged items. You can use alwaysTrue() as a pass-through filter when you don't need any filtering.
toStringFn - a function that returns a string representation of the item
See Also: GeneralStage.peek(DistributedFunction), GeneralStage.peek()
peek

@Nonnull default BatchStage<T> peek(@Nonnull DistributedFunction<? super T,? extends CharSequence> toStringFn)
Description copied from interface: GeneralStage
Adds a peeking layer to this compute stage which logs its output. For each item the stage emits, it uses toStringFn to get a string representation of the item, then logs it at the INFO level to the log category com.hazelcast.jet.impl.processor.PeekWrappedP.<vertexName>#<processorIndex>.

Specified by: peek in interface GeneralStage<T>
Parameters:
toStringFn - a function that returns a string representation of the item
See Also: GeneralStage.peek(DistributedPredicate, DistributedFunction), GeneralStage.peek()
customTransform

@Nonnull <R> BatchStage<R> customTransform(@Nonnull String stageName, @Nonnull DistributedSupplier<Processor> procSupplier)
Description copied from interface: GeneralStage
Attaches to this stage a stage with a custom transform based on the provided supplier of Core API Processors. To be compatible with the rest of the pipeline, the processor must expect a single inbound edge and arbitrarily many outbound edges, and it must push the same data to all outbound edges.

Note that the returned stage's type parameter is inferred from the call site and not propagated from the processor that will produce the result, so there is no actual type safety provided.

Specified by: customTransform in interface GeneralStage<T>
Type Parameters:
R - the type of the output items
Parameters:
stageName - a human-readable name for the custom stage
procSupplier - the supplier of processors
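A minimal customTransform sketch that wraps a Core API mapping processor (not from the original Javadoc; Jet 0.7-era APIs assumed, list names hypothetical):

```java
import com.hazelcast.jet.core.processor.Processors;
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class CustomTransformExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        p.drawFrom(Sources.<String>list("lines"))   // hypothetical IList source
         // The <String> witness is only a call-site assertion; nothing checks it
         // against what the processor actually emits.
         .<String>customTransform("to-upper", Processors.mapP((String s) -> s.toUpperCase()))
         .drainTo(Sinks.list("upperCased"));        // hypothetical IList sink
    }
}
```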
aggregate

@Nonnull <A,R> BatchStage<R> aggregate(@Nonnull AggregateOperation1<? super T,A,? extends R> aggrOp)
Attaches to this stage a stage that performs the given aggregate operation over all the items it receives. The aggregating stage emits a single item.

Type Parameters:
A - the type of the accumulator used by the aggregate operation
R - the type of the result
Parameters:
aggrOp - the aggregate operation to perform
See Also: AggregateOperations
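A minimal aggregate sketch (not from the original Javadoc; Jet 0.7-era APIs assumed, list names hypothetical):

```java
import com.hazelcast.jet.aggregate.AggregateOperations;
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class AggregateExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        p.drawFrom(Sources.<Integer>list("numbers"))          // hypothetical IList source
         .aggregate(AggregateOperations.summingLong(n -> n))  // the stage emits a single Long
         .drainTo(Sinks.list("sum"));                         // hypothetical IList sink
    }
}
```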
aggregate2

@Nonnull <T1,A,R> BatchStage<R> aggregate2(@Nonnull BatchStage<T1> stage1, @Nonnull AggregateOperation2<? super T,? super T1,A,? extends R> aggrOp)
Attaches to this stage a stage that performs the given aggregate operation over all the items it receives from both this stage and stage1 you supply. The aggregate operation must specify a separate accumulator function for each of the two streams (refer to its Javadoc for a simple example). The aggregating stage emits a single item.

Type Parameters:
T1 - type of items in stage1
A - type of the accumulator used by the aggregate operation
R - type of the result
Parameters:
aggrOp - the aggregate operation to perform
See Also: AggregateOperations
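A two-input aggregation might be sketched as follows, with a shared accumulator and one accumulate function per input stage (not from the original Javadoc; the AggregateOperation builder chain follows the Jet 0.7-era API as best understood here, and the list names are hypothetical):

```java
import com.hazelcast.jet.accumulator.LongAccumulator;
import com.hazelcast.jet.aggregate.AggregateOperation;
import com.hazelcast.jet.pipeline.BatchStage;
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class Aggregate2Example {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        BatchStage<Integer> numbers = p.drawFrom(Sources.<Integer>list("numbers"));
        BatchStage<String> words = p.drawFrom(Sources.<String>list("words"));
        // Sum the numbers from stage 0 and count the words from stage 1 into one total.
        numbers.aggregate2(words, AggregateOperation
                .withCreate(LongAccumulator::new)
                .<Integer>andAccumulate0((acc, n) -> acc.add(n))
                .<String>andAccumulate1((acc, w) -> acc.add(1))
                .andFinish(LongAccumulator::get))
               .drainTo(Sinks.list("combinedTotal"));  // a single Long item
    }
}
```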
aggregate3

@Nonnull <T1,T2,A,R> BatchStage<R> aggregate3(@Nonnull BatchStage<T1> stage1, @Nonnull BatchStage<T2> stage2, @Nonnull AggregateOperation3<? super T,? super T1,? super T2,A,? extends R> aggrOp)
Attaches to this stage a stage that performs the given aggregate operation over all the items it receives from this stage as well as stage1 and stage2 you supply. The aggregate operation must specify a separate accumulator function for each of the three streams (refer to its Javadoc for a simple example). The aggregating stage emits a single item.

Type Parameters:
T1 - type of items in stage1
T2 - type of items in stage2
A - type of the accumulator used by the aggregate operation
R - type of the result
Parameters:
aggrOp - the aggregate operation to perform
See Also: AggregateOperations
aggregateBuilder

@Nonnull default AggregateBuilder<T> aggregateBuilder()
Returns a fluent API builder object to construct an aggregating stage with any number of contributing stages. For one or two stages prefer the direct stage.aggregateN(...) calls because they offer more static type safety.

setLocalParallelism

@Nonnull BatchStage<T> setLocalParallelism(int localParallelism)
Description copied from interface: Stage
Sets the preferred local parallelism (number of workers per Jet cluster member) this stage will configure its DAG vertices with.

Note that, while most stages are backed by one vertex, there are exceptions. If a stage uses two vertices, each of them will have the given local parallelism, so in total there will be twice as many processing units per member.

The default value is -1; it signals Jet to figure out a default value. Jet will determine the vertex's local parallelism during job initialization from the global default and the processor meta-supplier's preferred value.

Specified by: setLocalParallelism in interface Stage
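A minimal setLocalParallelism sketch, pinning the mapping stage's parallelism (not from the original Javadoc; Jet 0.7-era APIs assumed, list names hypothetical):

```java
import com.hazelcast.jet.pipeline.Pipeline;
import com.hazelcast.jet.pipeline.Sinks;
import com.hazelcast.jet.pipeline.Sources;

public class SetLocalParallelismExample {
    public static void main(String[] args) {
        Pipeline p = Pipeline.create();
        p.drawFrom(Sources.<String>list("lines"))  // hypothetical IList source
         .map(String::toUpperCase)
         .setLocalParallelism(2)                   // two parallel processors per member for this stage
         .drainTo(Sinks.list("out"));              // hypothetical IList sink
    }
}
```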
Copyright © 2018 Hazelcast, Inc. All rights reserved.