public final class HdfsSinks extends Object
Modifier and Type | Method and Description
---|---
static <K,V> Sink<Map.Entry<K,V>> | hdfs(org.apache.hadoop.mapred.JobConf jobConf) Convenience for hdfs(JobConf, DistributedFunction, DistributedFunction) which expects Map.Entry<K, V> as input and extracts its key and value parts to be written to HDFS.
static <E,K,V> Sink<E> | hdfs(org.apache.hadoop.mapred.JobConf jobConf, DistributedFunction<? super E,K> extractKeyF, DistributedFunction<? super E,V> extractValueF) Returns a sink that writes to Apache Hadoop HDFS.
@Nonnull public static <E,K,V> Sink<E> hdfs(@Nonnull org.apache.hadoop.mapred.JobConf jobConf, @Nonnull DistributedFunction<? super E,K> extractKeyF, @Nonnull DistributedFunction<? super E,V> extractValueF)

Returns a sink that writes to Apache Hadoop HDFS. It transforms each received item into a key-value pair using the supplied extractKeyF and extractValueF functions.

The sink creates a number of files in the output path, identified by the cluster member ID and the Processor ID. Unlike MapReduce, the data in the files is not sorted by key.

The supplied JobConf must specify an OutputFormat with a path.

No state is saved to snapshot for this sink. After the job is restarted, the items will likely be duplicated, providing an at-least-once guarantee.

The default local parallelism for this processor is 2 (or fewer if fewer CPUs are available).
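The key-value decomposition performed by the two extractor functions can be sketched with plain JDK types. This is a minimal illustration only: TradeEvent and toPair are hypothetical names, and java.util.function.Function stands in for DistributedFunction; none of them are part of the Hazelcast Jet API.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Map;
import java.util.function.Function;

public class HdfsExtractorSketch {
    // Hypothetical item type, standing in for the sink's stream item type E.
    static final class TradeEvent {
        final String ticker;
        final long volume;
        TradeEvent(String ticker, long volume) {
            this.ticker = ticker;
            this.volume = volume;
        }
    }

    // What the sink conceptually does with each item: apply extractKeyF and
    // extractValueF to produce the key-value pair written to the output file.
    static <E, K, V> Map.Entry<K, V> toPair(E item,
                                            Function<? super E, K> extractKeyF,
                                            Function<? super E, V> extractValueF) {
        return new SimpleEntry<>(extractKeyF.apply(item), extractValueF.apply(item));
    }

    public static void main(String[] args) {
        Map.Entry<String, Long> pair =
                toPair(new TradeEvent("AAPL", 100L), e -> e.ticker, e -> e.volume);
        System.out.println(pair.getKey() + "=" + pair.getValue()); // AAPL=100
    }
}
```

In the real sink, this pair production happens once per received item before the pair is handed to the configured OutputFormat.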
Type Parameters:
E - stream item type
K - type of key to write to HDFS
V - type of value to write to HDFS

Parameters:
jobConf - JobConf used for output format configuration
extractKeyF - mapper to map a key to another key
extractValueF - mapper to map a value to another value

@Nonnull public static <K,V> Sink<Map.Entry<K,V>> hdfs(@Nonnull org.apache.hadoop.mapred.JobConf jobConf)
Convenience for hdfs(JobConf, DistributedFunction, DistributedFunction) which expects Map.Entry<K, V> as input and extracts its key and value parts to be written to HDFS.

Copyright © 2018 Hazelcast, Inc. All rights reserved.
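The convenience overload behaves as if Entry::getKey and Entry::getValue were passed as the extractor functions. A JDK-only sketch of that decomposition follows; EntryConvenienceSketch and asRecordLine are hypothetical names, and the tab-separated formatting is only illustrative, since the actual on-disk record layout depends on the configured OutputFormat.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Map;

public class EntryConvenienceSketch {
    // The no-extractor overload splits each Map.Entry into its key and value
    // parts, equivalent to passing Entry::getKey and Entry::getValue to the
    // three-argument form.
    static <K, V> String asRecordLine(Map.Entry<K, V> entry) {
        // Illustrative text formatting of one sink record: key part, then
        // value part. The real format is decided by the OutputFormat.
        return entry.getKey() + "\t" + entry.getValue();
    }

    public static void main(String[] args) {
        Map.Entry<String, Integer> entry = new SimpleEntry<>("word", 42);
        System.out.println(asRecordLine(entry)); // word	42
    }
}
```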