Streaming

Defines the parameters that enables you to create and run Map/Reduce jobs with any executable or script as the mapper and or the reducer

The following table describes the Streaming parameters.

Parameter

Description

Input path

Defines the input file for the Map step

Output path

Defines the HDFS output path for the reduce step

Mapper Command

Defines the command that runs as a mapper

Reducer Command

Defines the command that runs as a reducer

Streaming options

Defines the sets of parameters and values that are added to the end of the Streaming command line

Generic Options

Defines the sets of parameters and values that are added to the Streaming command line

Parent Topic

Hadoop Job parameters