Friday, 1 February 2013

Explain how input and output data format of the Hadoop framework? | Hadoop Interview Questions

The MapReduce framework operates exclusively on pairs, that is, the framework views the input to the job as a set of pairs and produces a set of pairs as the output of the job, conceivably of different types.
See the flow mentioned below :
(input) -> map -> -> combine/sorting -> -> reduce -> (output)

No comments: