At a high level, the user of the MapReduce framework needs to specify the following:
- The job’s input location(s) in the distributed file system.
- The job’s output location in the distributed file system.
- The input format.
- The output format.
- The class containing the map function.
- The class containing the reduce function (optional).
- The JAR file containing the mapper, reducer, and driver classes.
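
The list above can be sketched as a small plain-Java value type that checks which settings are mandatory and which are optional. This is only an illustration: `JobSpec`, its field names, and the sample paths and class names are hypothetical and not part of any framework. Assuming the framework in question is Hadoop, these items would roughly correspond to calls such as `FileInputFormat.addInputPath`, `FileOutputFormat.setOutputPath`, `Job.setInputFormatClass`, `Job.setOutputFormatClass`, `Job.setMapperClass`, `Job.setReducerClass`, and `Job.setJarByClass`; the sketch does not call them, so it runs without Hadoop on the classpath.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Optional;

public class JobSpecSketch {

    /** Hypothetical holder for the items listed above; not a framework class. */
    static final class JobSpec {
        final List<String> inputPaths;        // one or more input locations
        final String outputPath;              // single output location
        final String inputFormat;
        final String outputFormat;
        final String mapperClass;
        final Optional<String> reducerClass;  // empty for a map-only job
        final String jarFile;

        JobSpec(List<String> inputPaths, String outputPath,
                String inputFormat, String outputFormat,
                String mapperClass, String reducerClass, String jarFile) {
            if (inputPaths == null || inputPaths.isEmpty()) {
                throw new IllegalArgumentException("at least one input path is required");
            }
            this.inputPaths = Collections.unmodifiableList(new ArrayList<>(inputPaths));
            this.outputPath = require(outputPath, "output path");
            this.inputFormat = require(inputFormat, "input format");
            this.outputFormat = require(outputFormat, "output format");
            this.mapperClass = require(mapperClass, "mapper class");
            // The reducer is the only item that may be left out (pass null).
            this.reducerClass = Optional.ofNullable(reducerClass);
            this.jarFile = require(jarFile, "JAR file");
        }

        /** Rejects a missing (null or blank) required setting. */
        private static String require(String value, String name) {
            if (value == null || value.trim().isEmpty()) {
                throw new IllegalArgumentException(name + " is required");
            }
            return value;
        }

        /** True when no reducer class was supplied. */
        boolean isMapOnly() {
            return !reducerClass.isPresent();
        }
    }

    public static void main(String[] args) {
        // All paths and class names below are made-up placeholders.
        JobSpec wordCount = new JobSpec(
                Arrays.asList("/data/in/part1", "/data/in/part2"), "/data/out",
                "TextInputFormat", "TextOutputFormat",
                "WordCountMapper", "WordCountReducer", "wordcount.jar");

        JobSpec filterOnly = new JobSpec(
                Arrays.asList("/data/in"), "/data/filtered",
                "TextInputFormat", "TextOutputFormat",
                "FilterMapper", null, "filter.jar");

        System.out.println(wordCount.isMapOnly());   // prints false
        System.out.println(filterOnly.isMapOnly());  // prints true
    }
}
```

Modelling the reducer as an `Optional` while every other field is validated in the constructor mirrors the list: a job with no reduce function is still a complete job, whereas a job with no mapper, no input, or no output location is not.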