Secondary sorting aka sorting values in Hadoop’s Map/Reduce programming paradigm

Introduction

Sometimes we would like to sort the values coming into the Reducer of a Hadoop Map/Reduce (MR) job. You can sort the values indirectly by combining three implementations:

1. Use a composite key.
2. Extend org.apache.hadoop.mapreduce.Partitioner.
3. Extend org.apache.hadoop.io.WritableComparator.

Other tutorials that explain this approach to sorting values going into a […]
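To make the three pieces from the introduction concrete, here is a minimal plain-Java sketch of how they might cooperate. It deliberately does not use the Hadoop classes: `SecondarySortSketch`, `CompositeKey`, `partition`, `SORT`, `GROUP`, and `shuffle` are hypothetical stand-ins, and the stock-symbol/timestamp data is invented for illustration. The assumption is that the composite key carries both the natural key and the value to sort by, that partitioning and grouping look only at the natural key, and that sorting looks at both.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SecondarySortSketch {

    // Hypothetical composite key: the natural key (symbol) plus the
    // value we want sorted (timestamp).
    public static final class CompositeKey {
        public final String symbol;
        public final long timestamp;

        public CompositeKey(String symbol, long timestamp) {
            this.symbol = symbol;
            this.timestamp = timestamp;
        }
    }

    // Stand-in for a Partitioner: choose a partition from the natural key
    // only, so every record for one symbol reaches the same reducer.
    public static int partition(CompositeKey key, int numPartitions) {
        return (key.symbol.hashCode() & Integer.MAX_VALUE) % numPartitions;
    }

    // Stand-in for the sort comparator: natural key first, then timestamp.
    public static final Comparator<CompositeKey> SORT =
            Comparator.comparing((CompositeKey k) -> k.symbol)
                      .thenComparingLong(k -> k.timestamp);

    // Stand-in for the grouping comparator: natural key only.
    public static final Comparator<CompositeKey> GROUP =
            Comparator.comparing((CompositeKey k) -> k.symbol);

    // Simulates what a single reducer would see: keys are sorted with SORT,
    // then adjacent keys that GROUP treats as equal share one group.
    public static Map<String, List<Long>> shuffle(List<CompositeKey> mapOutput) {
        List<CompositeKey> sorted = new ArrayList<>(mapOutput);
        sorted.sort(SORT);

        Map<String, List<Long>> groups = new LinkedHashMap<>();
        CompositeKey previous = null;
        for (CompositeKey key : sorted) {
            if (previous == null || GROUP.compare(previous, key) != 0) {
                groups.put(key.symbol, new ArrayList<>());
            }
            groups.get(key.symbol).add(key.timestamp);
            previous = key;
        }
        return groups;
    }

    public static void main(String[] args) {
        List<CompositeKey> mapOutput = Arrays.asList(
                new CompositeKey("b", 3L),
                new CompositeKey("a", 2L),
                new CompositeKey("a", 1L),
                new CompositeKey("b", 1L));
        System.out.println(shuffle(mapOutput)); // prints {a=[1, 2], b=[1, 3]}
    }
}
```

In a real job, the equivalent classes would be registered on the `Job` with `setPartitionerClass`, `setSortComparatorClass`, and `setGroupingComparatorClass`; the sketch only imitates the ordering and grouping behaviour those settings are meant to produce.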