It merges only the intermediate data segments from maptasks. For many Star Student Project data intensive Mapreduce algorithms data shuffling can lead to a significant number of disk operations, contending for the limited I/O bandwidth.
https://www.blogger.com/blogger.g?blogID=5786748987822045519#editor/target=post;postID=43882069535042307