Microsoft has announced it plans for Hadoop recently, and they have come with map reduce that forms an integral part of Apache hadoop. Before we dig deep into it, I would like to give you an overview of understanding Hadoop and Big Data.

Hadoop is an elastic distributed schema-less data processing platform which is ideal for scenarios where you have huge volume of data with low per-record value. A typical example is twitter and face book where there is a huge volume of data which cannot be grouped into a schema but at the same time as different file formats ranging from json, xml, image etc.., It is a good parsing solution for processing sophisticated data. Continue reading »

Tags: Hadoop