Industrial large data set using hadoop