MapReduce++: Simplified Processing of Unstructured Data on Large Computing Clouds

Principal Investigator’s Organization (PIO):

ITU, Lahore

Principal Investigator (PI):

Dr. Umar Saif

Summary

This pure research project aimed to design, implement, and release (under the open-source GPL license) a software system for parallel processing of large data sets on computing clouds. The overall goal was to advance the state of the art in cloud computing by improving the performance and scalability of Google’s MapReduce for a diverse set of applications and data sets. The resulting version of the MapReduce cloud computing framework is designed for heterogeneous, medium-sized clusters in the developing world.