Configured Hadoop in multi-cluster node in AWS cloud; analyzed the 22 years of Airline data i.e 130 million flight records (implemented in MapReduce) and found interesting facts on category wise performance, the probability of airline being on schedule, most common reason for cancellation