The Project contains code and explanations for creating and analyzing the large dataset related to supply chain service. The code and explanations contains the following task: Exploring the data quality and data distribution of the dataset using pandas and matplotlib libraries. Creating new features based on the existing data and applying proper statistical test to make sense of raw data