Variation in data read by machine learning module to the actual data present in CSV file