Easy to understand, yet highly efficient statistical analysis, on 'Used cars for sale dataset' from ebay. A large dataset, with over 370.000 sale records through 2016, in the german market. This is a deep cleaning and data mining exercise, going through the steps of resurrecting a seemingly dirty and unreliable dataset, to transforming it into a clean file. And uncovering hidden meaning in each variable. Tools used: #Python
You can find the code here.
Let's Socialize