Removing duplicates in a smart efficient way
FinalEDA, data quality exploration and duplicate cleaning can be found in the notebooks section.
Our duplicates detection function can be found in the scripts section.
To test the duplicate detection function, go to test and run pytest on test.py using "python -m pytest test.py"