Jupyter Notebook from the talk "1 + 1 = 1 or Record Deduplication with Python", presented at PyBay 2018 and PyGotham 2018. The `slides.ipynb` version was presented at PyBay, while the `slides-reduced.ipynb` version was presented at PyGotham.
It's possible to run the `slides-reduced.ipynb` version online! Click here:
To run the notebooks locally, install libpostal (instructions here) and run `pip install -r requirements.txt`. Then run `jupyter notebook`.
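Before launching the notebook, it can help to confirm that the dependencies are importable. The following is a minimal sketch, not part of the original repository: it assumes the libpostal Python bindings are installed under the module name `postal` and that Jupyter provides a `notebook` module. Both names are assumptions, so adjust the list to match what `requirements.txt` actually installs.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found in this environment."""
    return importlib.util.find_spec(module_name) is not None


def missing_modules(names):
    """Return the subset of `names` that cannot be found, in the same order."""
    return [name for name in names if not is_installed(name)]


# Assumed module names (hypothetical for this repository):
# `postal` for the libpostal bindings, `notebook` for Jupyter Notebook.
REQUIRED_MODULES = ["postal", "notebook"]

if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules found; run `jupyter notebook`.")
```

The check only looks the modules up without importing them, so it does not load libpostal's data files.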