vaneseltine / deduplication-slides

"1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook

1 + 1 = 1 or Record Deduplication with Python

Jupyter Notebook from the talk "1 + 1 = 1 or Record Deduplication with Python", presented at PyBay 2018 and PyGotham 2018. The slides.ipynb version was presented at PyBay, while the slides-reduced.ipynb version was presented at PyGotham.

Running (Binder)

The slides-reduced.ipynb version can be run online through Binder, with no local installation required.

Running (Local)

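Install libpostal by following its installation instructions, then run pip install -r requirements.txt. Once both steps succeed, run jupyter notebook and open slides.ipynb or slides-reduced.ipynb.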
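Before launching the notebook, it can help to confirm that the local setup is in place. The following is a minimal sketch, not part of the repository: the helper name check_prerequisites is hypothetical, and the module names it checks by default (postal, the Python bindings for libpostal, and notebook) are assumptions about what requirements.txt installs.

```python
import importlib.util
import shutil


def check_prerequisites(modules=("postal", "notebook"), commands=("jupyter",)):
    """Report which assumed prerequisites are available in this environment.

    The default module and command names are assumptions, not taken
    from the repository's requirements.txt.
    """
    report = {}
    for name in modules:
        # find_spec returns None when a top-level module cannot be located
        report[f"module:{name}"] = importlib.util.find_spec(name) is not None
    for name in commands:
        # shutil.which returns None when the executable is not on PATH
        report[f"command:{name}"] = shutil.which(name) is not None
    return report


if __name__ == "__main__":
    for item, ok in check_prerequisites().items():
        print(f"{item}: {'found' if ok else 'MISSING'}")
```

If an entry is reported as MISSING, revisit the corresponding installation step before running jupyter notebook.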
