Felix Vadan's repositories
ai-powered-search
Work in Progress for AI-Powered Search (Manning Publications)
amazon-textract-code-samples
Amazon Textract Code Samples
amazon-textract-textractor
Analyze documents with Amazon Textract and generate output in multiple formats.
attend-copy-parse
Code for paper Attend, Copy, Parse - End-to-end information extraction from documents (https://arxiv.org/abs/1812.07248)
AutoClean
Package for automated data cleaning in Python.
camelot
Camelot: PDF Table Extraction for Humans
deduplication-slides
"1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook
doc-similarity-lite
low configuration document similarity with sqlite
effectivepython
Effective Python: Second Edition — Source Code and Errata for the Book
figaro
Figaro Programming Language and Core Libraries
google-sheets-geocoding-macro
Geocode from addresses to latitude / longitude, and vice versa using Google Sheets
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
html-table-extractor
extract data from html table
industry-machine-learning
A curated list of applied machine learning and data science notebooks and libraries accross different industries.
materials
Bonus materials, exercises, and example projects for our Python tutorials
notes
Technical Notes On Using Data Science & Artificial Intelligence To Fight For Something That Matters.
pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
probablepeople
:family: a python library for parsing unstructured western names into name components.
pyod
A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
python-nameparser
A simple Python module for parsing human names into their individual components
python-samples
🐍 Python samples for G Suite products.
Real-World-Projects-in-Python-3.x
Real World Projects in Python 3.x [Video], published by Packt
recordlinkage
A toolkit for record linkage and deduplication written in Python
risk-slim
simple customizable risk scores in python
spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
streamlit-example
Example Streamlit app that you can fork to test out share.streamlit.io
YEDDA
YEDDA: A Lightweight Collaborative Text Span Annotation Tool. Code for ACL 2018 Best Demo Paper Nomination.