Thomas Endres's starred repositories
postgresml
The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.
python-bigquery-dataframes
BigQuery DataFrames
Live-data-streaming-from-RDS-Postgres-to-Redshift
Clod Formation templates from Key2Market to set up Live data streaming from RDS Postgres to Redshift
DDEXPythonParser
DDEX Python Parser
discogs-xml2db
Imports the discogs.com monthly XML dumps into databases
audit-trigger
Simple, easily customised trigger-based auditing for PostgreSQL (Postgres). See also pgaudit.
audit_trigger
Project migrated to : https://gitlab.com/Oslandia/audit_trigger
pg_similarity
set of functions and operators for executing similarity queries
probablepeople
:family: a python library for parsing unstructured western names into name components.
postgresopen-2017
Scalable in-database machine learning with PL/Python: Postgres Open SV 2017 talk
dedupe-examples
:id: Examples for using the dedupe library
recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
language-detect
A language detection module.
langdetect
Port of Google's language-detection library to Python.
elasticsearch-entity-resolution
Elasticsearch entity resolution plugin based on Duke
Police-Analysis-Python
Open Source Tutorial For Analyzing & Visualizing 60 Million Police Stops Using Python
OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
python-slugify
Returns unicode slugs