Thomas Endres's starred repositories

postgresml

The GPU-powered AI application database. Get your app to market faster using the simplicity of SQL and the latest NLP, ML + LLM models.

Language: Rust | License: MIT | Stargazers: 5770 | Issues: 0

python-bigquery-dataframes

BigQuery DataFrames

Language: Python | License: Apache-2.0 | Stargazers: 174 | Issues: 0

RapidFuzz

Rapid fuzzy string matching in Python using various string metrics

Language: C++ | License: MIT | Stargazers: 2494 | Issues: 0

splink

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Language: Python | License: MIT | Stargazers: 1196 | Issues: 0

Live-data-streaming-from-RDS-Postgres-to-Redshift

CloudFormation templates from Key2Market to set up live data streaming from RDS Postgres to Redshift

Language: JavaScript | Stargazers: 7 | Issues: 0

DDEXPythonParser

DDEX Python Parser

Language: Python | Stargazers: 13 | Issues: 0

discogs-xml2db

Imports the discogs.com monthly XML dumps into databases

Language: C# | License: Apache-2.0 | Stargazers: 202 | Issues: 0

pgaudit

PostgreSQL Audit Extension

Language: C | License: NOASSERTION | Stargazers: 1252 | Issues: 0

pg_cron

Run periodic jobs in PostgreSQL

Language: C | License: PostgreSQL | Stargazers: 2705 | Issues: 0

shebang

PDF and support scripts for shebang PostgreSQL talk

Language: Shell | Stargazers: 3 | Issues: 0

audit-trigger

Simple, easily customised trigger-based auditing for PostgreSQL (Postgres). See also pgaudit.

Language: PLpgSQL | License: NOASSERTION | Stargazers: 654 | Issues: 0

audit_trigger

Project migrated to: https://gitlab.com/Oslandia/audit_trigger

Language: PLpgSQL | License: NOASSERTION | Stargazers: 7 | Issues: 0

Pyrseas

Provides utilities for Postgres database schema versioning.

Language: Python | License: BSD-3-Clause | Stargazers: 394 | Issues: 0

pg_similarity

Set of functions and operators for executing similarity queries

Language: C | License: BSD-3-Clause | Stargazers: 357 | Issues: 0

probablepeople

A Python library for parsing unstructured Western names into name components.

Language: Python | License: MIT | Stargazers: 582 | Issues: 0

rake-nltk

Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.

Language: Python | License: MIT | Stargazers: 1058 | Issues: 0

postgresopen-2017

Scalable in-database machine learning with PL/Python: Postgres Open SV 2017 talk

Language: Jupyter Notebook | Stargazers: 16 | Issues: 0

dedupe-examples

Examples for using the dedupe library

Language: Python | License: MIT | Stargazers: 399 | Issues: 0

recordlinkage

A powerful and modular toolkit for record linkage and duplicate detection in Python

Language: Python | License: BSD-3-Clause | Stargazers: 931 | Issues: 0

language-detect

A language detection module.

Language: Python | Stargazers: 11 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 0

langdetect

Port of Google's language-detection library to Python.

Language: Python | License: NOASSERTION | Stargazers: 1677 | Issues: 0

tracing

Utilities for tracing program execution line-by-line

Language: Python | License: MIT | Stargazers: 32 | Issues: 0

Duke

Duke is a fast and flexible deduplication engine written in Java

Language: Java | License: Apache-2.0 | Stargazers: 613 | Issues: 0

elasticsearch-entity-resolution

Elasticsearch entity resolution plugin based on Duke

Language: Java | License: Apache-2.0 | Stargazers: 211 | Issues: 0

ui-stack

A Chrome extension that lets you inspect a website's framework and libraries

Language: TypeScript | License: MIT | Stargazers: 177 | Issues: 0

Police-Analysis-Python

Open Source Tutorial For Analyzing & Visualizing 60 Million Police Stops Using Python

Language: Jupyter Notebook | License: MIT | Stargazers: 44 | Issues: 0

OpenRefine

OpenRefine is a free, open source power tool for working with messy data and improving it

Language: Java | License: BSD-3-Clause | Stargazers: 10650 | Issues: 0

mimesis

Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.

Language: Python | License: MIT | Stargazers: 4354 | Issues: 0

python-slugify

Returns unicode slugs

Language: Python | License: MIT | Stargazers: 1469 | Issues: 0