DataOmbudsman's starred repositories

shap

A game theoretic approach to explain the output of any machine learning model.

Language: Jupyter Notebook | License: MIT | Stargazers: 22657 | Issues: 241 | Issues: 2538

awesome-production-machine-learning

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

pybind11

Seamless operability between C++11 and Python

Language: C++ | License: NOASSERTION | Stargazers: 15601 | Issues: 245 | Issues: 2124

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python | License: NOASSERTION | Stargazers: 13864 | Issues: 203 | Issues: 2322

FreeTube

An Open Source YouTube app for privacy

Language: JavaScript | License: AGPL-3.0 | Stargazers: 13316 | Issues: 186 | Issues: 3138

OpenRefine

OpenRefine is a free, open source power tool for working with messy data and improving it

Language: Java | License: BSD-3-Clause | Stargazers: 10837 | Issues: 472 | Issues: 3114

user.js

Firefox privacy, security and anti-tracking: a comprehensive user.js template for configuration and hardening

Language: JavaScript | License: MIT | Stargazers: 10069 | Issues: 170 | Issues: 1238

snorkel

A system for quickly generating training data with weak supervision

Language: Python | License: Apache-2.0 | Stargazers: 5793 | Issues: 167 | Issues: 980

fawkes

Fawkes, a privacy-preserving tool against facial recognition systems. More info at https://sandlab.cs.uchicago.edu/fawkes

Language: Python | License: BSD-3-Clause | Stargazers: 5184 | Issues: 114 | Issues: 162

Production-Level-Deep-Learning

A guideline for building practical production-level deep learning systems to be deployed in real world applications.

pandera

A light-weight, flexible, and expressive statistical data testing library

Language: Python | License: MIT | Stargazers: 3311 | Issues: 20 | Issues: 871

texthero

Text preprocessing, representation and visualization from zero to hero.

Language: Python | License: MIT | Stargazers: 2882 | Issues: 42 | Issues: 120

hiplot

HiPlot makes understanding high dimensional data easy

Language: TypeScript | License: MIT | Stargazers: 2743 | Issues: 28 | Issues: 89

dataprep

Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

Language: Python | License: MIT | Stargazers: 2040 | Issues: 27 | Issues: 414

lets-plot

Multiplatform plotting library based on the Grammar of Graphics

Language: Kotlin | License: MIT | Stargazers: 1554 | Issues: 171 | Issues: 612

CanvasBlocker

A Firefox extension to protect from being fingerprinted.

Language: JavaScript | License: MPL-2.0 | Stargazers: 1152 | Issues: 50 | Issues: 611

ipyvizzu

Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 949 | Issues: 17 | Issues: 90

pynndescent

A Python nearest neighbor descent for approximate nearest neighbors

Language: Python | License: BSD-2-Clause | Stargazers: 881 | Issues: 14 | Issues: 137

deploying-machine-learning-models

Code for the online course "Deployment of Machine Learning Models"

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 793 | Issues: 31 | Issues: 16

pdpipe

Easy pipelines for pandas DataFrames.

Language: Jupyter Notebook | License: MIT | Stargazers: 713 | Issues: 17 | Issues: 53

Neat-URL

Neat URL cleans URLs, removing parameters such as Google Analytics' utm parameters.

Language: JavaScript | License: NOASSERTION | Stargazers: 619 | Issues: 16 | Issues: 210

webextension-skip-redirect

Some web pages use intermediary pages before redirecting to a final page. This add-on tries to extract the final url from the intermediary url and goes there straight away if successful.

Language: JavaScript | License: MIT | Stargazers: 482 | Issues: 21 | Issues: 223

ipyplot

IPyPlot is a small Python package offering fast and efficient plotting of images inside Python notebooks. It uses IPython with HTML for a faster, richer, and more interactive way of displaying large numbers of images.

Language: Python | License: MIT | Stargazers: 413 | Issues: 8 | Issues: 46

metriculous

Measure and visualize machine learning model performance without the usual boilerplate.

Language: Python | License: MIT | Stargazers: 94 | Issues: 4 | Issues: 6

incdbscan

Implementation of IncrementalDBSCAN clustering.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 57 | Issues: 4 | Issues: 2

email-providers

List of free email providers

Language: Python | License: MIT | Stargazers: 53 | Issues: 10 | Issues: 2

S3M

S3M: Siamese Stack (Trace) Similarity Measure

scikit-ext

Various scikit-learn extensions

Language: Python | License: MIT | Stargazers: 6 | Issues: 0 | Issues: 0
Language: HTML | License: NOASSERTION | Stargazers: 2 | Issues: 2 | Issues: 0