Olivier Binette (OlivierBinette)

Company: Duke University

Location: Durham, NC

Home Page: https://olivierbinette.github.io/


Organizations
cleanzr
Data-Visualization-for-Family-Health
Duke-Chronicle-Project
forensic-science
gazetedentarihebakis
STA-690-S21
STA-790-ER
sta199-02-fall2021
sta199-fa21-003
UrbanLandUse
Valires

Olivier Binette's repositories

er-evaluation

An End-to-End Evaluation Framework for Entity Resolution Systems

Language: Python | License: AGPL-3.0 | Stargazers: 7 | Issues: 0 | Issues: 0

groupbyrule

Deduplicate data using fuzzy and deterministic matching rules.

Language: Python | License: GPL-3.0 | Stargazers: 7 | Issues: 1 | Issues: 4
Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 6 | Issues: 1 | Issues: 12

TessTools

Tools for the use of Tesseract OCR in R

simple-typo-tolerant-search

Efficient typo-tolerant search in 76 lines of code, with no dependencies.

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

CSVMeta

Lightweight csv read/write, keeping track of csv dialect and other metadata.

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

JSM-2023

ER-Evaluation Demo for JSM 2023

Language: HTML | License: AGPL-3.0 | Stargazers: 1 | Issues: 1 | Issues: 0

splink

Implementation in Apache Spark of the EM algorithm to estimate parameters of Fellegi-Sunter's canonical model of record linkage.

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

streamlit-survey

Survey components for Streamlit apps

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

Stargazers: 0 | Issues: 0 | Issues: 0

deepchecks

Tests for Continuous Validation of ML Models & Data. Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

duckdb

DuckDB is an in-process SQL OLAP Database Management System

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

facets

Visualizations for machine learning datasets

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

FeatureStore-lite

A lightweight feature store for Pandas, DuckDB, or your choice of backend.

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

giskard

🐢 The testing framework for ML models, from tabular to LLMs

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

HandsOnEntityResolution

This repository accompanies the early release of Hands On Entity Resolution

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

mismo

The SQL/Ibis powered sklearn of record linkage

Language: Python | License: LGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

PatentsView-Evaluation

Evaluation and benchmarking of PatentsView disambiguation algorithms

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

RMarkdown-Reproducibility-Template

Template for a reproducible RMarkdown document

License: AGPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

seisbench

SeisBench - A toolbox for machine learning in seismology

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

streamlit-example

Example Streamlit app that you can fork to test out share.streamlit.io

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

trubrics-sdk

Validate your ML models and collect human feedback with Trubrics

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ul-benchmark-datasets-for-entity-resolution-archive

Unofficial archive of https://dbs.uni-leipzig.de/research/projects/benchmark-datasets-for-entity-resolution

Language: HTML | Stargazers: 0 | Issues: 0 | Issues: 0