Curtis G. Northcutt's repositories
benchmarking-keras-pytorch
🔥 Reproducibly benchmarking Keras and PyTorch models
rankpruning
🧹 Formerly for binary classification with noisy labels. Replaced by cleanlab.
confidentlearning-reproduce
Official data release to reproduce Confident Learning paper results
cnn-gpu-benchmarks
Latest (2020) CNN and GPU Benchmarks on ImageNet and CIFAR
ieee-keywords
IEEE Computer Society Keywords to Organize Knowledge
forum-diversification
Add diversity to the order of comments in forums!
reliablity_framework_for_rag
Demo showing how the Trustworthy Language Model adds reliability to LLM outputs and improves RAG, agents, and data enrichment workflows. It can be used to improve fine-tuning of LLMs, accuracy of LLM outputs, and smart routing for RAG and agents.
Awesome-Learning-with-Label-Noise
A curated list of resources for Learning with Noisy Labels
coteaching_plus
ICML'19: How does Disagreement Help Generalization against Label Corruption?
cgnorthcutt.github.io
Curtis G. Northcutt's personal website.
Co-teaching
NeurIPS'18: Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy Labels
documentation-theme-jekyll
A Jekyll-based theme designed for documentation and help systems. See the link for detailed instructions on setting up and configuring everything.
EgoCom-Dataset
EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset
imagenet-simple-labels
Simpler human-readable labels for ImageNet 🏷
noisy_label_understanding_utilizing
ICML 2019: Understanding and Utilizing Deep Neural Networks Trained with Noisy Labels
python-cpu-stress-test-benchmark
Benchmark your CPU multi-thread and single-thread speed without needing sudo.
PyTorch_CIFAR10
Pretrained TorchVision models on the CIFAR10 dataset (with weights)
quickdraw-dataset
Documentation on how to access and use the Quick, Draw! Dataset.
Replica-Dataset
The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797
sentence-similarity
This repository contains various ways to calculate sentence vector similarity using NLP models
templates
Document templates for open-source projects (README, CONTRIBUTING, GitHub templates)