Martin Tutek's starred repositories
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
missing-semester
The Missing Semester of Your CS Education 📚
mplcyberpunk
"Cyberpunk style" for matplotlib plots
Fantasy-Premier-League
Creates a .csv file of all players in the English Premier League with their respective team and total fantasy points
YouTokenToMe
Unsupervised text tokenizer focused on computational efficiency
pytorch_block_sparse
Fast Block Sparse Matrices for PyTorch
sparse_learning
Sparse learning library and sparse momentum resources.
annotated_encoder_decoder
The Annotated Encoder-Decoder with Attention
Better_LSTM_PyTorch
An LSTM in PyTorch with best practices (weight dropout, forget bias, etc.) built-in. Fully compatible with PyTorch LSTM.
eraserbenchmark
A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
sparselandtools
✨ A Python package for sparse representations and dictionary learning, including matching pursuit, K-SVD, and applications.
latent-treelstm
Cooperative Learning of Disjoint Syntax and Semantics
VirtualTeaching
DIY setup for virtual teaching on Ubuntu
Textual-Entailment-New-Protocols
Data release accompanying and documenting the paper "Collecting Entailment Data for Pretraining: New Protocols and Negative Results" by Samuel R. Bowman, Jennimaria Palomaki, Livio Baldini Soares, and Emily Pitler (https://arxiv.org/abs/2004.11997)