Ben Fattori's repositories
Little-GPT
GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!
transformer_shmap
Tensor Parallelism with JAX + Shard Map
ZeRO-transformer
Two implementations of ZeRO-1 optimizer sharding in JAX
Flax-ResNets
CIFAR10 ResNets implemented in JAX+Flax
LeagueMatchScraper
Code to scrape League of Legends matches using the Riot Games API.
RepVGG-CIFAR10
RepVGG models adapted for CIFAR10 and CIFAR100, based on RepVGG: Making VGG-style ConvNets Great Again (Ding et al.)
StochasticDepthNets
PyTorch implementation of ResNet110 as described in Deep Networks with Stochastic Depth (Huang et al.)
wtf-wikipedia-python
Raw Wikipedia XML to LM_Dataformat in under 4 hours
Monte-Carlo-Fractal-Dimensionality
An efficient random-sampling algorithm for estimating the dimension of many basic fractals, implemented in Python.
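The random-sampling idea can be sketched in a few lines of pure Python: sample points on a fractal (here the Sierpinski triangle via the chaos game), then box-count at two scales and take the slope of log N(ε) against log(1/ε). The function names and the choice of scales are illustrative assumptions, not the repo's actual code.

```python
import math
import random

def chaos_game_sierpinski(n_points, seed=0):
    """Sample points on the Sierpinski triangle via the chaos game."""
    rng = random.Random(seed)
    verts = [(0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2)]
    x, y = 0.25, 0.25
    pts = []
    for _ in range(n_points):
        vx, vy = rng.choice(verts)          # jump halfway toward a random vertex
        x, y = (x + vx) / 2, (y + vy) / 2
        pts.append((x, y))
    return pts

def box_count(points, eps):
    """Count occupied boxes of side eps covering the point cloud."""
    return len({(int(x / eps), int(y / eps)) for x, y in points})

def estimate_dimension(points, eps1=1 / 8, eps2=1 / 64):
    """Slope of log N(eps) vs log(1/eps) between two scales."""
    n1, n2 = box_count(points, eps1), box_count(points, eps2)
    return math.log(n2 / n1) / math.log(eps1 / eps2)

d = estimate_dimension(chaos_game_sierpinski(200_000))
```

For the Sierpinski triangle the estimate should land near the true Hausdorff dimension log 3 / log 2 ≈ 1.585, with some error from finite sampling and boundary boxes.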
GeometricDeepLearning
Introductory Geometric Deep Learning Presentation from September 2021
Python-Unigram
Unigram tokenization algorithm in Python
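The core of Unigram tokenization is a Viterbi search for the highest-scoring segmentation of a string under a unigram language model over subword pieces. A minimal sketch of that decoding step (the vocabulary and its log-probabilities below are made up for illustration; training the vocabulary via EM is a separate step):

```python
import math

def viterbi_segment(text, log_probs):
    """Best segmentation of `text` under a unigram LM over subword pieces."""
    n = len(text)
    # best[i]: (score, backpointer) for the best segmentation of text[:i]
    best = [(-math.inf, -1)] * (n + 1)
    best[0] = (0.0, -1)
    for end in range(1, n + 1):
        for start in range(end):
            piece = text[start:end]
            if piece in log_probs:
                score = best[start][0] + log_probs[piece]
                if score > best[end][0]:
                    best[end] = (score, start)
    # Walk backpointers to recover the winning pieces
    pieces, i = [], n
    while i > 0:
        start = best[i][1]
        pieces.append(text[start:i])
        i = start
    return pieces[::-1]

# Toy vocabulary with hand-picked log-probabilities
vocab = {"h": -5.0, "e": -5.0, "l": -5.0, "o": -5.0,
         "he": -2.5, "ll": -2.5, "llo": -2.0, "hello": -9.0}
print(viterbi_segment("hello", vocab))  # → ['he', 'llo']
```

The longer pieces win here because "he" + "llo" scores -4.5, beating both the single piece "hello" (-9.0) and the character-level fallback (-25.0).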
tritonformer
Differentiable transformer in Triton, matching the performance of PyTorch + cuDNN/cuBLAS
CudaSoftmax
Softmax CUDA kernel :)
fattorib.github.io
Website
flashy_linear_attention
Flash linear attention kernels in Triton
Fundamental-Domain
Code to generate a section of the fundamental domain for the action of the special linear group on the space of (integral) binary cubic forms. The code is currently quite inefficient; I hope to optimize it in the future.
fusedswiglu
Fused SwiGLU Triton kernels
InfoGAN-Jax
InfoGAN in Jax with small Gradio app
lm-evaluation-harness
Fork of lm-evaluation-harness for evaluating my custom models
Python-BPE
I wrote Byte-Pair Encoding but it's 600x slower than 🤗
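The slowness is easy to see in the classic training loop: each merge rescans the whole corpus to recount pair frequencies, which is exactly what 🤗's Rust implementation avoids. A minimal sketch of that naive loop (function name and tie-breaking are illustrative assumptions, not the repo's actual code):

```python
from collections import Counter

def learn_bpe(words, num_merges):
    """Learn BPE merges from a {word: frequency} map, the naive way:
    rescan every word for pair counts on each merge (hence the slowness)."""
    # Represent each word as a tuple of symbols, starting from characters
    corpus = {tuple(word): freq for word, freq in words.items()}
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for symbols, freq in corpus.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        # Ties broken by first occurrence; real implementations may differ
        best = max(pairs, key=pairs.get)
        merges.append(best)
        merged = best[0] + best[1]
        new_corpus = {}
        for symbols, freq in corpus.items():
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(merged)
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_corpus[tuple(out)] = new_corpus.get(tuple(out), 0) + freq
        corpus = new_corpus
    return merges

merges = learn_bpe({"low": 5, "lower": 2, "lowest": 3}, num_merges=2)
# → [('l', 'o'), ('lo', 'w')]
```

Each merge costs a full pass over the corpus, so training is O(merges × corpus size) in Python; production tokenizers keep incremental pair counts instead.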
ResNets-CIFAR10
PyTorch implementation of the CIFAR10 ResNets, based on Deep Residual Learning for Image Recognition (He et al.)