msaroufim

Mark Saroufim's repositories

awesome-profiling

Awesome utilities for performance profiling

Apache-2.0140 70

mynotes

Language:Python17 50

mlsys-experiments

stuff

Language:Jupyter Notebook5 40

metal-tutorial

Language:Swift4 20

tinyoptimizer

Language:Python3 30

Triton-Puzzles

Puzzles for learning Triton

Apache-2.0300

cpuoffload

Language:Python2 20

setup

Language:Shell2 20

cpu-offload

Language:PythonUnlicense1 20

algorithmic-efficiency

MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.

Language:PythonApache-2.0010

axolotl

Go ahead and axolotl questions

Language:PythonApache-2.0010

gradient-checkpointing

020

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).

Language:PythonApache-2.0010

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION010

keras-benchmarks-2

Language:PythonApache-2.0010

lecturex

Language:Cuda020

Liger-Kernel

Efficient Triton Kernels for LLM Training

BSD-2-Clause000

lit-llama

Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code

Language:Python010

llama-inference

experiments with inference on llama

Language:Python010

llama2.c

Inference Llama 2 in one file of pure C

Language:PythonMIT010

llm.c

LLM training in simple, raw C/CUDA

MIT000

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT010

microbenchmarks

Language:Python020

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonApache-2.0010

newblog

new blog, who dis?

Language:CSS030

nvcc4jupyter

A plugin for Jupyter Notebook to run CUDA C/C++ code

Language:Python010

pyperformance

Python Performance Benchmark Suite

Language:PythonMIT010

pytorch.github.io

The website for PyTorch

Language:HTMLBSD-3-Clause010

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.0010

subclass_zoo

Language:Python020