Will Berman's starred repositories

kineto

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

Language:HTMLLicense:NOASSERTIONStargazers:674Issues:0Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:26086Issues:0Issues:0

enhancing-transformers

An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch

Language:PythonLicense:MITStargazers:276Issues:0Issues:0

accelerate

πŸš€ A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7467Issues:0Issues:0

surrealdb

A scalable, distributed, collaborative, document-graph database, for the realtime web

Language:RustLicense:NOASSERTIONStargazers:26392Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5828Issues:0Issues:0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonLicense:MITStargazers:7659Issues:0Issues:0

procgen

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

Language:C++License:MITStargazers:992Issues:0Issues:0

quda

QUDA is a library for performing calculations in lattice QCD on GPUs.

Language:C++License:NOASSERTIONStargazers:284Issues:0Issues:0

pytorch_forward_forward

Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation

Language:PythonLicense:MITStargazers:1429Issues:0Issues:0

transformers

πŸ€— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130240Issues:0Issues:0

reth

Modular, contributor-friendly and blazing-fast implementation of the Ethereum protocol, in Rust

Language:RustLicense:Apache-2.0Stargazers:3635Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++License:MITStargazers:33530Issues:0Issues:0

xla

A machine learning compiler for GPUs, CPUs, and ML accelerators

Language:C++License:Apache-2.0Stargazers:2489Issues:0Issues:0

stablehlo

Backward compatible ML compute opset inspired by HLO/MHLO

Language:MLIRLicense:Apache-2.0Stargazers:368Issues:0Issues:0

torchscale

Foundation Architecture for (M)LLMs

Language:PythonLicense:MITStargazers:2984Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:686Issues:0Issues:0

nerf-from-image

Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion

Language:PythonLicense:Apache-2.0Stargazers:376Issues:0Issues:0

lingvo

Lingvo

Language:PythonLicense:Apache-2.0Stargazers:2803Issues:0Issues:0

Arraymancer

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

Language:NimLicense:Apache-2.0Stargazers:1331Issues:0Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:32549Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38456Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:5053Issues:0Issues:0

builder

Flashbots MEV-Boost Block Builder

Language:GoLicense:LGPL-3.0Stargazers:420Issues:0Issues:0

CompilerGym

Reinforcement learning environments for compiler and program optimization tasks

Language:PythonLicense:MITStargazers:889Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:8170Issues:0Issues:0

optimum

πŸš€ Accelerate training and inference of πŸ€— Transformers and πŸ€— Diffusers with easy to use hardware optimization tools

Language:PythonLicense:Apache-2.0Stargazers:2369Issues:0Issues:0

helios

A fast, secure, and portable light client for Ethereum

Language:RustLicense:MITStargazers:1766Issues:0Issues:0

TASO

The Tensor Algebra SuperOptimizer for Deep Learning

Language:C++License:Apache-2.0Stargazers:682Issues:0Issues:0

diffusers

πŸ€— Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24458Issues:0Issues:0