ben fattori (fattorib)

fattorib

Geek Repo

Location:Toronto, Ontario

Home Page:fattorib.github.io

Github PK Tool:Github PK Tool

ben fattori's starred repositories

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29024Issues:341Issues:267

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:18310Issues:155Issues:467

triton

Development repository for the Triton language and compiler

al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language:HTMLLicense:MITStargazers:9604Issues:25Issues:517

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:4297Issues:71Issues:268

algorithmica

A computer science textbook

Language:Jupyter NotebookStargazers:3159Issues:62Issues:67

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2096Issues:33Issues:99

lion-pytorch

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Language:PythonLicense:MITStargazers:1935Issues:15Issues:23

AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

Language:PythonLicense:Apache-2.0Stargazers:1447Issues:28Issues:25
Language:PythonLicense:Apache-2.0Stargazers:1383Issues:21Issues:19

basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

Language:PythonLicense:MITStargazers:1284Issues:22Issues:59

maxas

Assembler for NVIDIA Maxwell architecture

Language:SassLicense:MITStargazers:921Issues:88Issues:11

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language:PythonLicense:MITStargazers:920Issues:13Issues:192

safari

Convolutions for Sequence Modeling

Language:AssemblyLicense:Apache-2.0Stargazers:842Issues:36Issues:38

H3

Language Modeling with the H3 State Space Model

Language:AssemblyLicense:Apache-2.0Stargazers:496Issues:32Issues:26

examples

Example code and applications for machine learning on Graphcore IPUs

Language:PythonLicense:MITStargazers:313Issues:44Issues:3

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language:CudaLicense:MITStargazers:308Issues:3Issues:8
Language:PythonLicense:Apache-2.0Stargazers:237Issues:6Issues:11

minChatGPT

A minimum example of aligning language models with RLHF similar to ChatGPT

Language:PythonLicense:GPL-3.0Stargazers:202Issues:5Issues:4

myGEMM

Code appendix to an OpenCL matrix-multiplication tutorial

Language:CLicense:MITStargazers:159Issues:7Issues:9

poplibs

Poplar libraries

Language:C++License:NOASSERTIONStargazers:114Issues:17Issues:0

language-model-agents

Experiments with generating opensource language model assistants

Language:HTMLLicense:Apache-2.0Stargazers:96Issues:6Issues:9

lovely-jax

JAX Arrays for human consumption

Language:Jupyter NotebookLicense:MITStargazers:87Issues:3Issues:4

tutorials

Training material for IPU users: tutorials, feature examples, simple applications

Language:PythonLicense:MITStargazers:86Issues:10Issues:4

jax-experimental

JAX for Graphcore IPU (experimental)

Language:PythonLicense:Apache-2.0Stargazers:20Issues:5Issues:6

ipu-hpc-cookbook

Useful tutorials and recipes for developers doing low-level work with the Graphcore IPU

Language:C++License:MITStargazers:20Issues:8Issues:1
Language:PythonLicense:MITStargazers:13Issues:3Issues:12
Language:PythonLicense:Apache-2.0Stargazers:12Issues:0Issues:0