VatsaDev

VatsaDev

Geek Repo

Company:free from company

Location:Antartica

Home Page:vatsadev.github.io

Twitter:@_VatsaDev_

Github PK Tool:Github PK Tool

VatsaDev's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49227Issues:561Issues:202

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:35293Issues:358Issues:306

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonLicense:MITStargazers:29306Issues:279Issues:1204

aseprite

Animated sprite editor & pixel art tool (Windows, macOS, Linux)

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:22418Issues:219Issues:125

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5195Issues:38Issues:37

01

The open-source language model computer

Language:PythonLicense:AGPL-3.0Stargazers:4798Issues:83Issues:108

moondream

tiny vision language model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4647Issues:54Issues:98
Language:PythonLicense:Apache-2.0Stargazers:3936Issues:51Issues:112

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language:PythonLicense:Apache-2.0Stargazers:2559Issues:24Issues:162

cramming

Cramming the training of a (BERT-type) language model into limited compute.

Language:PythonLicense:MITStargazers:1271Issues:22Issues:34

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language:PythonLicense:GPL-3.0Stargazers:810Issues:17Issues:9

quiet-star

Code for Quiet-STaR

Language:PythonLicense:Apache-2.0Stargazers:354Issues:13Issues:7

galactic

data cleaning and curation for unstructured text

Language:PythonLicense:Apache-2.0Stargazers:323Issues:8Issues:4

TPU-Alignment

Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:209Issues:7Issues:10

mamba.c

Inference of Mamba models in pure C

xVal

Repository for code used in the xVal paper

Language:Jupyter NotebookStargazers:107Issues:19Issues:5

gptcore

Fast modular code to create and train cutting edge LLMs

Language:PythonLicense:Apache-2.0Stargazers:61Issues:9Issues:7

othello_mamba

Evaluating the Mamba architecture on the Othello game

quartic-transformer

Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)

Language:PythonLicense:MITStargazers:39Issues:4Issues:0

FaceRWKV

Course Project for COMP4471 on RWKV

Language:Jupyter NotebookStargazers:16Issues:3Issues:0

Lilith

Using the lilith optimizer on nanogpt

Language:PythonLicense:MITStargazers:9Issues:2Issues:0

tiny-asic-4bit-matrix-mul

Tiny matrix multiplication ASIC with 4-bit math

Language:VerilogLicense:Apache-2.0Stargazers:3Issues:2Issues:0
Language:Jupyter NotebookLicense:UnlicenseStargazers:2Issues:0Issues:0

JsLabs

make and store pages in the url, using url vars

Language:JavaScriptLicense:MITStargazers:2Issues:2Issues:0

TransformerMath

Can transformers learn math, like patterns?

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

2024-Swerve-concept

Describing Swerve functionality, mockup math

Language:PythonStargazers:1Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

NCPT-Lilith

A retrain of the old nanogpt, but with the lilith optimizer

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

todo-fbla

the todo fbla project

Language:HTMLLicense:MITStargazers:1Issues:2Issues:0