VatsaDev



Company: free from company

Location: Antarctica

Home Page: vatsadev.github.io

Twitter: @_VatsaDev_


VatsaDev's starred repositories

tiny-asic-4bit-matrix-mul

Tiny matrix multiplication ASIC with 4-bit math

Language: Verilog | License: Apache-2.0 | Stargazers: 3 | Issues: 0

JsLabs

make and store pages in the url, using url vars

Language: JavaScript | License: MIT | Stargazers: 2 | Issues: 0

TPU-Alignment

Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 209 | Issues: 0

Language: Jupyter Notebook | License: Unlicense | Stargazers: 2 | Issues: 0

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | Stargazers: 22418 | Issues: 0

TransformerMath

Can transformers learn math, like patterns?

Language: Python | License: MIT | Stargazers: 2 | Issues: 0

xVal

Repository for code used in the xVal paper

Language: Jupyter Notebook | Stargazers: 107 | Issues: 0

quiet-star

Code for Quiet-STaR

Language: Python | License: Apache-2.0 | Stargazers: 354 | Issues: 0

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49225 | Issues: 0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 29305 | Issues: 0

moondream

tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4647 | Issues: 0

yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.

Language: Python | License: GPL-3.0 | Stargazers: 810 | Issues: 0

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python | License: Apache-2.0 | Stargazers: 5195 | Issues: 0

NCPT-Lilith

A retrain of the old nanogpt, but with the lilith optimizer

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

cramming

Cramming the training of a (BERT-type) language model into limited compute.

Language: Python | License: MIT | Stargazers: 1271 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 35291 | Issues: 0

todo-fbla

the todo fbla project

Language: HTML | License: MIT | Stargazers: 1 | Issues: 0

Lilith

Using the lilith optimizer on nanogpt

Language: Python | License: MIT | Stargazers: 9 | Issues: 0

Language: Python | Stargazers: 1 | Issues: 0

FaceRWKV

Course Project for COMP4471 on RWKV

Language: Jupyter Notebook | Stargazers: 16 | Issues: 0

othello_mamba

Evaluating the Mamba architecture on the Othello game

Language: Python | Stargazers: 39 | Issues: 0

01

The open-source language model computer

Language: Python | License: AGPL-3.0 | Stargazers: 4798 | Issues: 0

quartic-transformer

Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)

Language: Python | License: MIT | Stargazers: 39 | Issues: 0

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language: Python | License: Apache-2.0 | Stargazers: 2559 | Issues: 0

aseprite

Animated sprite editor & pixel art tool (Windows, macOS, Linux)

Language: C++ | Stargazers: 27986 | Issues: 0

galactic

data cleaning and curation for unstructured text

Language: Python | License: Apache-2.0 | Stargazers: 323 | Issues: 0

gptcore

Fast modular code to create and train cutting edge LLMs

Language: Python | License: Apache-2.0 | Stargazers: 61 | Issues: 0

2024-Swerve-concept

Describing Swerve functionality, mockup math

Language: Python | Stargazers: 1 | Issues: 0

mamba.c

Inference of Mamba models in pure C

Language: C | Stargazers: 175 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 3936 | Issues: 0