cg123

followers

following

stars

Los Angeles, CA

Charles O. Goddard's starred repositories

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonMIT14147 125 583

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.012464 87 569

mamba

Mamba SSM architecture

Language:PythonApache-2.011584 98 402

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.04064 47 251

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonApache-2.03649 36 90

smoltcp

a smol tcp/ip stack

Language:Rust0BSD3641 62 345

functionary

Chat language model that can use tools and interpret the results

Language:PythonMIT1217 18 93

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonApache-2.01128 12 311

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language:PythonApache-2.0944 8 9

godot_rl_agents

An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents

Language:PythonMIT843 25 98

symbolica

A modern computer algebra system which aims to handle expressions with billions of terms.

Language:RustNOASSERTION407 10 5

pareas

GPU-accelerated compiler

Language:Futhark298 6 1

PolyMind

A multimodal, function calling powered LLM webui.

Language:PythonAGPL-3.0199 4 7

tensorizer

Module, Model, and Tensor Serialization/Deserialization

Language:PythonMIT149 23 44

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language:PythonApache-2.0147 5 11

BitMat

An efficent implementation of the method proposed in "The Era of 1-bit LLMs"

Language:PythonApache-2.0146 6 10

PruneMe

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Language:Python145 3 9

zett

Code for Zero-Shot Tokenizer Transfer

Language:Python101 2 9

tiny-asic-1_58bit-matrix-mul

Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit

Language:VerilogApache-2.092 110

MergeMonster

An unsupervised model merging algorithm for Transformers-based language models.

Language:PythonApache-2.09100

tangent_task_arithmetic

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Language:PythonMIT72 1 2

TerDiT

TerDiT: Ternary Diffusion Models with Transformers

Language:PythonMIT48 10

Gymbo

gradient-based symbolic execution engine implemented from scratch

Language:C++Apache-2.035 20

tinynarrations

A synthetic story narration dataset to study small audio LMs.

Language:PythonNOASSERTION27 1 2

zaraki-tools

Language:Python2600

mergeui

All-in-one UI for merged LLMs in Hugging Face

Language:PythonApache-2.020 30

training_free_model_merging

This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).

Language:Python17 2 1

merging-text-transformers

Code for "Merging Text Transformers from Different Initializations"

Language:PythonMIT1300

Task-Vector-Merge-Optimzier

Language:PythonMIT1200

YuisekinAI-mergekit

Language:Python400