Charles O. Goddard (cg123)

cg123

Geek Repo

Location:Los Angeles, CA

Home Page:goddard.blog

Github PK Tool:Github PK Tool

Charles O. Goddard's starred repositories

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:14147Issues:125Issues:583

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:12464Issues:87Issues:569

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:11584Issues:98Issues:402

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4064Issues:47Issues:251

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonLicense:Apache-2.0Stargazers:3649Issues:36Issues:90

smoltcp

a smol tcp/ip stack

Language:RustLicense:0BSDStargazers:3641Issues:62Issues:345

functionary

Chat language model that can use tools and interpret the results

Language:PythonLicense:MITStargazers:1217Issues:18Issues:93

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonLicense:Apache-2.0Stargazers:1128Issues:12Issues:311

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language:PythonLicense:Apache-2.0Stargazers:944Issues:8Issues:9

godot_rl_agents

An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents

Language:PythonLicense:MITStargazers:843Issues:25Issues:98

symbolica

A modern computer algebra system which aims to handle expressions with billions of terms.

Language:RustLicense:NOASSERTIONStargazers:407Issues:10Issues:5

pareas

GPU-accelerated compiler

PolyMind

A multimodal, function calling powered LLM webui.

Language:PythonLicense:AGPL-3.0Stargazers:199Issues:4Issues:7

tensorizer

Module, Model, and Tensor Serialization/Deserialization

Language:PythonLicense:MITStargazers:149Issues:23Issues:44

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language:PythonLicense:Apache-2.0Stargazers:147Issues:5Issues:11

BitMat

An efficent implementation of the method proposed in "The Era of 1-bit LLMs"

Language:PythonLicense:Apache-2.0Stargazers:146Issues:6Issues:10

PruneMe

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

zett

Code for Zero-Shot Tokenizer Transfer

tiny-asic-1_58bit-matrix-mul

Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit

Language:VerilogLicense:Apache-2.0Stargazers:92Issues:11Issues:0

MergeMonster

An unsupervised model merging algorithm for Transformers-based language models.

Language:PythonLicense:Apache-2.0Stargazers:91Issues:0Issues:0

tangent_task_arithmetic

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Language:PythonLicense:MITStargazers:72Issues:1Issues:2

TerDiT

TerDiT: Ternary Diffusion Models with Transformers

Language:PythonLicense:MITStargazers:48Issues:1Issues:0

Gymbo

gradient-based symbolic execution engine implemented from scratch

Language:C++License:Apache-2.0Stargazers:35Issues:2Issues:0

tinynarrations

A synthetic story narration dataset to study small audio LMs.

Language:PythonLicense:NOASSERTIONStargazers:27Issues:1Issues:2
Language:PythonStargazers:26Issues:0Issues:0

mergeui

All-in-one UI for merged LLMs in Hugging Face

Language:PythonLicense:Apache-2.0Stargazers:20Issues:3Issues:0

training_free_model_merging

This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).

merging-text-transformers

Code for "Merging Text Transformers from Different Initializations"

Language:PythonLicense:MITStargazers:13Issues:0Issues:0
Language:PythonLicense:MITStargazers:12Issues:0Issues:0
Language:PythonStargazers:4Issues:0Issues:0