DTennant

Bingchen Zhao's starred repositories

OpenVoice

Instant voice cloning by MyShell.

Language: Python | MIT | 26357 202 188

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | Apache-2.0 | 23487 160 3670

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | MIT | 20123 199 108

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | NOASSERTION | 17217 204 39

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | MIT | 12693 108 194

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language: SystemVerilog | 6281 56 19

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language: Python | MIT | 3070 23 29

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | Apache-2.0 | 3033 25 120

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | Apache-2.0 | 2783 27 854

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

MIT | 2371 19 51

auto-code-rover

A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 15.95% of tasks in full SWE-bench.

Language: Python | NOASSERTION | 2232 23 28

torchtitan

A native PyTorch Library for large model training

Language: Python | BSD-3-Clause | 1179 27 86

OSWorld

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Language: Python | Apache-2.0 | 988 24 15

Xwin-LM

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Language: Python | 983 37 20

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language: Python | Apache-2.0 | 848 10 26

agent-protocol

Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building agents.

Language: Python | MIT | 811 12 39

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python | Apache-2.0 | 783 18 60

OpenELM

Evolution Through Large Models

Language: Python | MIT | 652 25 11

vec2text

Utilities for decoding deep representations (like sentence embeddings) back to text

Language: Python | NOASSERTION | 612 13 38

searchformer

Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".

Language: Jupyter Notebook | NOASSERTION | 262 30

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language: Python | MIT | 157 7 28

BAdam

Language: Python | Apache-2.0 | 155 4 6

frequency_determines_performance

Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance"

Language: Jupyter Notebook | MIT | 5200

llm-scheduling-artifact

Artifact of OSDI '24 paper, "Llumnix: Dynamic Scheduling for Large Language Model Serving"

Language: Python | Apache-2.0 | 32 3 1

SEED

ICLR2024 paper on Continual Learning

Language: Python | MIT | 25 2 4

FairCLIP

[CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning

Language: Jupyter Notebook | MIT | 23 1 1

CMS

[CVPR'24] Official PyTorch implementation of Contrastive Mean-Shift Learning for Generalized Category Discovery

Language: Python | 23 2 2

SPTNet

The official repository for ICLR2024 paper "SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning"

Language: Python | NOASSERTION | 1600

data4robotics

Language: Python | MIT | 1500

Unicorn-Test

Language: TeX | 300