Bingchen Zhao (DTennant)

DTennant

Geek Repo

Location:Shanghai

Home Page:bzhao.me

Github PK Tool:Github PK Tool

Bingchen Zhao's starred repositories

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:26357Issues:202Issues:188

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:23487Issues:160Issues:3670

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:20123Issues:199Issues:108

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:17217Issues:204Issues:39

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:12693Issues:108Issues:194

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language:SystemVerilogStargazers:6281Issues:56Issues:19

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonLicense:MITStargazers:3070Issues:23Issues:29

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3033Issues:25Issues:120

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:2783Issues:27Issues:854

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

auto-code-rover

A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 15.95% tasks in full SWE-bench

Language:PythonLicense:NOASSERTIONStargazers:2232Issues:23Issues:28

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1179Issues:27Issues:86

OSWorld

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Language:PythonLicense:Apache-2.0Stargazers:988Issues:24Issues:15

Xwin-LM

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:848Issues:10Issues:26

agent-protocol

Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building agents.

Language:PythonLicense:MITStargazers:811Issues:12Issues:39

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:783Issues:18Issues:60

OpenELM

Evolution Through Large Models

Language:PythonLicense:MITStargazers:652Issues:25Issues:11

vec2text

utilities for decoding deep representations (like sentence embeddings) back to text

Language:PythonLicense:NOASSERTIONStargazers:612Issues:13Issues:38

searchformer

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:262Issues:3Issues:0

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language:PythonLicense:MITStargazers:157Issues:7Issues:28
Language:PythonLicense:Apache-2.0Stargazers:155Issues:4Issues:6

frequency_determines_performance

Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance"

Language:Jupyter NotebookLicense:MITStargazers:52Issues:0Issues:0

llm-scheduling-artifact

Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“

Language:PythonLicense:Apache-2.0Stargazers:32Issues:3Issues:1

SEED

ICLR2024 paper on Continual Learning

Language:PythonLicense:MITStargazers:25Issues:2Issues:4

FairCLIP

[CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning

Language:Jupyter NotebookLicense:MITStargazers:23Issues:1Issues:1

CMS

[CVPR'24] Official PyTorch implementation of Contrastive Mean-Shift Learning for Generalized Category Discovery

SPTNet

The official repository for ICLR2024 paper "SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning"

Language:PythonLicense:NOASSERTIONStargazers:16Issues:0Issues:0
Language:PythonLicense:MITStargazers:15Issues:0Issues:0
Language:TeXStargazers:3Issues:0Issues:0