Zian (Andy) Zheng's starred repositories
alignment-handbook
Robust recipes to align language models with human and AI preferences
Skywork-MoE
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
pytorch-learning
Learning notes from studying the PyTorch source code
compendium
Dota 2 replay knowledge in book form.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
dota2-clarity
Custom console scripts for Dota 2.
DeepLearningSystem
AI Infra refers to the foundational infrastructure for AI, including AI chips, AI compilers, AI inference and training frameworks, and other full-stack low-level AI technologies.
llm-colosseum
Benchmark LLMs by having them fight in Street Fighter 3! A new way to evaluate the quality of an LLM
llm-compressive
Longitudinal Evaluation of LLMs via Data Compression
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
Neural-Network-Parameter-Diffusion
We introduce a novel approach to parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters.
G_VBSM_Dataset_Condensation
[CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)
onboarding
Onboarding guide to Jimmy Lin's research group at the University of Waterloo