Zian(Andy) Zheng (Orion-Zheng)

Location: Singapore

Home Page: zheng-zian-andy.com

Twitter: @zian_andy_zheng

Zian(Andy) Zheng's starred repositories

Language: Python | License: NOASSERTION | Stargazers: 77 | Issues: 0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4153 | Issues: 0

Skywork-MoE

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

Stargazers: 112 | Issues: 0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ | License: MIT | Stargazers: 7571 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 294 | Issues: 0

MixEval

The official evaluation suite and dynamic data release for MixEval.

Language: Python | Stargazers: 155 | Issues: 0
Language: Python | Stargazers: 275 | Issues: 0

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: NOASSERTION | Stargazers: 26404 | Issues: 0

pytorch-learning

Learning notes on the PyTorch source code

Stargazers: 23 | Issues: 0
Language: Python | Stargazers: 222 | Issues: 0

web

React web interface for the OpenDota platform

Language: JavaScript | License: MIT | Stargazers: 1074 | Issues: 0

dota2py

Python tools for Dota 2

Language: Protocol Buffer | License: MIT | Stargazers: 115 | Issues: 0

compendium

Dota 2 replay knowledge in book form.

Stargazers: 23 | Issues: 0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | Stargazers: 1709 | Issues: 0

rho

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

License: MIT | Stargazers: 271 | Issues: 0

dota2-clarity

Custom console scripts for Dota 2.

Language: Python | Stargazers: 88 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 19861 | Issues: 0

DeepLearningSystem

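AI Infra mainly refers to the foundational infrastructure of AI, including AI chips, AI compilers, AI inference and training frameworks, and other low-level full-stack AI technologies.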

Stargazers: 97 | Issues: 0

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language: Python | License: Apache-2.0 | Stargazers: 937 | Issues: 0

llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Language: Jupyter Notebook | License: MIT | Stargazers: 1019 | Issues: 0

llm-compressive

Longitudinal Evaluation of LLMs via Data Compression

Language: Python | Stargazers: 24 | Issues: 0

Awesome-LLM-Inference

📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

License: GPL-3.0 | Stargazers: 1841 | Issues: 0

dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Language: Python | License: NOASSERTION | Stargazers: 2473 | Issues: 0

Neural-Network-Parameter-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Language: Python | Stargazers: 782 | Issues: 0

G_VBSM_Dataset_Condensation

[CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)

Language: Python | Stargazers: 17 | Issues: 0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8645 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 34305 | Issues: 0

dora

Implementation of DoRA

Language: Python | License: MIT | Stargazers: 271 | Issues: 0

zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Language: Jupyter Notebook | License: MIT | Stargazers: 2620 | Issues: 0

onboarding

Onboarding guide to Jimmy Lin's research group at the University of Waterloo

Stargazers: 22 | Issues: 0