xueyuuu

Company: Dept. of Computer Science, The University of Hong Kong

Location: Hong Kong

Home Page: https://www.researchgate.net/profile/Xueyu-Wu-2

xueyuuu's starred repositories

llama

Inference code for Llama models

Language: Python, License: NOASSERTION, Stargazers: 53976, Issues: 514, Issues: 925

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python, License: Apache-2.0, Stargazers: 38238, Issues: 381, Issues: 1592

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python, License: Apache-2.0, Stargazers: 33473, Issues: 340, Issues: 2616

google-research

Google Research

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 33269, Issues: 749, Issues: 1199

pandora

Pandora, a ChatGPT that helps you breathe smoothly.

Language: Python, License: GPL-2.0, Stargazers: 20758, Issues: 129, Issues: 1138

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

flash-attention

Fast and memory-efficient exact attention

Language: Python, License: BSD-3-Clause, Stargazers: 11652, Issues: 104, Issues: 835

mamba

Mamba SSM architecture

Language: Python, License: Apache-2.0, Stargazers: 11247, Issues: 99, Issues: 370

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language: C++, License: Apache-2.0, Stargazers: 10062, Issues: 156, Issues: 3518

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python, License: Apache-2.0, Stargazers: 9061, Issues: 109, Issues: 80

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Language: Python, License: BSD-3-Clause, Stargazers: 8136, Issues: 102, Issues: 1159

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Python, License: Apache-2.0, Stargazers: 7396, Issues: 112, Issues: 287

Language: Python, License: Apache-2.0, Stargazers: 6969, Issues: 67, Issues: 65

oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

Language: C++, License: Apache-2.0, Stargazers: 5779, Issues: 146, Issues: 960

mindspore

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

Language: C++, License: Apache-2.0, Stargazers: 4113, Issues: 149, Issues: 252

AI-System

System for AI Education Resource.

Language: Python, License: CC-BY-4.0, Stargazers: 3104, Issues: 67, Issues: 48

DrakeTyporaTheme

Twelve theme styles - Material Google JetBrains Vue Juejin Purple Ayu Dark

Language: CSS, License: MIT, Stargazers: 2712, Issues: 21, Issues: 153

line_profiler

Line-by-line profiling for Python

Language: Python, License: NOASSERTION, Stargazers: 2549, Issues: 15, Issues: 94

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 1966, Issues: 34, Issues: 76

fastmoe

A fast MoE impl for PyTorch

Language: Python, License: Apache-2.0, Stargazers: 1448, Issues: 12, Issues: 113

CodeClass

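Easier to read online: https://coderlemon.com/ A computer programming learning roadmap plus learning resources. The most comprehensive programming classroom, from beginner to architect: everything you need to master about programming is here, continuously updated!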

libai

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Language: Python, License: Apache-2.0, Stargazers: 377, Issues: 43, Issues: 79

torch-quiver

PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs.

Language: Python, License: Apache-2.0, Stargazers: 283, Issues: 12, Issues: 60

COSMA

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

Language: C++, License: BSD-3-Clause, Stargazers: 183, Issues: 21, Issues: 67

PoSE

Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Length (ICLR 2024)

Language: Python, License: MIT, Stargazers: 174, Issues: 5, Issues: 14

Language: Python, License: Apache-2.0, Stargazers: 42, Issues: 6, Issues: 2

PyG-OGB-Tricks

Bags of Tricks in OGB (node classification) with GCNs.

Language: Python, License: MIT, Stargazers: 35, Issues: 1, Issues: 5

PipeGCN

[ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Youjie Li, Cameron R. Wolfe, Anastasios Kyrillidis, Nam Sung Kim, Yingyan Lin

Language: Python, License: MIT, Stargazers: 26, Issues: 3, Issues: 1

PyTorch-ParameterServer

An implementation of a parameter server framework using PyTorch RPC.