xueyuuu

Company: Dept. of Computer Science, The University of Hong Kong

Location: Hong Kong

Home Page: https://www.researchgate.net/profile/Xueyu-Wu-2

xueyuuu's starred repositories

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 54423 | Issues: 514 | Issues: 936

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38394 | Issues: 384 | Issues: 1619

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 33990 | Issues: 341 | Issues: 2655

google-research

Google Research

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 33492 | Issues: 748 | Issues: 1226

pandora

Pandora, a ChatGPT that helps you breathe smoothly.

Language: Python | License: GPL-2.0 | Stargazers: 20735 | Issues: 129 | Issues: 1138

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 12578 | Issues: 117 | Issues: 914

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 11874 | Issues: 97 | Issues: 435

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language: C++ | License: Apache-2.0 | Stargazers: 10326 | Issues: 157 | Issues: 3583

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | Stargazers: 9089 | Issues: 109 | Issues: 81

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Language: Python | License: BSD-3-Clause | Stargazers: 8205 | Issues: 102 | Issues: 1164

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Python | License: Apache-2.0 | Stargazers: 7505 | Issues: 112 | Issues: 288

Language: Python | License: Apache-2.0 | Stargazers: 7029 | Issues: 66 | Issues: 68

oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

Language: C++ | License: Apache-2.0 | Stargazers: 5810 | Issues: 145 | Issues: 962

mindspore

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

Language: C++ | License: Apache-2.0 | Stargazers: 4174 | Issues: 149 | Issues: 255

AI-System

System for AI Education Resource.

Language: Python | License: CC-BY-4.0 | Stargazers: 3227 | Issues: 68 | Issues: 48

DrakeTyporaTheme

Twelve theme styles - Material Google JetBrains Vue Juejin Purple Ayu Dark

Language: CSS | License: MIT | Stargazers: 2782 | Issues: 21 | Issues: 154

line_profiler

Line-by-line profiling for Python

Language: Python | License: NOASSERTION | Stargazers: 2593 | Issues: 15 | Issues: 96

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2044 | Issues: 34 | Issues: 79

fastmoe

A fast MoE impl for PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1488 | Issues: 13 | Issues: 113

CodeClass

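Easier to read online: https://coderlemon.com/ A computer programming learning roadmap plus learning resources: the most comprehensive programming classroom, from beginner to architect. Everything you need to master about programming is here, and it is continuously updated!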

libai

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Language: Python | License: Apache-2.0 | Stargazers: 381 | Issues: 42 | Issues: 79

torch-quiver

PyTorch Library for Low-Latency, High-Throughput Graph Learning on GPUs.

Language: Python | License: Apache-2.0 | Stargazers: 285 | Issues: 12 | Issues: 60

COSMA

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

Language: C++ | License: BSD-3-Clause | Stargazers: 185 | Issues: 21 | Issues: 67

PoSE

Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Length (ICLR 2024)

Language: Python | License: MIT | Stargazers: 182 | Issues: 5 | Issues: 16

Language: Python | License: Apache-2.0 | Stargazers: 44 | Issues: 6 | Issues: 2

PyG-OGB-Tricks

Bags of Tricks in OGB (node classification) with GCNs.

Language: Python | License: MIT | Stargazers: 35 | Issues: 1 | Issues: 5

PipeGCN

[ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Youjie Li, Cameron R. Wolfe, Anastasios Kyrillidis, Nam Sung Kim, Yingyan Lin

Language: Python | License: MIT | Stargazers: 27 | Issues: 3 | Issues: 1

PyTorch-ParameterServer

An implementation of parameter server framework in PyTorch RPC.