Dicardo Xue (DicardoX)

Company: Shanghai Jiao Tong University

Dicardo Xue's starred repositories

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Language: Python | License: MIT | Stargazers: 73959 | Issues: 674 | Issues: 3417

grpc

The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

Language: C++ | License: Apache-2.0 | Stargazers: 41489 | Issues: 1359 | Issues: 11394

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 32842 | Issues: 476 | Issues: 18313

chatgpt-web

A ChatGPT demo web page built with Express and Vue3

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 25699 | Issues: 221 | Issues: 4207

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python | License: Apache-2.0 | Stargazers: 18529 | Issues: 168 | Issues: 1313

awesome-free-chatgpt

🆓 List of free ChatGPT mirror sites, continuously updated.

Language: Python | License: MIT | Stargazers: 17611 | Issues: 138 | Issues: 722

nvidia-docker

Build and run Docker containers leveraging NVIDIA GPUs

carrot

Free ChatGPT Site List: a collection of many free, easy-to-use ChatGPT mirror sites

pybind11

Seamless operability between C++11 and Python

Language: C++ | License: NOASSERTION | Stargazers: 15360 | Issues: 243 | Issues: 2114

web-llm

High-performance In-browser LLM Inference Engine

Language: TypeScript | License: Apache-2.0 | Stargazers: 12206 | Issues: 113 | Issues: 279

triton

Development repository for the Triton language and compiler

tutorials

PyTorch tutorials.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 8078 | Issues: 178 | Issues: 784

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 8047 | Issues: 87 | Issues: 1751

xla

A machine learning compiler for GPUs, CPUs, and ML accelerators

Language: C++ | License: Apache-2.0 | Stargazers: 2543 | Issues: 40 | Issues: 318

clusterdata

Cluster data collected from production clusters in Alibaba for cluster management research

Language: Jupyter Notebook | Stargazers: 1554 | Issues: 77 | Issues: 176

nccl-tests

NCCL Tests

Language: Cuda | License: BSD-3-Clause | Stargazers: 798 | Issues: 27 | Issues: 209

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Language: Python | License: Apache-2.0 | Stargazers: 791 | Issues: 11 | Issues: 69

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Language: Python | License: MIT | Stargazers: 707 | Issues: 14 | Issues: 61

kernel_tuner

Kernel Tuner

Language: Python | License: Apache-2.0 | Stargazers: 267 | Issues: 9 | Issues: 104

hivedscheduler

Kubernetes Scheduler for Deep Learning

Language: Go | License: MIT | Stargazers: 252 | Issues: 27 | Issues: 9

Language: Jupyter Notebook | License: CC-BY-4.0 | Stargazers: 170 | Issues: 6 | Issues: 5

habitat

🔮 Execution time predictions for deep neural network training iterations across different GPUs.

Language: Python | License: Apache-2.0 | Stargazers: 54 | Issues: 5 | Issues: 10

ElasticFlow

Artifacts for our ASPLOS'23 paper ElasticFlow

Language: Python | License: Apache-2.0 | Stargazers: 52 | Issues: 1 | Issues: 3

HeliosData

Helios Traces from SenseTime

elasticflow-traces

Integrated Training Platform (ITP) traces used in ElasticFlow paper.

License: MIT | Stargazers: 28 | Issues: 4 | Issues: 0

ComScribe

ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.

Language: C++ | License: BSD-3-Clause | Stargazers: 25 | Issues: 0 | Issues: 1

HeliosArtifact

HeliosArtifact

Language: Jupyter Notebook | License: MIT | Stargazers: 17 | Issues: 0 | Issues: 0

Information-Retrieval

Programming Assignments done using Python

Language: Python | Stargazers: 13 | Issues: 2 | Issues: 0

Liquid

Intelligent Resource Requirement Estimation and Scheduling for Deep Learning Jobs on Distributed GPU Clusters

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 3 | Issues: 0