Dicardo Xue (DicardoX)

DicardoX

Geek Repo

Company:Shanghai Jiao Tong University

Github PK Tool:Github PK Tool

Dicardo Xue's starred repositories

Oobleck

A resilient distributed training framework

Language:PythonLicense:Apache-2.0Stargazers:75Issues:0Issues:0

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Language:PythonLicense:MITStargazers:75137Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:25699Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:8047Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:12412Issues:0Issues:0

carrot

Free ChatGPT Site List 这儿为你准备了众多免费好用的ChatGPT镜像站点

Stargazers:16855Issues:0Issues:0

pybind11

Seamless operability between C++11 and Python

Language:C++License:NOASSERTIONStargazers:15360Issues:0Issues:0

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Language:PythonLicense:Apache-2.0Stargazers:791Issues:0Issues:0

habitat

🔮 Execution time predictions for deep neural network training iterations across different GPUs.

Language:PythonLicense:Apache-2.0Stargazers:54Issues:0Issues:0

xla

A machine learning compiler for GPUs, CPUs, and ML accelerators

Language:C++License:Apache-2.0Stargazers:2543Issues:0Issues:0

web-llm

High-performance In-browser LLM Inference Engine

Language:TypeScriptLicense:Apache-2.0Stargazers:12208Issues:0Issues:0

ComScribe

ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.

Language:C++License:BSD-3-ClauseStargazers:25Issues:0Issues:0

nccl-tests

NCCL Tests

Language:CudaLicense:BSD-3-ClauseStargazers:798Issues:0Issues:0

kernel_tuner

Kernel Tuner

Language:PythonLicense:Apache-2.0Stargazers:267Issues:0Issues:0

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:18529Issues:0Issues:0

chatgpt-web

用 Express 和 Vue3 搭建的 ChatGPT 演示网页

Language:VueLicense:MITStargazers:31195Issues:0Issues:0

awesome-free-chatgpt

🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.

Language:PythonLicense:MITStargazers:17611Issues:0Issues:0

grpc

The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

Language:C++License:Apache-2.0Stargazers:41490Issues:0Issues:0

elasticflow-traces

Integrated Training Platform (ITP) traces used in ElasticFlow paper.

License:MITStargazers:28Issues:0Issues:0
Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:170Issues:0Issues:0

Information-Retrieval

Programming Assignments done using Python

Language:PythonStargazers:13Issues:0Issues:0

nvidia-docker

Build and run Docker containers leveraging NVIDIA GPUs

License:Apache-2.0Stargazers:17183Issues:0Issues:0

ElasticFlow

Artifacts for our ASPLOS'23 paper ElasticFlow

Language:PythonLicense:Apache-2.0Stargazers:52Issues:0Issues:0

tutorials

PyTorch tutorials.

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:8078Issues:0Issues:0

HeliosArtifact

HeliosArtifact

Language:Jupyter NotebookLicense:MITStargazers:17Issues:0Issues:0

HeliosData

Helios Traces from SenseTime

License:CC-BY-4.0Stargazers:46Issues:0Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:32841Issues:0Issues:0

clusterdata

cluster data collected from production clusters in Alibaba for cluster management research

Language:Jupyter NotebookStargazers:1554Issues:0Issues:0

hivedscheduler

Kubernetes Scheduler for Deep Learning

Language:GoLicense:MITStargazers:252Issues:0Issues:0

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Language:PythonLicense:MITStargazers:707Issues:0Issues:0