Dicardo Xue (DicardoX)

Company: Shanghai Jiao Tong University

Dicardo Xue's starred repositories

FlexTensor

Automatic Schedule Exploration and Optimization Framework for Tensor Computations

Language: Python · License: MIT · Stars: 174 · Issues: 0

MAGIS

MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)

Language: Python · License: MIT · Stars: 34 · Issues: 0

TensorRT

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Language: Python · License: BSD-3-Clause · Stars: 2483 · Issues: 0
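
Since this is the Torch-TensorRT frontend, a short sketch of its documented `torch_tensorrt.compile` entry point may help; exact options (e.g. `enabled_precisions`) vary across releases, so treat this as an assumption-laden example rather than canonical usage.

```python
# A sketch of Torch-TensorRT compilation, assuming the documented
# `torch_tensorrt.compile` API; option names may differ across releases.
import torch
import torch_tensorrt

model = torch.nn.Sequential(
    torch.nn.Linear(128, 256),
    torch.nn.ReLU(),
    torch.nn.Linear(256, 64),
).eval().cuda()

# Compile for a fixed input shape; FP16 kernels enabled as an example.
trt_model = torch_tensorrt.compile(
    model,
    inputs=[torch_tensorrt.Input((8, 128), dtype=torch.float32)],
    enabled_precisions={torch.float16},
)

x = torch.randn(8, 128, device="cuda")
print(trt_model(x).shape)  # torch.Size([8, 64])
```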

brainstorm

Compiler for Dynamic Neural Networks

Language: Python · Stars: 42 · Issues: 0

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python · License: Apache-2.0 · Stars: 1770 · Issues: 0
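
A minimal sketch of the FP8 path, assuming the documented `transformer_engine.pytorch` names (`te.Linear`, `te.fp8_autocast`); FP8 execution itself requires Hopper- or Ada-class hardware.

```python
# A sketch of TransformerEngine's FP8 path, assuming the documented
# `transformer_engine.pytorch` API; requires a Hopper/Ada GPU.
import torch
import transformer_engine.pytorch as te

layer = te.Linear(1024, 1024, bias=True).cuda()
x = torch.randn(16, 1024, device="cuda")

# Matmuls inside this context run in FP8 with TE-managed scaling.
with te.fp8_autocast(enabled=True):
    y = layer(x)
print(y.shape)  # torch.Size([16, 1024])
```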

FTPipe

FTPipe and related pipeline model parallelism research.

Language: Python · Stars: 41 · Issues: 0

fairscale

PyTorch extensions for high performance and large scale training.

Language: Python · License: NOASSERTION · Stars: 3129 · Issues: 0

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language: Python · License: BSD-3-Clause · Stars: 3867 · Issues: 0

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language: Python · License: MIT · Stars: 4525 · Issues: 0
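
A minimal sketch following the project's README-style `TransformerWrapper` + `Decoder` interface; the hyperparameters here are arbitrary.

```python
# A sketch following x-transformers' README-style API.
import torch
from x_transformers import TransformerWrapper, Decoder

model = TransformerWrapper(
    num_tokens=20000,      # vocabulary size
    max_seq_len=1024,
    attn_layers=Decoder(dim=512, depth=6, heads=8),
)

tokens = torch.randint(0, 20000, (1, 1024))
logits = model(tokens)     # (1, 1024, 20000)
```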

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python · License: NOASSERTION · Stars: 8283 · Issues: 0
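
A minimal sketch of the library's `memory_efficient_attention` operator, assuming the documented `(batch, seq_len, heads, head_dim)` tensor layout.

```python
# A sketch of xformers' memory-efficient attention operator; the
# (batch, seq_len, heads, head_dim) layout follows the library's docs.
import torch
from xformers.ops import memory_efficient_attention

q = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)

# Computes softmax(QK^T / sqrt(d)) V without materializing the full
# attention matrix, in the spirit of FlashAttention.
out = memory_efficient_attention(q, k, v)
print(out.shape)  # (2, 1024, 8, 64)
```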

xv6-labs-2022-solutions

Solutions and explanations for the MIT 6.828 (6.S081 / 6.1810) xv6-labs-2022 labs.

Language: C · Stars: 101 · Issues: 0

corenet

CoreNet: A library for training deep neural networks

Language: Python · License: NOASSERTION · Stars: 6904 · Issues: 0

paxml

Pax is a JAX-based machine learning framework for training large-scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry-leading model FLOPs utilization rates.

Language: Python · License: Apache-2.0 · Stars: 443 · Issues: 0

YHs_Sample

Yinghan's Code Sample

Language: Cuda · License: GPL-3.0 · Stars: 267 · Issues: 0

LLaMA-Megatron

A LLaMA1/LLaMA2 Megatron implementation.

Language: Python · License: Apache-2.0 · Stars: 26 · Issues: 0

gradient-checkpointing

Make huge neural nets fit in memory

Language: Python · License: MIT · Stars: 2689 · Issues: 0
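
The repo's idea, recomputing activations in the backward pass instead of storing them, is also available natively in PyTorch; the sketch below uses `torch.utils.checkpoint` to illustrate the technique and is not this repository's own API.

```python
# Not this repository's API: the same recomputation idea via PyTorch's
# built-in torch.utils.checkpoint. Activations inside the checkpointed
# segment are dropped after forward and recomputed during backward,
# trading extra compute for memory.
import torch
from torch.utils.checkpoint import checkpoint

block = torch.nn.Sequential(
    torch.nn.Linear(512, 512), torch.nn.ReLU(),
    torch.nn.Linear(512, 512), torch.nn.ReLU(),
)

x = torch.randn(32, 512, requires_grad=True)
y = checkpoint(block, x, use_reentrant=False)  # forward without saving activations
y.sum().backward()                             # block re-runs here to produce grads
```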

Megatron-LM

Ongoing research training transformer models at scale

Language: Python · License: NOASSERTION · Stars: 9813 · Issues: 0
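
Megatron's core tensor-parallel trick is splitting weight matrices across GPUs; below is a single-process numeric sketch of the column-parallel linear layer from the Megatron-LM paper, not Megatron's actual code.

```python
# Not Megatron's code: a single-process numeric sketch of the
# column-parallel linear layer from the Megatron-LM paper. The weight
# matrix A is split by columns across "ranks"; each rank computes its
# shard and an all-gather reassembles the full output.
import torch

torch.manual_seed(0)
X = torch.randn(4, 8)          # activations
A = torch.randn(8, 6)          # full weight matrix

A0, A1 = A[:, :3], A[:, 3:]    # rank 0 / rank 1 shards
Y0, Y1 = X @ A0, X @ A1        # independent per-rank matmuls

Y = torch.cat([Y0, Y1], dim=1) # stands in for the all-gather
assert torch.allclose(Y, X @ A)
```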

veScale

A PyTorch Native LLM Training Framework

Language: Python · License: Apache-2.0 · Stars: 561 · Issues: 0

grok-1

Grok open release

Language: Python · License: Apache-2.0 · Stars: 49391 · Issues: 0

torchgpipe

A GPipe implementation in PyTorch

Language: Python · License: BSD-3-Clause · Stars: 796 · Issues: 0
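
A minimal sketch of the documented `GPipe` wrapper: balance layers across partitions and pipeline micro-batches ("chunks"); it assumes CUDA devices are available for the partitions.

```python
# A sketch of torchgpipe's documented GPipe wrapper; assumes CUDA
# devices are available for the partitions.
import torch
from torch import nn
from torchgpipe import GPipe

model = nn.Sequential(
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
)

# Two partitions of two layers each; every minibatch is split into
# 4 micro-batches that flow through the pipeline concurrently.
model = GPipe(model, balance=[2, 2], chunks=4)

x = torch.randn(16, 64).to(model.devices[0])  # input on the first partition
out = model(x)                                # output on the last partition
```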

LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

Stars: 556 · Issues: 0

TurboTransformers

A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.

Language: C++ · License: NOASSERTION · Stars: 1464 · Issues: 0

FasterTransformer

Transformer related optimization, including BERT, GPT

Language: C++ · License: Apache-2.0 · Stars: 5747 · Issues: 0

pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Language: Python · License: MIT · Stars: 4818 · Issues: 0
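
A usage sketch via the package's documented `thop.profile` entry point; the torchvision model is just an arbitrary example.

```python
# A sketch of pytorch-OpCounter usage via its documented `thop.profile`
# entry point, which returns estimated MACs and parameter counts.
import torch
from thop import profile
from torchvision.models import resnet18

model = resnet18()
dummy_input = torch.randn(1, 3, 224, 224)

macs, params = profile(model, inputs=(dummy_input,))
print(f"MACs: {macs / 1e9:.2f} G, Params: {params / 1e6:.2f} M")
```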

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language: Cuda · License: Apache-2.0 · Stars: 1074 · Issues: 0

MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Language: Python · License: MIT · Stars: 1013 · Issues: 0
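
A sketch of the zeroth-order (SPSA-style) update the paper describes, estimating the gradient from two forward passes with a shared random perturbation; this illustrates the algorithm rather than the repository's code, and `mezo_step` is a hypothetical helper.

```python
# Not the repository's code: a sketch of the zeroth-order update from
# the MeZO paper. `mezo_step` is a hypothetical helper: it estimates
# the gradient from two forward passes with a shared perturbation z
# and never calls backward().
import torch

def mezo_step(params, loss_fn, eps=1e-3, lr=5e-3, seed=0):
    # MeZO regenerates z from the seed instead of storing it, which is
    # what keeps memory at inference level; storing zs here is for brevity.
    torch.manual_seed(seed)
    zs = [torch.randn_like(p) for p in params]
    for p, z in zip(params, zs):
        p.add_(eps * z)                 # theta + eps*z
    loss_plus = loss_fn()
    for p, z in zip(params, zs):
        p.sub_(2 * eps * z)             # theta - eps*z
    loss_minus = loss_fn()
    grad_scale = (loss_plus - loss_minus) / (2 * eps)  # projected gradient
    for p, z in zip(params, zs):
        p.add_(eps * z)                 # restore theta
        p.sub_(lr * grad_scale * z)     # SGD step along z

# Toy usage: fit y = 2x with forward passes only.
w = torch.tensor([0.0])
x, y = torch.tensor([3.0]), torch.tensor([6.0])
for step in range(300):
    mezo_step([w], lambda: ((w * x - y) ** 2).item(), seed=step)
print(w)  # converges toward tensor([2.0])
```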