Zilin Zhu's repositories
ring-flash-attention
Ring attention implementation with flash attention
whisper-openvino
openvino version of openai/whisper
pdf-with-its-own-md5
A PDF template that contains its own MD5!
chatgpt-desktop
Desktop version of ChatGPT, support manually set cookie
pytorch-malloc
An external memory allocator example for PyTorch.
wandb-discord-bot
A discord bot for monitoring wandb project and runs.
electron-fc
A electron based famicom(NES) emulator
DeeperSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
DeepSpeedExamples
Example models using DeepSpeed
FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
flash-attention
Fast and memory-efficient exact attention
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
qiskit-translations
Home of Qiskit documentation translations
triton
Development repository for the Triton language and compiler