Zilin Zhu's repositories
ring-flash-attention
Ring attention implementation with flash attention
whisper-openvino
openvino version of openai/whisper
faster-nougat
Implementation of nougat that focuses on processing pdf locally.
pdf-with-its-own-md5
A PDF template that contains its own MD5!
chatgpt-desktop
Desktop version of ChatGPT, support manually set cookie
wandb-discord-bot
A discord bot for monitoring wandb project and runs.
electron-fc
A electron based famicom(NES) emulator
flash-attention
Fast and memory-efficient exact attention
scattermoe
Triton-based implementation of Sparse Mixture of Experts.
grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
Language:CudaApache-2.0000
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Language:PythonApache-2.0000
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Apache-2.0000