zhuzilin

Zilin Zhu's repositories

ring-flash-attention

Ring attention implementation with flash attention

Language:PythonMIT621 12 38

whisper-openvino

openvino version of openai/whisper

Language:Jupyter NotebookMIT164 60

faster-nougat

Implementation of nougat that focuses on processing pdf locally.

Language:Python75 4 1

pdf-with-its-own-md5

A PDF template that contains its own MD5!

Language:TeX39 30

es

A JavaScript interpreter from scratch, supporting ES5 syntax.

Language:C++AGPL-3.027 5 2

chatgpt-desktop

Desktop version of ChatGPT, support manually set cookie

Language:JavaScriptMIT17 2 1

vllm-group

Language:PythonMIT10 10

aqt-pytorch

Language:Python7 20

wandb-discord-bot

A discord bot for monitoring wandb project and runs.

Language:JavaScript6 2 1

blog

my blog~

Language:JavaScriptMIT2 30

llama

Inference code for LLaMA models

Language:PythonGPL-3.02 10

torchrec_mapper

Language:C++2 30

zhuzilin

2 30

electron-fc

A electron based famicom(NES) emulator

Language:JavaScriptMIT1 20

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause1 10

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonApache-2.0100

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language:PythonApache-2.0100

triton

Development repository for the Triton language and compiler

Language:C++MIT1 10

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT100

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Apache-2.0100

base64-img

Language:Python020

bun

Incredibly fast JavaScript runtime, bundler, transpiler and package manager – all in one.

Language:Zig010

FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

Language:C++NOASSERTION010

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Language:PythonApache-2.0020

grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Language:CudaApache-2.0000

instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Language:PythonApache-2.0010

megablocks

Language:PythonApache-2.0000

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:C++NOASSERTION020

torchrec

Pytorch domain library for recommendation systems

Language:PythonBSD-3-Clause010

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Apache-2.0000