Beast code in Giters

Genghan Zhang's repositories

CS224N-Spring2024-DFP-Student-Handout

Starter Code for Default Final Project, Spring 2024

Language: Python | License: Apache-2.0 | 100

segScan

Cooperative group on segScan

Language: Cuda | License: Apache-2.0 | 1 10

auto-compile

Language: Python | 000

Byte-Flexgen

Language: Python | License: Apache-2.0 | 000

ByteEngine

An LLM engine based on ByteTransformer.

Language: C++ | License: Apache-2.0 | 000

ByteTransformer

Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052

Language: C++ | License: Apache-2.0 | 000

BytetransformerX

Language: C++ | License: Apache-2.0 | 000

ChatGLM-X

ChatGLM with xformers

Language: Python | License: Apache-2.0 | 000

compiler-and-arch

A list of tutorials, papers, talks, and open-source projects on emerging compilers and architectures

000

ctf

Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays

Language: C++ | License: NOASSERTION | 000

cutlass

CUDA Templates for Linear Algebra Subroutines

Language: C++ | License: NOASSERTION | 000

dejavu_profile

Profiling of Deja Vu kernels

Language: Python | 000

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++ | License: Apache-2.0 | 000

FlameGraph

Stack trace visualizer

Language: Perl | 000

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | 000

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language: Cuda | License: Apache-2.0 | 000

GLM-demo

Codebase for ChatGLM-6B demo.

Language: Python | License: MIT | 000

googletest

GoogleTest - Google Testing and Mocking Framework

License: BSD-3-Clause | 000

MyPicBed

This is my picbed

000

ppl-website

Language: HTML | License: NOASSERTION | 000

pyllama

LLaMA: Open and Efficient Foundation Language Models

License: GPL-3.0 | 000

sam

Language: Python | License: MIT | 000

splatt

The Surprisingly ParalleL spArse Tensor Toolkit.

Language: C | License: MIT | 000

stk

License: Apache-2.0 | 000

taco

The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

Language: C++ | License: NOASSERTION | 000

thuthesis

LaTeX Thesis Template for Tsinghua University

Language: TeX | License: LPPL-1.3c | 000

tvm.tl

An extension of TVMScript for writing simple, high-performance GPU kernels with Tensor Cores.

Language: Python | License: Apache-2.0 | 000

Welder

Welder (OSDI 2023), a deep learning compiler

000

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | License: NOASSERTION | 000

zhang677.github.io

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language: JavaScript | License: MIT | 000