Genghan Zhang (zhang677)

Company: Stanford University

Genghan Zhang's repositories

CS224N-Spring2024-DFP-Student-Handout

Starter Code for Default Final Project, Spring 2024

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 0, Issues: 0

segScan

Cooperative group on segScan

Language: Cuda, License: Apache-2.0, Stargazers: 1, Issues: 1, Issues: 0
Language: Python, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

ByteEngine

An LLM engine based on ByteTransformer.

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

ByteTransformer

Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

ChatGLM-X

ChatGLM with xformers

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

compiler-and-arch

A list of tutorials, papers, talks, and open-source projects for emerging compilers and architectures

Stargazers: 0, Issues: 0, Issues: 0

ctf

Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

dejavu_profile

Profiling of Deja Vu kernels

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

FlameGraph

Stack trace visualizer

Language: Perl, Stargazers: 0, Issues: 0, Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language: Cuda, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

GLM-demo

Codebase for ChatGLM-6B demo.

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

googletest

GoogleTest - Google Testing and Mocking Framework

License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

MyPicBed

This is my picbed

Stargazers: 0, Issues: 0, Issues: 0
Language: HTML, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

pyllama

LLaMA: Open and Efficient Foundation Language Models

License: GPL-3.0, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

splatt

The Surprisingly ParalleL spArse Tensor Toolkit.

Language: C, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

taco

The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

thuthesis

LaTeX Thesis Template for Tsinghua University

Language: TeX, License: LPPL-1.3c, Stargazers: 0, Issues: 0, Issues: 0

tvm.tl

An extension of TVMScript to write simple and high-performance GPU kernels with tensorcore.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

Welder

OSDI 2023 Welder, a deep learning compiler

Stargazers: 0, Issues: 0, Issues: 0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

zhang677.github.io

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language: JavaScript, License: MIT, Stargazers: 0, Issues: 0, Issues: 0