Zeyu Li (galeselee)

galeselee

Geek Repo

Company:Guang zhou, China

Location:Guang Zhou

Home Page:zeyuli.cn

Github PK Tool:Github PK Tool


Organizations
pkusc

Zeyu Li's repositories

Language:CSSLicense:GPL-2.0Stargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Awesome_LLM_System-PaperList

Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on inference acceleration, and related works will be gradually added in the future. Welcome contributions!

Stargazers:157Issues:0Issues:0

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

DVFS_PaperList

Energy is a very noticable topic. Dynaimc Voltage and Frequency Scaling is a technique for CPU and GPU power consumption. Here is a paperlist of DVFS and power consumption.

Stargazers:1Issues:0Issues:0

llama-models

Utilities intended for use with Llama models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

galeselee.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tinyllm

FlexLLM is a flexsible and tiny LLM Serving framework. And it is a personal customization from lightllm

Language:PythonStargazers:0Issues:0Issues:0

VPTQ

VPTQ, A Flexible and Extreme low-bit quantization algorithm

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:CStargazers:0Issues:0Issues:0

sarathi-serve

A low-latency & high-throughput serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

License:MITStargazers:0Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

License:NOASSERTIONStargazers:0Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

perf-book

The book "Performance Analysis and Tuning on Modern CPU"

License:CC0-1.0Stargazers:0Issues:0Issues:0

galeselee

The description card

Stargazers:0Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

License:MITStargazers:0Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

License:NOASSERTIONStargazers:0Issues:0Issues:0

flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

License:Apache-2.0Stargazers:0Issues:0Issues:0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

HOSCF

HOSCF: EFFICIENT DECOUPLING ALGORITHMS FOR FINDING THE1 BEST RANK-ONE APPROXIMATION OF HIGHER-ORDER TENSORS

Language:C++Stargazers:2Issues:0Issues:0

CutlassHelloWorld

This is a repo for Cutlass learning.

Stargazers:1Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

DALLE-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

License:MITStargazers:0Issues:0Issues:0