Woosuk Kwon (WoosukKwon)

Company: University of California, Berkeley

Location: Berkeley, CA

Home Page: https://woosuk.me

Twitter: @woosuk_k

Woosuk Kwon's repositories

retraining-free-pruning

[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python, License: Apache-2.0, Stargazers: 6, Issues: 0

torch-xla

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Language: C++, License: NOASSERTION, Stargazers: 1, Issues: 0