nyunAI

nyunAI's repositories

Language: Python | License: AGPL-3.0 | Stargazers: 60 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 50 | Issues: 3 | Issues: 0
Language: Python | Stargazers: 4 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

AQLM

Official PyTorch repository for "Extreme Compression of Large Language Models via Additive Quantization": https://arxiv.org/pdf/2401.06118.pdf

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

FLAP

Patch for Grouped Query Attention

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

nyuntam-docs

This is the official documentation for nyuntam

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0