Peyton (pprp)

pprp

Geek Repo

Company:Data Science and Analytic Thrust, Information Hub, HKUST(GZ)

Location:GuangZhou

Home Page:https://www.zhihu.com/people/peijieDong

Github PK Tool:Github PK Tool

Peyton's repositories

SimpleCVPaperReading

:smile:博客论文列表:分系列整理

Language:JavaScriptStargazers:384Issues:3Issues:0

Awesome-LLM-Prune

Awesome list for LLM pruning.

PicoNAS

Modularized NAS Framework

Language:PythonLicense:GPL-3.0Stargazers:7Issues:1Issues:7

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

License:MITStargazers:2Issues:0Issues:0

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:1Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Awesome-Mamba-Papers

Awesome Papers related to Mamba.

Stargazers:0Issues:0Issues:0

BitDistiller

A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

corenet

CoreNet: A library for training deep neural networks

License:NOASSERTIONStargazers:0Issues:0Issues:0

DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

License:Apache-2.0Stargazers:0Issues:0Issues:0

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

License:Apache-2.0Stargazers:0Issues:0Issues:0

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

License:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Firefly

Firefly: 大模型训练工具,支持训练Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Stargazers:0Issues:0Issues:0

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

License:Apache-2.0Stargazers:0Issues:0Issues:0

KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Stargazers:0Issues:0Issues:0

llama-recipes

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger

Stargazers:0Issues:0Issues:0

llama.cpp

LLM inference in C/C++

License:MITStargazers:0Issues:0Issues:0

llm-awq

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

llm-kick

[ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing llms: The truth is rarely pure and never simple.

Stargazers:0Issues:0Issues:0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:2Issues:0

pprp.github.io

Personal Academic Page for pprp

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

License:MITStargazers:0Issues:0Issues:0

qllm-eval

Code Repository of Evaluating Quantized Large Language Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

quanto

A pytorch Quantization Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

License:Apache-2.0Stargazers:0Issues:0Issues:0