wangxin's repositories
Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models with local CPU/GPU deployment
cuda-samples
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
cutlass
CUDA Templates for Linear Algebra Subroutines
EAGLE
Official Implementation of EAGLE
learn-cuda
A complete CUDA tutorial ranging from first GPU programs to advanced asynchronous methods
macOS-QQ-WeChat-API
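An API for macOS that uses QQ and WeChat to get a user's friend list, retrieve chat history, open a chat window with a specified friend, and send arbitrary messages to a specified friend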
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
so-vits-svc
SoftVC VITS Singing Voice Conversion
TigerBot
TigerBot: A multi-language multi-task LLM
TLLM_QMM
TLLM_QMM extracts the quantized kernel implementations from Nvidia's TensorRT-LLM, removes the NVInfer dependency, and exposes an easy-to-use PyTorch module. We modified the dequantization and weight preprocessing to align with popular quantization algorithms such as AWQ and GPTQ, and combined them with new FP8 quantization.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
WeChatExtension-ForMac
Mac WeChat feature extension / WeChat plugin / WeChat assistant (A plugin for Mac WeChat)