whutbd's starred repositories

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9730Issues:0Issues:0

learntorch

Tutorials for PyTorch C++

Language:MakefileLicense:MITStargazers:40Issues:0Issues:0

Leetcode

Play Leetcode with different programming language

Language:C++Stargazers:1459Issues:0Issues:0

cuda_code

easy cuda code

Language:CudaStargazers:3Issues:0Issues:0

kimi-free-api

🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。

Language:TypeScriptLicense:GPL-3.0Stargazers:3427Issues:0Issues:0

ngram

The n-gram Language Model

Language:CStargazers:1138Issues:0Issues:0

llama2.cpp

Inference Llama 2 in one file of pure C++

Language:PythonLicense:MITStargazers:72Issues:0Issues:0

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Language:C++License:Apache-2.0Stargazers:2831Issues:0Issues:0
Language:C++Stargazers:943Issues:0Issues:0
Language:C++Stargazers:60Issues:0Issues:0

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonLicense:Apache-2.0Stargazers:608Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:23166Issues:0Issues:0

riven

CPU Memory Compiler and Parallel programing

Language:C++Stargazers:21Issues:0Issues:0

MegRay

A communication library for deep learning

Language:C++License:NOASSERTIONStargazers:50Issues:0Issues:0
Language:CudaStargazers:94Issues:0Issues:0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40074Issues:0Issues:0

InferLLM

a lightweight LLM model inference framework

Language:C++License:Apache-2.0Stargazers:660Issues:0Issues:0

tensorrt-cpp-api

TensorRT C++ API Tutorial

Language:C++License:MITStargazers:544Issues:0Issues:0

Play-Leetcode

My Solutions to Leetcode problems. All solutions support C++ language, some support Java and Python. Multiple solutions will be given by most problems. Enjoy:) 我的Leetcode解答。所有的问题都支持C++语言,一部分问题支持Java语言。近乎所有问题都会提供多个算法解决。大家加油!:)

Language:C++Stargazers:2709Issues:0Issues:0

flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

Language:CudaLicense:Apache-2.0Stargazers:501Issues:0Issues:0

KuiperLLama

动手实现大模型推理框架

Language:C++Stargazers:83Issues:0Issues:0

cudnn-frontend

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

Language:C++License:MITStargazers:376Issues:0Issues:0

Kolors

Kolors Team

Language:PythonLicense:Apache-2.0Stargazers:2565Issues:0Issues:0

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:6029Issues:0Issues:0

stable-ts

Timestamping Spoken Words

Language:PythonLicense:MITStargazers:7Issues:0Issues:0

llumnix

Efficient and easy multi-instance LLM serving

Language:PythonLicense:Apache-2.0Stargazers:55Issues:0Issues:0

knowhere

Knowhere is an open-source vector search engine, integrating FAISS, HNSW, etc.

Language:C++License:Apache-2.0Stargazers:157Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15111Issues:0Issues:0

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:9856Issues:0Issues:0

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3001Issues:0Issues:0