Starred repositories
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
DeepSpeedExamples
Example models using DeepSpeed
WeChatFerry
A low-level framework for WeChat bots that can integrate large models such as Gemini, ChatGPT, ChatGLM, iFlytek Spark, and Tigerbot. WeChat Robot Hook.
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with a ChatGPT-style Training Pipeline. Trains medical large language models, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
KuiperInfer
Implement a high-performance deep learning inference library from scratch, step by step; supports inference for models such as Llama 2, UNet, YOLOv5, and ResNet. A great project for campus recruitment (autumn/spring hiring) and internships!
JittorLLMs
Jittor large language model inference library, featuring high performance, low hardware requirements, good Chinese-language support, and portability.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
PPOxFamily
PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to sort out the algorithm theory, straighten out the code logic, and put decision AI into practice)
RecurrentGPT
Official Code for Paper: RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text
MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
Tabular-LLM
This project collects open-source datasets for table intelligence tasks (e.g., table question answering and table-to-text generation), converts the raw data into instruction fine-tuning format, and fine-tunes LLMs on it to strengthen their understanding of tabular data, ultimately building a large language model specialized for table intelligence tasks.
rwkv-cpp-accelerated
A torchless C++ RWKV implementation using 8-bit quantization, written in CUDA/HIP/Vulkan for maximum compatibility and minimum dependencies
simple-hierarchical-transformer
Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT
chatglm2_finetuning
ChatGLM2-6B fine-tuning and Alpaca-style fine-tuning
tensorlink
Unlock unlimited potential! Share your GPU power across your local network!