:)s (Sm1les)

Sm1les

Geek Repo

Location:Beijing

Home Page:sm1les.com

Github PK Tool:Github PK Tool

:)s's starred repositories

llama.cpp

LLM inference in C/C++

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40465Issues:394Issues:1293

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:35656Issues:997Issues:188

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31818Issues:203Issues:4911

llama2.c

Inference Llama 2 in one file of pure C

pypdf

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Language:PythonLicense:NOASSERTIONStargazers:8118Issues:148Issues:1144

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:6016Issues:74Issues:534

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonLicense:Apache-2.0Stargazers:5968Issues:68Issues:269

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Language:PythonLicense:AGPL-3.0Stargazers:5178Issues:60Issues:2013

WeChatFerry

微信机器人底层框架,可接入Gemini、ChatGPT、ChatGLM、讯飞星火、Tigerbot等大模型。WeChat Robot Hook.

Language:C++License:MITStargazers:3876Issues:55Issues:165

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3244Issues:38Issues:392

KuiperInfer

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

Language:C++License:MITStargazers:2435Issues:25Issues:27

JittorLLMs

计图大模型推理库,具有高性能、配置要求低、中文支持好、可移植等特点

Language:PythonLicense:Apache-2.0Stargazers:2363Issues:28Issues:181

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2229Issues:32Issues:105

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:2055Issues:19Issues:81

PPOxFamily

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Language:PythonLicense:Apache-2.0Stargazers:1911Issues:16Issues:17

WebCPM

Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"

Language:HTMLLicense:Apache-2.0Stargazers:976Issues:24Issues:26

RecurrentGPT

Official Code for Paper: RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text

Language:PythonLicense:GPL-3.0Stargazers:958Issues:12Issues:24

lilac

Curate better data for LLMs

Language:PythonLicense:Apache-2.0Stargazers:943Issues:13Issues:292

MEGABYTE-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

Language:PythonLicense:MITStargazers:620Issues:11Issues:15

resume

My resume in LaTeX (template suited for new graduates; 应届生简历模板)

Tabular-LLM

本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。

CoLLiE

Collaborative Training of Large Language Models in an Efficient Way

Language:PythonLicense:Apache-2.0Stargazers:407Issues:10Issues:69

ChatPLUG

A Chinese Open-Domain Dialogue System

Language:PythonLicense:Apache-2.0Stargazers:310Issues:10Issues:15

rwkv-cpp-accelerated

A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependencies

Language:C++License:MITStargazers:306Issues:10Issues:21

simple-hierarchical-transformer

Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT

Language:PythonLicense:MITStargazers:203Issues:10Issues:4

chatglm2_finetuning

chatglm2 6b finetuning and alpaca finetuning

Language:PythonLicense:Apache-2.0Stargazers:144Issues:3Issues:30

tensorlink

Unlock Unlimited Potential! Share Your GPU Power Across Your Local Network!

Language:GoLicense:GPL-3.0Stargazers:35Issues:4Issues:3

rmib

The official Implementation code for RMIB: Representation Matching Information Bottleneck for Matching Text Representations (ICML2024)

Language:PythonLicense:Apache-2.0Stargazers:5Issues:1Issues:0