zhaobinNF's starred repositories

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python | License: Apache-2.0 | 36853 429 1641

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | 27240 185 4341

Awesome-Chinese-LLM

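A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.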

13470 185 21

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language: Python | 9639 146 59

ChatLaw

ChatLaw: a powerful LLM tailored for the Chinese legal domain.

License: AGPL-3.0 | 6684 37 74

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | 4399 49 285

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language: Python | License: MIT | 3320 27 265

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python | License: NOASSERTION | 2702 28 257

learn-nlp-with-transformers

We want to create a repo to illustrate the usage of transformers in Chinese.

Language: Shell | 1925 14 20

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | 1887 19 77

llm_interview_note

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers.

Language: HTML | 1588 6 4

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python | License: Apache-2.0 | 1252 17 82

MOSS-RLHF

Language: Python | License: Apache-2.0 | 1233 34 51

LLMs_interview_notes

This repository mainly records interview questions for large language model (LLM) algorithm engineers.

License: Apache-2.0 | 1191 10 1

awesome_LLMs_interview_notes

LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers.

License: MIT | 1094 17 6

llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Language: Python | License: MIT | 845 18 97

uniem

unified embedding model

Language: Python | License: Apache-2.0 | 793 7 101

RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

Language: Python | License: Apache-2.0 | 755 19 107

Cornucopia-LLaMA-Fin-Chinese

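Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial large language models, together with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.).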

Language: Python | License: Apache-2.0 | 572 5 20

tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.

Language: Python | License: Apache-2.0 | 443 10 87

MedQA-ChatGLM

🛰️ Fine-tuning ChatGLM on real medical dialogue data with LoRA, P-Tuning V2, Freeze, RLHF, and other methods; our vision goes beyond medical question answering.

Language: Python | 292 5 12

RLHF

Implementation of Chinese ChatGPT

Language: Python | 282 8 25

SynoCN

A list of Chinese synonyms.

240 7 6

open-chatgpt

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. Implementing a ChatGPT from scratch.

Language: Python | License: Apache-2.0 | 170 11 6

NTU-ReinforcementLearning-Notes

Study notes on deep reinforcement learning as taught by Professor Hung-yi Lee of National ** University.

Language: Python | 117 30

alpaca-rlhf

Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat

Language: Python | License: MIT | 103 3 16

onnx-embedding

A repository for creating an ONNX embedding model, with sample code for consuming it.

Language: Python | License: Apache-2.0 | 25 40

ChatGLM-Efficient-Tuning-Explained

Language: Python | License: Apache-2.0 | 23 10

acl2024-dapr

Language: Python | License: Apache-2.0 | 22 20

-DRL

Notes on Professor Hung-yi Lee's reinforcement learning lectures.

Language: Jupyter Notebook | License: NOASSERTION | 5 10