zhaobinNF

Company: Fudan University

Location: Shanghai

zhaobinNF's starred repositories

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python, License: Apache-2.0, Stargazers: 36853, Issues: 429, Issues: 1641

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python, License: Apache-2.0, Stargazers: 27240, Issues: 185, Issues: 4341

Awesome-Chinese-LLM

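A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train. It covers base models, domain-specific fine-tuning and applications, datasets, and tutorials.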

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

ChatLaw

ChatLaw: A Powerful LLM Tailored for the Chinese Legal Domain. A Chinese legal large language model.

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python, License: MIT, Stargazers: 4399, Issues: 49, Issues: 285

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language: Python, License: MIT, Stargazers: 3320, Issues: 27, Issues: 265

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python, License: NOASSERTION, Stargazers: 2702, Issues: 28, Issues: 257

learn-nlp-with-transformers

We want to create a repo to illustrate the usage of transformers in Chinese.

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python, License: Apache-2.0, Stargazers: 1887, Issues: 19, Issues: 77

llm_interview_note

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers.

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python, License: Apache-2.0, Stargazers: 1252, Issues: 17, Issues: 82

MOSS-RLHF

MOSS-RLHF

Language: Python, License: Apache-2.0, Stargazers: 1233, Issues: 34, Issues: 51

LLMs_interview_notes

This repository mainly records interview questions for large language model (LLM) algorithm engineers.

awesome_LLMs_interview_notes

LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers.

llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Language: Python, License: MIT, Stargazers: 845, Issues: 18, Issues: 97

uniem

unified embedding model

Language: Python, License: Apache-2.0, Stargazers: 793, Issues: 7, Issues: 101

RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

Language: Python, License: Apache-2.0, Stargazers: 755, Issues: 19, Issues: 107

Cornucopia-LLaMA-Fin-Chinese

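Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial large language models, along with an efficient, lightweight training framework for vertical-domain LLMs (Pretraining, SFT, RLHF, Quantize, etc.)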

Language: Python, License: Apache-2.0, Stargazers: 572, Issues: 5, Issues: 20

tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.

Language: Python, License: Apache-2.0, Stargazers: 443, Issues: 10, Issues: 87

MedQA-ChatGLM

🛰️ Fine-tuning ChatGLM on real medical dialogue data with LoRA, P-Tuning V2, Freeze, RLHF, and other methods; our vision extends beyond medical question answering.

RLHF

Implementation of Chinese ChatGPT

SynoCN

Chinese synonym list (Chinese Synonyms)

open-chatgpt

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. Implementing a ChatGPT from scratch.

Language: Python, License: Apache-2.0, Stargazers: 170, Issues: 11, Issues: 6

NTU-ReinforcementLearning-Notes

Study notes on deep reinforcement learning as taught by Professor Hung-yi Lee of National ** University

Language: Python, Stargazers: 117, Issues: 3, Issues: 0

alpaca-rlhf

Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat

Language: Python, License: MIT, Stargazers: 103, Issues: 3, Issues: 16

onnx-embedding

A repository for creating, and sample code for consuming an ONNX embedding model

Language: Python, License: Apache-2.0, Stargazers: 25, Issues: 4, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 23, Issues: 1, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 22, Issues: 2, Issues: 0

-DRL

Notes on Professor Hung-yi Lee's reinforcement learning course

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 5, Issues: 1, Issues: 0