Rayking1433465

Followers: 0

Following: 0

Company:******

Location: Shanghai, China

Rayking1433465's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38580 | Issues: 384 | Issues: 1648

easy-rl

A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 9009 | Issues: 78 | Issues: 143

text2vec

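text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.

Language: Python | License: Apache-2.0 | Stargazers: 4383 | Issues: 30 | Issues: 147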

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language: Python | License: Apache-2.0 | Stargazers: 3694 | Issues: 33 | Issues: 501

KeyBERT

Minimal keyword extraction with BERT

Language: Python | License: MIT | Stargazers: 3419 | Issues: 32 | Issues: 201

MedicalGPT

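MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.

Language: Python | License: Apache-2.0 | Stargazers: 3170 | Issues: 37 | Issues: 387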

Chinese-LangChain

Chinese LangChain project | 小必应 ("Little Bing"), Q.Talk, 强聊, QiangTalk

Chinese-Llama-2-7b

The open-source community's first downloadable and runnable Chinese LLaMA2 model!

Language: Python | License: Apache-2.0 | Stargazers: 2219 | Issues: 21 | Issues: 39

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language: Python | License: Apache-2.0 | Stargazers: 1831 | Issues: 17 | Issues: 109

Chat-Haruhi-Suzumiya

Chat凉宫春日 (Chat-Haruhi-Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1765 | Issues: 16 | Issues: 60

nlg-eval

Evaluation code for various unsupervised automated metrics for Natural Language Generation.

Language: Python | License: NOASSERTION | Stargazers: 1337 | Issues: 29 | Issues: 107

uniem

unified embedding model

Language: Python | License: Apache-2.0 | Stargazers: 813 | Issues: 7 | Issues: 104

Firefly-LLaMA2-Chinese

Firefly Chinese LLaMA-2 large language model; supports incremental pretraining of large models such as Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, and Bloom.

Chinese-Text-Classification-PyTorch

Chinese text classification implemented in PyTorch (TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, Transformer, Bert, ERNIE), ready to use out of the box!

dialogbot

dialogbot provides search-based, task-based, and generative dialogue models. It supports web retrieval question answering, domain knowledge question answering, task-guided question answering, and chit-chat, and works out of the box.

Language: Python | License: Apache-2.0 | Stargazers: 323 | Issues: 6 | Issues: 7

MedQA-ChatGLM

🛰️ Fine-tuning ChatGLM on real medical dialogue data with LoRA, P-Tuning V2, Freeze, RLHF, and other methods; our vision goes beyond medical question answering.

quick_sentence_transformers

sentence-transformers to ONNX, making SBERT model inference faster.

Language: Python | License: MIT | Stargazers: 156 | Issues: 3 | Issues: 7

chatglm2_finetuning

chatglm2 6b finetuning and alpaca finetuning

Language: Python | License: Apache-2.0 | Stargazers: 145 | Issues: 3 | Issues: 30

ml-qrecc

Open-Domain Question Answering Goes Conversational via Question Rewriting

Language: Python | License: Apache-2.0 | Stargazers: 137 | Issues: 8 | Issues: 12

LLMforDialogDataGenerate

Generate dialogue datasets from documents using LLMs such as ChatGLM2 or ChatGPT.

tudouNLP

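A BERT-based Chinese natural language processing toolkit covering sentiment analysis, Chinese word segmentation, part-of-speech tagging, and named entity recognition. It also provides training and prediction interfaces for text classification, sequence labeling, and sentence-pair relation classification tasks.

Language: Python | License: MIT | Stargazers: 128 | Issues: 3 | Issues: 3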

ChatGPTX-Uni

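Implements a cross-model approach that combines multi-LoRA weight integration and switching with Zero-Finetune enhancement: LLM-Base + LLM-X + Alpaca. Initially, LLM-Base is the Chatglm6B base model and LLM-X is a LLAMA enhancement model. The approach is simple and efficient. Its goal is to let language models of this kind be deployed widely at low energy cost, and eventually to have "emergent intelligence" arise on a small base model, reaching the human-friendly quality of ChatGPT, GPT4, ChatRWKV, and similar models at minimal compute cost. It currently handles summarization, asking questions, question answering, abstracting, rewriting, commenting, role-playing, and similar needs.

Language: Python | License: GPL-3.0 | Stargazers: 118 | Issues: 6 | Issues: 8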

Baichuan-13B-Finetuning

Baichuan-13B instruction fine-tuning

baichuan_sft_lora

baichuan LLM supervised fine-tuning with LoRA

wiki-word2vec

Trains a word-vector model with word2vec on the Chinese Wikipedia corpus.

Language: Python | Stargazers: 55 | Issues: 0 | Issues: 0

baichuan-Qlora-Tuning

Instruction fine-tuning of the baichuan-7B large model with QLoRA.