Ti1bur's starred repositories

GoMate

GoMate: a RAG framework with reliable input and trusted output

Language: Python | Stargazers: 476 | Issues: 0

fastbm25

A fast Python implementation of the BM25 algorithm, built on an inverted index

Language: Python | License: Apache-2.0 | Stargazers: 43 | Issues: 0

WanJuan1.0

WanJuan 1.0 multimodal corpus

License: CC-BY-4.0 | Stargazers: 539 | Issues: 0

NLP-Interview-Notes

This repository mainly collects interview questions for NLP algorithm engineers

Stargazers: 2451 | Issues: 0

NLP_ability

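A summary of the knowledge that natural language processing (NLP) engineers need to build up, including interview questions, fundamentals, and engineering skills, to strengthen core competitiveness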

Language: Python | Stargazers: 6812 | Issues: 0

chinese-chatbot-corpus

A collection of publicly available Chinese chat corpora

Language: Python | License: Apache-2.0 | Stargazers: 3985 | Issues: 0

insurance-clause-pdf-format

Structured extraction of data from insurance clause PDFs

Stargazers: 11 | Issues: 0

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.

Language: Python | License: Apache-2.0 | Stargazers: 3277 | Issues: 0

rlt2t

Text to text with reinforcement learning

Language: Python | Stargazers: 30 | Issues: 0

MuCGEC

MuCGEC: an open-source Chinese error correction dataset and SOTA text correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"

Language: Python | License: Apache-2.0 | Stargazers: 499 | Issues: 0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python | License: Apache-2.0 | Stargazers: 40521 | Issues: 0

machine-learning-interview

A summary of machine learning interview questions for algorithm engineers

Stargazers: 1330 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 133598 | Issues: 0

WBDC_2022_RANK8

8th-place solution for the 2022 WeChat Big Data Challenge

Language: Python | Stargazers: 73 | Issues: 0

paper-reading

Paragraph-by-paragraph close reading of classic and new deep learning papers

License: Apache-2.0 | Stargazers: 26727 | Issues: 0

annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python | License: MIT | Stargazers: 55144 | Issues: 0