yuleiqin's starred repositories
google-research
Google Research
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
lm-evaluation-harness
A framework for few-shot evaluation of language models.
lemon-cleaner
腾讯柠檬清理是针对macOS系统专属制定的清理工具。主要功能包括重复文件和相似照片的识别、软件的定制化垃圾扫描、可视化的全盘空间分析、内存释放、浏览器隐私清理以及设备实时状态的监控等。重点聚焦清理功能,对上百款软件提供定制化的清理方案,提供专业的清理建议,帮助用户轻松完成一键式清理。
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
NLPDataSet
记录本人整理的一些数据集
CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
Contrastive-Learning-NLP-Papers
Paper List for Contrastive Learning for Natural Language Processing
llama-lora-fine-tuning
llama fine-tuning with lora
CodeLLaMA-chat
CodeLLaMA 中文版 - 代码生成助手,huggingface累积下载2w+次
acl2020-commonsense
Source code for paper on commonsense reasoning for 2020 Annual Conference of the Association for Computational Linguistics (ACL) 2020.