yuleiqin's starred repositories

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33499Issues:748Issues:1218

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:13164Issues:99Issues:758

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:13107Issues:90Issues:611

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:6190Issues:38Issues:891

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:5953Issues:36Issues:955

lemon-cleaner

腾讯柠檬清理是针对macOS系统专属制定的清理工具。主要功能包括重复文件和相似照片的识别、软件的定制化垃圾扫描、可视化的全盘空间分析、内存释放、浏览器隐私清理以及设备实时状态的监控等。重点聚焦清理功能,对上百款软件提供定制化的清理方案,提供专业的清理建议,帮助用户轻松完成一键式清理。

Language:Objective-CLicense:NOASSERTIONStargazers:5348Issues:52Issues:68

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Language:PythonLicense:Apache-2.0Stargazers:4633Issues:51Issues:272

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3018Issues:33Issues:369
Language:PythonLicense:Apache-2.0Stargazers:1712Issues:124Issues:20

CodeXGLUE

CodeXGLUE

NLPDataSet

记录本人整理的一些数据集

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

SpanBERT

Code for using and evaluating SpanBERT.

Language:PythonLicense:NOASSERTIONStargazers:883Issues:20Issues:73
Language:PythonLicense:Apache-2.0Stargazers:765Issues:12Issues:34

TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:548Issues:8Issues:10

CSL

[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集

TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Language:PythonLicense:MITStargazers:534Issues:11Issues:23

Contrastive-Learning-NLP-Papers

Paper List for Contrastive Learning for Natural Language Processing

rerope

Rectified Rotary Position Embeddings

MQ-Det

Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)

Language:PythonLicense:Apache-2.0Stargazers:249Issues:3Issues:57

EcomGPT

An Instruction-tuned Large Language Model for E-commerce

llama-lora-fine-tuning

llama fine-tuning with lora

Language:PythonLicense:MITStargazers:130Issues:2Issues:15

ckipnlp

CKIP CoreNLP Toolkits

Language:PythonLicense:GPL-3.0Stargazers:115Issues:8Issues:34

ambient

Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)

Language:Jupyter NotebookStargazers:50Issues:2Issues:1

CodeLLaMA-chat

CodeLLaMA 中文版 - 代码生成助手,huggingface累积下载2w+次

acl2020-commonsense

Source code for paper on commonsense reasoning for 2020 Annual Conference of the Association for Computational Linguistics (ACL) 2020.

Language:PythonLicense:Apache-2.0Stargazers:27Issues:7Issues:6
Language:PythonLicense:Apache-2.0Stargazers:15Issues:7Issues:0