QDX's starred repositories
LLMs_interview_notes
该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题
sft_datasets
开源SFT数据集整理,随时补充
ChatLM-mini-Chinese
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。
handson-ml3
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
dlwpt-code
Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
NLPDataSet
记录本人整理的一些数据集
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
KwaiAgents
A generalized information-seeking agent system with Large Language Models (LLMs).
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Chat-Haruhi-Suzumiya
Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
ContextualSP
Multiple paper open-source codes of the Microsoft Research Asia DKI group
instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
example-code-2e
Example code for Fluent Python, 2nd edition (O'Reilly 2022)