Chengwei Qin's repositories
Lifelong-Fewshot-Language-Learning
The code for lifelong few-shot language learning
100-Days-Of-ML-Code
100-Days-Of-ML-Code中文版
agentic_patterns
Implementing the 4 agentic patterns from scratch
AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
alignment-handbook
Robust recipes for to align language models with human and AI preferences
Awesome-Incremental-Learning
Awesome Incremental Learning
Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
awesome_lists
Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)
chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
ChatDev
Create Customized Software using Natural Language Idea (through Multi-Agent Collaboration)
ColossalAI
Making large AI models cheaper, faster and more accessible
Deep-Learning-Interview-Book
深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
FindTheChatGPTer
汇总那些ChatGPT的平替们,仅汇总那些开源代码/模型文件以及对话语料,闭源或者部分闭源的暂时不在统计范围内
funNLP
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据、百度中文问答数据集、句子相似度匹配算法集合、bert资源、文本生成&摘要相关工具、cocoNLP信息抽取工具、国内电话号码正则匹配、清华大学XLORE:中英文跨语言百科知识图谱、清华大学人工智能技术系列报
long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
qcwthu.github.io
Github Pages for personal website
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
SwiftSage
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.