Yixin's starred repositories
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
guwen-models
GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Chinese related models and resources on the Internet.
chinese-poetry
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
Classical-Modern
非常全的文言文(古文)-现代文平行语料
ChatGenTitle
🌟 ChatGenTitle:使用百万arXiv论文信息在LLaMA模型上进行微调的论文题目生成模型
awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
efficient_alpaca
The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca
ICL_PaperList
Paper List for In-context Learning 🌷
ColossalAI
Making large AI models cheaper, faster and more accessible
promptsource
Toolkit for creating, sharing and using natural language prompts.
pycantonese
Cantonese Linguistics and NLP
Awesome-Hyperbolic-NeuralNetworks
Papers and Codes for the deep learning in hyperbolic space
Awesome-Hyperbolic-Representation-and-Deep-Learning
Paper list about hyperbolic embedding, hyperbolic models,hyperbolic applications
machine-learning-notes
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(2000+页)和视频链接
dataset_difficulty
"Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)
TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense