Yingfei(Jeremy) Xiang's repositories
annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
how-to-train-tokenizer
怎么训练一个LLM分词器
InstructEval
Evaluation suite for the systematic evaluation of instruction selection methods.
llm-foundry
LLM training code for MosaicML foundation models
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
sentencepiece_chinese_bpe
使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。
Stable-Alignment
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
Transnormer
[EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer
langchain
⚡ Building applications with LLMs through composability ⚡
llama
Inference code for LLaMA models
llm3s-conatiner
large language model training-3-stages+deployment
olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
PdfGptIndexer
An efficient tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for rapid information retrieval and superior search accuracy.
torchscale
Transformers at any scale