SuperXiang

Yingfei(Jeremy) Xiang's repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

MIT100

be_great

A novel approach for synthesizing tabular data using pretrained large language models

Language:PythonMIT100

BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

Apache-2.0100

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Apache-2.0100

CLAP

Contrastive Language-Audio Pretraining

Language:PythonCC0-1.0100

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

MIT100

how-to-train-tokenizer

怎么训练一个LLM分词器

100

InstructEval

Evaluation suite for the systematic evaluation of instruction selection methods.

100

llm-foundry

LLM training code for MosaicML foundation models

Apache-2.0100

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

MIT100

LM-RMT

Recurrent Memory Transformer

Apache-2.0100

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Apache-2.0100

lorahub

MIT100

mlmc

Code for fine-tuning transformers (XLNet, Bert and GPT-2) on binary, multi-class and multi-label sequence classification tasks.

100

multipack_sampler

Multipack distributed sampler for fast padding-free training of LLMs

MIT100

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Apache-2.0100

sentencepiece_chinese_bpe

使用sentencepiece中BPE训练中文词表，并在transformers中进行使用。

100

sharegpt

Easily share permanent links to ChatGPT conversations with your friends

Language:TypeScriptMIT100

Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

NOASSERTION100

t5x

Apache-2.0100

Transnormer

[EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer

100

trl

Train transformer language models with reinforcement learning.

Apache-2.0100

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT100

VardaGPT

Associative memory-enhanced GPT-2 model

100

langchain

⚡ Building applications with LLMs through composability ⚡

MIT000

llama

Inference code for LLaMA models

NOASSERTION000

llm3s-conatiner

large language model training-3-stages+deployment

Language:Python000

olm-datasets

Pipeline for pulling and processing online language model pretraining data from the web

Apache-2.0000

PdfGptIndexer

An efficient tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for rapid information retrieval and superior search accuracy.

MIT000

torchscale

Transformers at any scale

MIT000