qinr's starred repositories
GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
simstring-fast
A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.
RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
gradient-checkpointing
Make huge neural nets fit in memory
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
promptbase
All things prompt engineering
CTranslate2
Fast inference engine for Transformer models
google-research
Google Research
BigTranslate
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters