Pengzhi Gao's starred repositories
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
openai-translator
基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
CTranslate2
Fast inference engine for Transformer models
Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
CrossConST-MT
Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization"
CrossConST-SR
Code for EMNLP 2023 industry track paper "Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization"
CrossConST-LLM
Code for arXiv paper "Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models"