Pengzhi Gao (gpengzhi)

Company: Xiaomi AI Lab

Location: Beijing

Home Page: https://gpengzhi.github.io/

Pengzhi Gao's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 69348 | Issues: 575 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 55982 | Issues: 525 | Issues: 968

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 37999 | Issues: 396 | Issues: 67

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 32309 | Issues: 205 | Issues: 4975

LLM101n

LLM101n: Let's build a Storyteller

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 29189 | Issues: 309 | Issues: 94

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 28296 | Issues: 234 | Issues: 4807

fastText

Library for fast text representation and classification.

openai-translator

Browser extension and cross-platform desktop application for translation based on ChatGPT API.

Language: TypeScript | License: AGPL-3.0 | Stargazers: 23746 | Issues: 123 | Issues: 794

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language: C++ | License: NOASSERTION | Stargazers: 15047 | Issues: 139 | Issues: 414

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 13682 | Issues: 102 | Issues: 1046

llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps to showcase Meta Llama for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 12082 | Issues: 180 | Issues: 353

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10839 | Issues: 140 | Issues: 353

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7778 | Issues: 89 | Issues: 100

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7627 | Issues: 106 | Issues: 291

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language: Python | License: Apache-2.0 | Stargazers: 5672 | Issues: 67 | Issues: 129

GPT-4-LLM

Instruction Tuning with GPT-4

Language: HTML | License: Apache-2.0 | Stargazers: 4181 | Issues: 43 | Issues: 34

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language: Python | License: Apache-2.0 | Stargazers: 4083 | Issues: 41 | Issues: 395

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

CTranslate2

Fast inference engine for Transformer models

Language: C++ | License: MIT | Stargazers: 3303 | Issues: 57 | Issues: 696

Baichuan-13B

A 13B large language model developed by Baichuan Intelligent Technology

Language: Python | License: Apache-2.0 | Stargazers: 2986 | Issues: 32 | Issues: 194

texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

Language: Python | License: Apache-2.0 | Stargazers: 2387 | Issues: 78 | Issues: 159

awesome-instruction-dataset

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

ALMA

State-of-the-art LLM-based translation models.

Language: Ruby | License: MIT | Stargazers: 400 | Issues: 12 | Issues: 55

CrossConST-MT

Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization"

CrossConST-SR

Code for EMNLP 2023 industry track paper "Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization"

Language: Python | License: Apache-2.0 | Stargazers: 5 | Issues: 2 | Issues: 2

SimCR

Code for NAACL 2024 main conference paper "An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation"

Language: Python | Stargazers: 5 | Issues: 2 | Issues: 0

CrossConST-LLM

Code for arXiv paper "Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models"