Chaofan Tao (ChaofanTao)

Location: Hong Kong

Home Page: https://chaofantao.top/

Chaofan Tao's starred repositories

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language: Python | License: MIT | Stargazers: 475 | Issues: 0

Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM

Language: Python | License: NOASSERTION | Stargazers: 571 | Issues: 0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 28944 | Issues: 0

instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Language: Python | License: Apache-2.0 | Stargazers: 491 | Issues: 0

Long-Context-Data-Engineering

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Language: Python | Stargazers: 388 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 836 | Issues: 0

how-to-train-tokenizer

How to train an LLM tokenizer

Language: Python | Stargazers: 114 | Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7324 | Issues: 0

datablations

Scaling Data-Constrained Language Models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 300 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 1083 | Issues: 0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | Stargazers: 10901 | Issues: 0

pixel

Research code for pixel-based encoders of language (PIXEL)

Language: Python | License: Apache-2.0 | Stargazers: 324 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 269 | Issues: 0

promptbench

A unified evaluation framework for large language models

Language: Python | License: MIT | Stargazers: 2269 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 256 | Issues: 0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4196 | Issues: 0

PICL

Code for ACL2023 paper: Pre-Training to Learn in Context

Language: Python | License: MIT | Stargazers: 106 | Issues: 0

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language: Python | License: MIT | Stargazers: 4629 | Issues: 0

starcoder

Home of StarCoder: fine-tuning & inference!

Language: Python | License: Apache-2.0 | Stargazers: 7204 | Issues: 0

Megatron-LLM

distributed trainer for LLMs

Language: Python | License: NOASSERTION | Stargazers: 500 | Issues: 0

ProAgent

An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation

Language: Python | License: Apache-2.0 | Stargazers: 708 | Issues: 0

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Stargazers: 838 | Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 1726 | Issues: 0

OpenAgents

OpenAgents: An Open Platform for Language Agents in the Wild

Language: Python | License: Apache-2.0 | Stargazers: 3746 | Issues: 0

visual_prompt_retrieval

[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"

Language: Python | License: CC0-1.0 | Stargazers: 158 | Issues: 0

Emu

Emu Series: Generative Multimodal Models from BAAI

Language: Python | License: Apache-2.0 | Stargazers: 1568 | Issues: 0

Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

Stargazers: 641 | Issues: 0

LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Language: Python | License: GPL-3.0 | Stargazers: 281 | Issues: 0

generative-models

Generative Models by Stability AI

Language: Python | License: MIT | Stargazers: 23281 | Issues: 0

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language: Python | License: MIT | Stargazers: 4205 | Issues: 0