Chaofan Tao's starred repositories
Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
GPT-SoVITS
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
how-to-train-tokenizer
How to train an LLM tokenizer
datablations
Scaling Data-Constrained Language Models
promptbench
A unified evaluation framework for large language models
alignment-handbook
Robust recipes to align language models with human and AI preferences
Megatron-LLM
Distributed trainer for LLMs
Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ 🍸 🍹 🍷
OpenAgents
OpenAgents: An Open Platform for Language Agents in the Wild
visual_prompt_retrieval
[NeurIPS 2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"
Awesome-LLMs-Evaluation-Papers
The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.
LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
generative-models
Generative Models by Stability AI
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference