songyinghao's starred repositories

llama2.c

Inference Llama 2 in one file of pure C

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language:C++License:Apache-2.0Stargazers:9952Issues:124Issues:734

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8998Issues:74Issues:1038

wandb

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language:PythonLicense:MITStargazers:8764Issues:56Issues:3249

Yi

A series of large language models trained from scratch by developers @01-ai

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7552Issues:109Issues:291

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7496Issues:109Issues:151

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonLicense:Apache-2.0Stargazers:5919Issues:67Issues:269

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonLicense:Apache-2.0Stargazers:5189Issues:63Issues:476

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3549Issues:22Issues:455

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:1792Issues:24Issues:173

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonLicense:Apache-2.0Stargazers:1443Issues:26Issues:24

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:1341Issues:25Issues:62

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:1319Issues:12Issues:118

chatllama

ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT

bert4torch

An elegent pytorch implement of transformers

Language:PythonLicense:MITStargazers:1192Issues:16Issues:148

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language:PythonLicense:MITStargazers:1115Issues:20Issues:19

swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

whisper.api

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

Language:PythonLicense:MITStargazers:853Issues:13Issues:1

llama-from-scratch

Llama from scratch, or How to implement a paper without crying

Language:Jupyter NotebookStargazers:494Issues:5Issues:8

Firefly-LLaMA2-Chinese

Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Language:PythonLicense:GPL-3.0Stargazers:291Issues:8Issues:35

LLMChat

A full-stack Webui implementation of Large Language model, such as ChatGPT or LLaMA.

Language:PythonLicense:MITStargazers:247Issues:7Issues:43

llama2-fine-tune

Scripts for fine-tuning Llama2 via SFT and DPO.

open-chatgpt

The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.

Language:PythonLicense:Apache-2.0Stargazers:170Issues:11Issues:6

llama2.c-zh

支持中文场景的的小语言模型 llama2.c-zh

allamo

Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models

Language:PythonLicense:MITStargazers:139Issues:6Issues:8

open-llama2

从预训练到强化学习的中文llama2

instruct_storyteller_tinyllama2

Training and Fine-tuning an llm in Python and PyTorch.

Language:Jupyter NotebookStargazers:37Issues:0Issues:0
Language:PythonStargazers:28Issues:0Issues:1