songyinghao's starred repositories
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
bert4torch
An elegant PyTorch implementation of transformers
swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
whisper.api
This project provides an API with user-level access support to transcribe speech to text using a fine-tuned and processed Whisper ASR model.
llama-from-scratch
Llama from scratch, or How to implement a paper without crying
Firefly-LLaMA2-Chinese
Firefly Chinese LLaMA-2 large model, supporting continued pre-training of large models such as Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, and Bloom
LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
llama2-fine-tune
Scripts for fine-tuning Llama2 via SFT and DPO.
open-chatgpt
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. Implementing a ChatGPT from scratch.
llama2.c-zh
llama2.c-zh, a small language model that supports Chinese-language scenarios
open-llama2
Chinese Llama2, from pre-training to reinforcement learning
instruct_storyteller_tinyllama2
Training and fine-tuning an LLM in Python and PyTorch.