Beast code in Giters

Castiel's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.034066 340 2658

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT29950 190 990

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonApache-2.012845 99 1033

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

10918 250 106

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Language:PythonNOASSERTION10765 232 89

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language:Python9706 148 59

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.08894 75 1019

CodeGeeX

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Language:PythonApache-2.08020 85 212

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT5991 36 964

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION5785 47 75

agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Language:PythonApache-2.05003 58 71

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Language:PythonApache-2.04642 51 273

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

Apache-2.03109 58 3

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonApache-2.03033 33 370

kenlm

KenLM: Faster and Smaller Language Model Queries

Language:C++NOASSERTION2458 69 366

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！

Language:PythonApache-2.01831 16 153

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonApache-2.01576 21 85

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language:PythonBSD-3-Clause1383 22 38

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonNOASSERTION1290 25 62

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonApache-2.0916 71 22

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonNOASSERTION526 11 22

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

482 20 4

M2UGen

This is the official repository for M2UGen

Language:Jupyter NotebookMIT427 10 11

sft_datasets

开源SFT数据集整理,随时补充

394 1 2

MIC

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

Language:Python312 9 31

video-bgm-generation

Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)

Language:PythonMIT281 9 26

llark

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

Language:Jupyter NotebookNOASSERTION280 7 7

MusicLDM

The latent diffusion model for text-to-music generation.

Language:PythonNOASSERTION142 13 6

blsp

BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing

Language:PythonApache-2.040 1 1

Open-Suno

trying to reproduce suno v3

MIT23 30