Castiel (castiel520)

castiel520

Geek Repo

Location:Beijing, China

Github PK Tool:Github PK Tool

Castiel's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34066Issues:340Issues:2658

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29950Issues:190Issues:990

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:12845Issues:99Issues:1033

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Language:PythonLicense:NOASSERTIONStargazers:10765Issues:232Issues:89

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8894Issues:75Issues:1019

CodeGeeX

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Language:PythonLicense:Apache-2.0Stargazers:8020Issues:85Issues:212

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:5991Issues:36Issues:964

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5785Issues:47Issues:75

agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Language:PythonLicense:Apache-2.0Stargazers:5003Issues:58Issues:71

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Language:PythonLicense:Apache-2.0Stargazers:4642Issues:51Issues:273

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3033Issues:33Issues:370

kenlm

KenLM: Faster and Smaller Language Model Queries

Language:C++License:NOASSERTIONStargazers:2458Issues:69Issues:366

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:1831Issues:16Issues:153

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1576Issues:21Issues:85

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language:PythonLicense:BSD-3-ClauseStargazers:1383Issues:22Issues:38

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:1290Issues:25Issues:62

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:916Issues:71Issues:22

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonLicense:NOASSERTIONStargazers:526Issues:11Issues:22

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

M2UGen

This is the official repository for M2UGen

Language:Jupyter NotebookLicense:MITStargazers:427Issues:10Issues:11

sft_datasets

开源SFT数据集整理,随时补充

MIC

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

video-bgm-generation

Video Background Music Generation with Controllable Music Transformer (ACM MM 2021 Best Paper Award)

Language:PythonLicense:MITStargazers:281Issues:9Issues:26

llark

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:280Issues:7Issues:7

MusicLDM

The latent diffusion model for text-to-music generation.

Language:PythonLicense:NOASSERTIONStargazers:142Issues:13Issues:6

blsp

BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing

Language:PythonLicense:Apache-2.0Stargazers:40Issues:1Issues:1

Open-Suno

trying to reproduce suno v3

License:MITStargazers:23Issues:3Issues:0