Beast code in Giters

LuckerYi's starred repositories

Interpret_Instruction_Tuning_LLMs

Understanding Why and How Instruction Tuning Changes Pre-trained Models

Language:PythonGPL-3.01300

videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Language:PythonApache-2.015300

LibriTTS-P

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

10400

emphassess

This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses paper (de Seyssel et al., 2023).

Language:PythonNOASSERTION1100

Token-level-Direct-Preference-Optimization

Reference implementation for Token-level Direct Preference Optimization(TDPO)

Language:PythonApache-2.08300

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language:HTMLMIT1064500

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.02476200

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION588900

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT3136800

best-rq-pytorch

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

Language:PythonMIT7300

SECap

Language:Python12300

CLAP

Learning audio concepts from natural language supervision

Language:PythonMIT44800

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language:Python55600