cc-cherie's starred repositories

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language: Python | License: Apache-2.0 | Stargazers: 3224 | Issues: 0

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 6879 | Issues: 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12415 | Issues: 0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 28891 | Issues: 0

OpenVoice

Instant voice cloning by MyShell.

Language: Python | License: MIT | Stargazers: 27184 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6963 | Issues: 0

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language: Python | License: MIT | Stargazers: 7509 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19130 | Issues: 0

generative-models

Generative Models by Stability AI

Language: Python | License: MIT | Stargazers: 23271 | Issues: 0

vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | Stargazers: 1930 | Issues: 0

emotionally_consistent_speech

Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0

benchmarks

This repository contains the SpeechBrain Benchmarks

Language: Python | License: Apache-2.0 | Stargazers: 70 | Issues: 0

audiotext-transformer

Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features

Language: Python | Stargazers: 24 | Issues: 0

MMSA

MMSA is a unified framework for Multimodal Sentiment Analysis.

Language: Python | License: MIT | Stargazers: 611 | Issues: 0

BERT-like-is-All-You-Need

The code for our INTERSPEECH 2020 paper: Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition

Language: Python | License: MIT | Stargazers: 109 | Issues: 0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language: Python | License: Apache-2.0 | Stargazers: 6637 | Issues: 0

Speech-emotion-recognition-MCFN

This is a repository for our work: A Dual Attention-Based Modality-Collaborative Fusion Network for Emotion Recognition

Language: Python | Stargazers: 4 | Issues: 0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 3061 | Issues: 0

data2vec-pytorch

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Language: Python | License: MIT | Stargazers: 165 | Issues: 0

CLAP

Contrastive Language-Audio Pretraining

Language: Python | License: CC0-1.0 | Stargazers: 1243 | Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 2155 | Issues: 0

PraatScripts

These are Praat scripts I use in my research, implemented in Parselmouth for Python for use in Binder

Language: Jupyter Notebook | License: MIT | Stargazers: 119 | Issues: 0

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | Stargazers: 33757 | Issues: 0

stock

stock: a stock system, developed in Python.

Language: Python | License: Apache-2.0 | Stargazers: 6539 | Issues: 0

leedl-tutorial

"Hung-yi Lee Deep Learning Tutorial" (recommended by Professor Hung-yi Lee 👍). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 11090 | Issues: 0

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language: TeX | License: Apache-2.0 | Stargazers: 560 | Issues: 0

ATST-SED

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

Language: Jupyter Notebook | License: MIT | Stargazers: 62 | Issues: 0

audioset-downloader

CLI to download examples of a specific class from Google's AudioSet

Language: Python | License: MIT | Stargazers: 5 | Issues: 0

audioset-processing

Toolkit for downloading and processing Google's AudioSet dataset.

Language: Jupyter Notebook | License: MIT | Stargazers: 152 | Issues: 0
Language: Python | Stargazers: 3 | Issues: 0