Liujingxiu23's repositories

ai-audio-datasets-list

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

License:MITStargazers:1Issues:0Issues:0

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Stargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AudioSep

Official implementation of "Separate Anything You Describe"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

License:MITStargazers:0Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

DeepMIR

Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University (2023 Fall)

License:NOASSERTIONStargazers:0Issues:0Issues:0

Diff-BGM

official code for CVPR'24 paper Diff-BGM

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

HeyGenClone

A simple and open-source analogue of the HeyGen system

Language:PythonStargazers:0Issues:0Issues:0

lina-speech

lina-speech : linear attention based text-to-speech

License:NOASSERTIONStargazers:0Issues:0Issues:0

lp-music-caps

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Language:PythonStargazers:0Issues:0Issues:0

Make-An-Audio-3

Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Qwen-7B

The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

supervoice-gpt

GPT-style network for phonemization with durations of text

Language:PythonStargazers:0Issues:0Issues:0

supervoice-hybrid

My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one

Stargazers:0Issues:0Issues:0

TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen, Tortoise)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

vampnet

music generation with masked transformers!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

voicebox-pytorch

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

WavJourney

WavJourney: Compositional Audio Creation with LLMs

Stargazers:0Issues:0Issues:0