Liujingxiu23's repositories

ai-audio-datasets-list

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

License:MITStargazers:1Issues:0Issues:0

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Stargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AudioSep

Official implementation of "Separate Anything You Describe"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

License:MITStargazers:0Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

DeepMIR

Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University (2023 Fall)

License:NOASSERTIONStargazers:0Issues:0Issues:0

Diff-BGM

official code for CVPR'24 paper Diff-BGM

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

HeyGenClone

A simple and open-source analogue of the HeyGen system

Language:PythonStargazers:0Issues:0Issues:0

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

License:MITStargazers:0Issues:0Issues:0

lina-speech

lina-speech : linear attention based text-to-speech

License:NOASSERTIONStargazers:0Issues:0Issues:0

lp-music-caps

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Language:PythonStargazers:0Issues:0Issues:0

Make-An-Audio-3

Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Qwen-7B

The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

supervoice-gpt

GPT-style network for phonemization with durations of text

Language:PythonStargazers:0Issues:0Issues:0

TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

License:Apache-2.0Stargazers:0Issues:0Issues:0

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen, Tortoise)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

vampnet

music generation with masked transformers!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

voicebox-pytorch

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

License:MITStargazers:0Issues:0Issues:0

WavJourney

WavJourney: Compositional Audio Creation with LLMs

Stargazers:0Issues:0Issues:0