Jingdong Li (vBaiCai)

vBaiCai

Geek Repo

Company:Li Auto

Location:Beijing, China

Github PK Tool:Github PK Tool

Jingdong Li's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:62766Issues:527Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33318Issues:308Issues:418

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:19950Issues:189Issues:356

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4071Issues:54Issues:116

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonLicense:MITStargazers:3255Issues:57Issues:70

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:2298Issues:61Issues:166

cccl

CUDA C++ Core Libraries

Language:C++License:NOASSERTIONStargazers:872Issues:30Issues:1038

BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Language:PythonStargazers:667Issues:87Issues:0

Meta-voicebox

Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.

MS-AMP

Microsoft Automatic Mixed Precision Library

Language:PythonLicense:MITStargazers:471Issues:11Issues:57

UniAudio

The Open Source Code of UniAudio

NeuralSVB

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

Language:PythonLicense:GPL-3.0Stargazers:414Issues:13Issues:19

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language:PythonLicense:NOASSERTIONStargazers:350Issues:30Issues:25
Language:PythonLicense:NOASSERTIONStargazers:311Issues:12Issues:11

FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Language:PythonLicense:MITStargazers:292Issues:16Issues:42

causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Language:CudaLicense:BSD-3-ClauseStargazers:198Issues:3Issues:15
Language:PythonLicense:Apache-2.0Stargazers:163Issues:8Issues:7

torch-pesq

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

Language:PythonLicense:MITStargazers:119Issues:6Issues:5

gss

A simple package for Guided source separation (GSS)

Language:PythonLicense:MITStargazers:99Issues:5Issues:8

torchiva

Blind source separation with independent vector analysis family of algorithm in torch

Language:PythonLicense:MITStargazers:84Issues:5Issues:3

UniAudio

The official source code of UniAudio

meeteval

MeetEval - A meeting transcription evaluation toolkit

Language:PythonLicense:MITStargazers:63Issues:7Issues:8
Language:PythonStargazers:42Issues:0Issues:0

CausalityCheck

Causality Check in Frame-online Speech Separation

Language:PythonStargazers:40Issues:2Issues:0
Language:PythonStargazers:10Issues:0Issues:0

interspeech2023-moving-iva-samples

Repository containing samples produced by the method proposed in "Multi-channel separation of dynamic speech and sound events" and presented at Interspeech 2023.

Language:HTMLStargazers:9Issues:3Issues:0

SpeakerVerSim

Python-based simulation framework for different version control strategies of speaker recognition systems.

Language:PythonLicense:Apache-2.0Stargazers:3Issues:2Issues:0