dyang (dongsig)

dongsig

Geek Repo

Company:Tencent

Location:Shanghai

Github PK Tool:Github PK Tool

dyang's repositories

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, WIP

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

AEC-Challenge

AEC Challenge

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

AudioAge

Transferring audio features to build models for rare conditions with scarce data

License:Apache-2.0Stargazers:0Issues:0Issues:0

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

License:NOASSERTIONStargazers:0Issues:0Issues:0

AugLy

A data augmentations library for audio, image, text, and video.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Auto-Age-Labeler

A web application that uses artificial intelligence to automatically label voice datasets with the age of the speaker.

License:MITStargazers:0Issues:0Issues:0

Bert-VITS2

vits2 backbone with bert

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

create_wsj1_2345_db

Collection of scripts to create a dataset of noisy multi-channel reverberant mixtures based on wsj1 and CHiME3 datasets.

License:MITStargazers:0Issues:0Issues:0

E2E-KWS

End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM

Stargazers:0Issues:0Issues:0

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

License:MITStargazers:0Issues:0Issues:0

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

License:Apache-2.0Stargazers:0Issues:0Issues:0

kaldi_rt_decoder

using microphone

License:NOASSERTIONStargazers:0Issues:0Issues:0

KalmanNet_TSP

code for KalmanNet

Stargazers:0Issues:0Issues:0

latex-examples

small (la)tex files showing features, solutions, and attempts

Stargazers:0Issues:0Issues:0

musegan

An AI for Music Generation

License:MITStargazers:0Issues:0Issues:0

OpenChineseLLaMA

Chinese large language model base generated through incremental pre-training on Chinese datasets

License:GPL-3.0Stargazers:0Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

License:MITStargazers:0Issues:0Issues:0

Percepnet-Keras

percepnet implemented using Keras, still need to be optimized and tuned.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Pitch-Tracking

Pitch tracking in real-time with the Kalman filter

Stargazers:0Issues:0Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

License:MITStargazers:0Issues:0Issues:0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image Restoration.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

sound-source-localization-algorithm_DOA_estimation

关于语音信号声源定位DOA估计所用的一些传统算法

License:Apache-2.0Stargazers:0Issues:0Issues:0

Spoken-Keyword-Spotting

In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keyword Spotting task.

License:MITStargazers:0Issues:0Issues:0

ssspy

A Python toolkit for sound source separation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

SummerTTS

SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synthesis(TTS) project that has almost no dependency and could be easily used for Chinese TTS with just one key build out

Stargazers:0Issues:0Issues:0

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

License:MITStargazers:0Issues:0Issues:0

torchiva

Blind source separation with independent vector analysis family of algorithm in torch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Voice2Face

http://www.facegood.cc

License:MITStargazers:0Issues:0Issues:0