wgsh3706

wgsh3706

Geek Repo

Github PK Tool:Github PK Tool

wgsh3706's repositories

AMFM_decompy

Package containing the tools necessary for decomposing a speech signal into its modulated components (also known as AM-FM decomposition). Includes the algorithms of the QHM family and the YAAPT pitch tracker.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Android-Audio-Processing-Using-WebRTC

All in all WebRTC. A Complete Guide to enable Rich and High Quality of **Real-Time Voice Communication** on Android Platform. This repository involves a complete understanding, implementation and documentation related to WebRTC Audio Processing.

Language:C++Stargazers:0Issues:0Issues:0

crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepFormants

Formant Tracking & Estimation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepSpeech

A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FastCqt

Fast constant-Q transform feature, c++ implement

Language:C++Stargazers:0Issues:0Issues:0

FormantNet

Software and data supporting Lilley & Bunnell (2021) InterSpeech paper

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

omnizart

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Paddle

PArallel Distributed Deep LEarning (『飞桨』核心框架,高性能单机、分布式训练和跨平台部署)

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

penn

Pitch Estimating Neural Networks (PENN)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Parselmouth

Praat in Python, the Pythonic way

License:GPL-3.0Stargazers:0Issues:0Issues:0

pesto-full

Full models and training code for PESTO

License:LGPL-3.0Stargazers:0Issues:0Issues:0
Language:JavaLicense:MITStargazers:0Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

rnnoise

Recurrent neural network for audio noise reduction

Language:CLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

spleeter

Deezer source separation library including pretrained models.

License:MITStargazers:0Issues:0Issues:0

VGG-Speaker-Recognition

Utterance-level Aggregation For Speaker Recognition In The Wild

Language:PythonStargazers:0Issues:0Issues:0
Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0

webrtc-sdk-rnnoise

Enhance real-time communication with our WebRTC SDK integrated with advanced RNNoise technology. Enjoy noise-free audio calls, powered by Chromium open-source code. Flexible API for seamless development, cross-platform compatibility, and improved user experience.

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0