Beast code in Giters

wgsh3706's repositories

AMFM_decompy

Package containing the tools necessary for decomposing a speech signal into its modulated components (also known as AM-FM decomposition). Includes the algorithms of the QHM family and the YAAPT pitch tracker.

Language:PythonMIT000

Android-Audio-Processing-Using-WebRTC

All in all WebRTC. A Complete Guide to enable Rich and High Quality of **Real-Time Voice Communication** on Android Platform. This repository involves a complete understanding, implementation and documentation related to WebRTC Audio Processing.

Language:C++000

crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Language:PythonMIT000

DeepFormants

Formant Tracking & Estimation

Language:PythonMIT000

DeepSpeech

A PaddlePaddle implementation of DeepSpeech2 architecture for ASR.

Language:PythonApache-2.0000

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Language:PythonMIT000

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0000

FastCqt

Fast constant-Q transform feature, c++ implement

Language:C++000

FormantNet

Software and data supporting Lilley & Bunnell (2021) InterSpeech paper

Language:Jupyter NotebookMIT000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT000

omnizart

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

Language:PythonMIT000

Paddle

PArallel Distributed Deep LEarning （『飞桨』核心框架，高性能单机、分布式训练和跨平台部署）

Language:C++Apache-2.0000

penn

Pitch Estimating Neural Networks (PENN)

Language:PythonMIT000

Parselmouth

Praat in Python, the Pythonic way

GPL-3.0000

pesto-full

Full models and training code for PESTO

LGPL-3.0000

Real-Time-Convolutional-Neural-Network-Based-Speech-Source-Localization-on-Smartphone

speech source localization on phone/pad

Language:JavaMIT000

REAPER

Language:C++Apache-2.0000

rnnoise

Recurrent neural network for audio noise reduction

Language:CBSD-3-Clause000

Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Language:PythonApache-2.0000

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonApache-2.0000

spleeter

Deezer source separation library including pretrained models.

MIT000

VGG-Speaker-Recognition

Utterance-level Aggregation For Speaker Recognition In The Wild

Language:Python000

webrtc-sdk

Language:C++BSD-3-Clause000

webrtc-sdk-rnnoise

Enhance real-time communication with our WebRTC SDK integrated with advanced RNNoise technology. Enjoy noise-free audio calls, powered by Chromium open-source code. Flexible API for seamless development, cross-platform compatibility, and improved user experience.

Language:C++BSD-3-Clause000