fb029ed's repositories
yolov5_cpp_openvino
A C++ implementation of YOLOv5 deployment using OpenVINO.
scrcpy-opencv-SQ
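A C++ refactoring of scrcpy that exposes frames as OpenCV Mat images for easier downstream development; includes a script that automatically plays through courses on Zhihuishu (智慧树知到).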
asv-subtools
Open-source tools for speaker recognition
adversarial-disentangling-autoencoder-for-spk-representation
Software presented in the article "Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation".
auorange
Audio LPC (linear predictive coding) using mel spectrograms, compatible with LPCNet
CLUB
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
g2p
g2p: English Grapheme To Phoneme Conversion
gmm-torch
Gaussian mixture models in PyTorch.
GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
leetcode-master
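"Code Thoughts" (《代码随想录》) LeetCode practice guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, more than 50 mind maps, and solutions in C++, Java, Python, Go, JavaScript, and other languages. No more feeling lost when learning algorithms! 🔥🔥 Take a look; you will wish you had found it sooner! 🚀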
lidbox
End-to-end spoken language identification out of the box. Rewrite in progress for first release (version 1).
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
NeMo
NeMo: a toolkit for conversational AI
openTSNE
Extensible, parallel implementations of t-SNE
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
phonemizer
Simple text to phones converter for multiple languages
PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
SC-WaveRNN
Official PyTorch implementation of Speaker Conditional WaveRNN
STL
The ITU-T Software Tool Library (G.191)
TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer-based neural network for text to speech.
VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021
WaveRNN
WaveRNN Vocoder + TTS
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit