Beast code in Giters

fb029ed's repositories

yolov5_cpp_openvino

用c++实现了yolov5使用openvino的部署

Language:C++Apache-2.0266 6 23

scrcpy-opencv-SQ

使用c++对scrcpy进行重构,提供opencv Mat图像,便于二次开发,提供了智慧树知到的自动刷课脚本．

Language:C++Apache-2.022 1 1

asv-subtools

An Open Source Tools for Speaker Recognition

Language:PythonApache-2.0100

fb029ed

100

adversarial-disentangling-autoencoder-for-spk-representation

Software presented in the article "Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation".

Language:Python000

auorange

Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet

Language:PythonApache-2.0000

CLUB

Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Language:Jupyter Notebook000

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Apache-2.0000

ConvS2S-VC

Language:Python000

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

MIT000

g2p

g2p: English Grapheme To Phoneme Conversion

Language:PythonApache-2.0000

gmm-torch

Gaussian mixture models in PyTorch.

Language:PythonMIT000

GST-Tacotron

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

MIT000

leetcode-master

《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀

000

lidbox

End-to-end spoken language identification out of the box. Rewrite in progress for first release (version 1).

MIT000

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

NOASSERTION000

NeMo

NeMo: a toolkit for conversational AI

Apache-2.0000

openTSNE

Extensible, parallel implementations of t-SNE

BSD-3-Clause000

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookMIT000

phonemizer

Simple text to phones converter for multiple languages

GPL-3.0000

PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

MIT000

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

MIT000

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

NOASSERTION000

SC-WaveRNN

Official PyTorch implementation of Speaker Conditional WaveRNN

000

snowfall

Apache-2.0000

STL

The ITU-T Software Tool Library (G.191)

NOASSERTION000

TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

NOASSERTION000

VQMIVC

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021

MIT000

WaveRNN

WaveRNN Vocoder + TTS

MIT000

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Apache-2.0000