uang93's repositories
pytorch-pretrained-BERT
📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer-XL.
Anime4K
A High-Quality Real Time Upscaler for Anime Video
bert
TensorFlow code and pre-trained models for BERT
chinese-xinhua
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
chinese_speech_pretrain
chinese speech pretrained models
conv-emotion
This repo contains implementation of different architectures for emotion recognition in conversations
DeOldify
A Deep Learning based project for colorizing and restoring old images (and video!)
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
FastASR
这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。
ganhacks
starter from "How to Train a GAN?" at NIPS2016
Git-Commands
A list of commonly used Git commands
gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
learn-regex
Learn regex the easy way
LibreASR
:speech_balloon: An On-Premises, Streaming Speech Recognition System
models
Models and examples built with TensorFlow
neural_sp
End-to-end ASR/LM implementation with pytorch.
OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
pixel-cnn
Code for the paper "PixelCNN++: A PixelCNN Implementation with Discretized Logistic Mixture Likelihood and Other Modifications"
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
rnn-transducer
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
self-attention-tacotron
An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960
so-vits-svc
SoftVC VITS Singing Voice Conversion
spec_augment
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
unredacter
Never ever ever use pixelation as a redaction technique
voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
xuesebot
一个关于血色衣冠的对话机器人, 基于 Rasa, 可语音与机器人对话