i-MaTh's repositories
3dv_tutorial
An Invitation to 3D Vision: A Tutorial for Everyone
Algorithm
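Implementations of some commonly used algorithms (covering common data structures, machine learning, and algorithms frequently used in speech recognition)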
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
learning-dl
learning and understanding deep learning
china_city_dataset
** city dataset
city_json
** city JSON, plus Hong Kong, Macau, Taiwan, and world city JSON
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
cs-self-learning
A self-study guide to computer science
lstmp.pytorch
An implementation of LSTM with a projection layer in PyTorch
maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
multiband-hifigan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
NCE
Yingshi New Concept English
NeuralVoicePuppetry
This GitHub repository contains the network architectures of NeuralVoicePuppetry.
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
tacotron
PyTorch implementation of Tacotron and Tacotron2
tensorflow
An Open Source Machine Learning Framework for Everyone
TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, Korean, Chinese, and German, and is easy to adapt to other languages)
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.