Beast code in Giters

z.q.mao's repositories

ttsGAN-ICLR2019

Language:Python1 10

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonBSD-2-Clause010

bana-tts

Language:Jupyter NotebookMIT010

contentvec

speech self-supervised representations

Language:Python010

Coqui-TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:Jupyter NotebookMPL-2.0010

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language:PythonGPL-3.0010

DJtransGAN

"Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks", ICASSP 2022

MIT000

DocProduct

Medical Q&A with Deep Language Models

Language:Jupyter NotebookMIT020

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Language:PythonMIT010

g2pM

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Language:PythonApache-2.0020

hardware_introduction

What scienfitic programmers must know about CPUs and RAM to write fast code.

Language:Jupyter Notebook010

headliner

🏖 Easy training and deployment of seq2seq models.

Language:PythonNOASSERTION020

KAN-TTS

MIT000

LiveSpeechPortraits

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)

Language:PythonMIT010

melgan-neurips

Language:PythonMIT020

MelGAN-VC

MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms

Language:Jupyter NotebookMIT010

musika

Fast Infinite Waveform Music Generation

Language:PythonMIT010

Neural-Style-Transfer-Audio

This is PyTorch Implementation Of Naural Style Transfer Algorithm which is modified for Audios.

Language:PythonMIT020

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Language:Jupyter NotebookMIT020

prosody

Helsinki Prosody Corpus and System for Predicting Prosodic Prominence from Text

Language:PythonMIT000

Real_Time_Image_Animation

The Project is real time application in opencv using first order model

Language:PythonGPL-3.0010

shell_command

010

SiFiGAN

Official implementation of the source-filter HiFiGAN vocoder

Language:PythonMIT010

SpanPSP

000

state-spaces

Sequence Modeling with Structured State Spaces

Apache-2.0000

StyleGAN2

NOASSERTION000

The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

CC0-1.0000

TrWebOCR

开源易用的中文离线OCR，识别率媲美大厂，并且提供了易用的web页面及web的接口，方便人类日常工作使用或者其他程序来调用~

Language:PythonApache-2.0010

U-2-Net

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Language:PythonApache-2.0010

WindTerm

A quicker and better cross-platform SSH/Sftp/Shell/Telnet/Serial client.

Language:C010