z.q.mao (maozhiqiang)

maozhiqiang

Geek Repo

Github PK Tool:Github PK Tool

z.q.mao's repositories

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Language:PythonLicense:BSD-2-ClauseStargazers:0Issues:1Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

contentvec

speech self-supervised representations

Language:PythonStargazers:0Issues:1Issues:0

Coqui-TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:Jupyter NotebookLicense:MPL-2.0Stargazers:0Issues:1Issues:0

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

DJtransGAN

"Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks", ICASSP 2022

License:MITStargazers:0Issues:0Issues:0

DocProduct

Medical Q&A with Deep Language Models

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

g2pM

A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

hardware_introduction

What scienfitic programmers must know about CPUs and RAM to write fast code.

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

headliner

🏖 Easy training and deployment of seq2seq models.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0
License:MITStargazers:0Issues:0Issues:0

LiveSpeechPortraits

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0

MelGAN-VC

MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

musika

Fast Infinite Waveform Music Generation

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Neural-Style-Transfer-Audio

This is PyTorch Implementation Of Naural Style Transfer Algorithm which is modified for Audios.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

prosody

Helsinki Prosody Corpus and System for Predicting Prosodic Prominence from Text

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Real_Time_Image_Animation

The Project is real time application in opencv using first order model

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

SiFiGAN

Official implementation of the source-filter HiFiGAN vocoder

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

state-spaces

Sequence Modeling with Structured State Spaces

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

License:CC0-1.0Stargazers:0Issues:0Issues:0

TrWebOCR

开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

U-2-Net

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

WindTerm

A quicker and better cross-platform SSH/Sftp/Shell/Telnet/Serial client.

Language:CStargazers:0Issues:1Issues:0