yzyouzhang

You Zhang's starred repositories

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.032251 273 1068

paper-reading

深度学习经典、新论文逐段精读

Apache-2.025146 7060

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonMIT19249 297 1339

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language:HTMLMIT10512 266 45

gdrive

Google Drive CLI Client

Language:GoMIT8995 223 594

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language:PythonApache-2.05054 31 52

improved-diffusion

Release for Improved Denoising Diffusion Probabilistic Models

Language:PythonMIT3052 124 127

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonMIT1883 40 43

ai-audio-startups

Community list of startups working with AI in audio and music technology

Apache-2.01504 68 5

Awesome-Implicit-NeRF-Robotics

A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites

1194 77 1

AD-NeRF

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Language:PythonMIT1009 16 138

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonMIT871 23 32

IguanaTex

A PowerPoint add-in allowing you to insert LaTeX equations into PowerPoint presentations on Windows and Mac

Language:VBANOASSERTION808 14 67

AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Language:PythonNOASSERTION509 34 27

Speech-Resources

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐

458 20 16

DFRF

[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".

Language:PythonMIT335 10 37

sound-spaces

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

Language:PythonCC-BY-4.0328 16 138

BeatNet

BeatNet is state-of-the-art (Real-Time) and Offline joint music beat, downbeat, tempo, and meter tracking system using CRNN and particle filtering. (ISMIR 2021's paper implementation).

Language:PythonCC-BY-4.0308 9 26