powei-C's repositories

STC

Submitted to ICASSP 2023; accepted.

adaptive_voice_conversion

Language: Python; License: Apache-2.0

Adversarial-Many-to-Many-VC

[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna

Language: Python; License: NOASSERTION

albert-chinese-ner

Chinese named entity recognition (NER) using the pre-trained language model ALBERT

Language: Python; License: MIT

albert_zh

A Lite BERT for Self-Supervised Learning of Language Representations; large-scale pre-trained Chinese ALBERT models

Language: Python

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

Language: Python; License: GPL-3.0

autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Language: Python; License: MIT

Disentangle-VAE-for-VC

Language: Python

DM2020-Lab1-Homework1

Language: Jupyter Notebook

DM2020-Lab1-Master

Language: Jupyter Notebook

DM2020-Lab2-Homework

Language: Jupyter Notebook

DM2020-Lab2-Master

Language: Jupyter Notebook

emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0

This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-parallel training data".

Language: Python

Emovox

This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".

espnet

End-to-End Speech Processing Toolkit

License: Apache-2.0

fast-transformers

PyTorch library for fast transformer implementations

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python; License: MIT

KConverter

MaskCycleGAN-VC

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

Language: Python; License: MIT

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Language: Jupyter Notebook; License: BSD-3-Clause

MQTTS

RVQ-based TTS

Language: Python; License: MIT

nnsvs

Neural network-based singing voice synthesis library for research

Language: Python; License: MIT

ParallelWaveGAN-VC

Unofficial Parallel WaveGAN VC with PyTorch

Language: Jupyter Notebook; License: MIT

randomCNN-voice-transfer

Audio style transfer with a shallow CNN using random parameters. Result: https://soundcloud.com/mazzzystar/sets/speech-conversion-sample

Language: Python

roberta_zh

RoBERTa for Chinese: pre-trained Chinese RoBERTa models

Language: Python

Singing-Voice-Vocoder

PyTorch Implementation of Multi-Singer (ACM-MM'21)

Language: Python; License: MIT

Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT

This is the implementation of the paper "Converting anyone's emotion: towards speaker-independent emotional voice conversion".

Language: Python

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language: Python; License: MIT

Swin-Transformer-Object-Detection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Language: Python; License: Apache-2.0

WGANSing

Multi-voice singing voice synthesis

Language: Python