powei-C's repositories

STC

Submit to ICASSP 2023. Accepted.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Adversarial-Many-to-Many-VC

[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

albert-chinese-ner

使用预训练语言模型ALBERT做中文NER

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

albert_zh

A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型

Language:PythonStargazers:0Issues:0Issues:0

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0

This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-parallel training data".

Stargazers:0Issues:0Issues:0

Emovox

This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".

Stargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

fast-transformers

Pytorch library for fast transformer implementations

Stargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

MaskCycleGAN-VC

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

MQTTS

RVQ-based TTS

License:MITStargazers:0Issues:0Issues:0

nnsvs

Neural network-based singing voice synthesis library for research

License:MITStargazers:0Issues:0Issues:0

ParallelWaveGAN-VC

Unofficial Parallel WaveGAN VC with Pytorch

License:MITStargazers:0Issues:0Issues:0

randomCNN-voice-transfer

Audio style transfer with shallow random parameters CNN. Result: https://soundcloud.com/mazzzystar/sets/speech-conversion-sample

Stargazers:0Issues:0Issues:0

roberta_zh

RoBERTa中文预训练模型: RoBERTa for Chinese

Stargazers:0Issues:0Issues:0

Singing-Voice-Vocoder

PyTorch Implementation of Multi-Singer (ACM-MM'21)

License:MITStargazers:0Issues:0Issues:0

Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT

This is the implementation of the paper "Converting anyone's emotion: towards speaker-independent emotional voice conversion".

Stargazers:0Issues:0Issues:0

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

License:MITStargazers:0Issues:0Issues:0

Swin-Transformer-Object-Detection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

WGANSing

Multi-voice singing voice synthesis

Stargazers:0Issues:0Issues:0