AmorJNYH

AmorJNYH

Geek Repo

0

followers

0

following

0

stars

Github PK Tool:Github PK Tool

AmorJNYH's repositories

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

License:Apache-2.0Stargazers:0Issues:0Issues:0

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

License:MITStargazers:0Issues:0Issues:0

Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

License:MITStargazers:0Issues:0Issues:0

Awesome-Talking-Head-Synthesis

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

Stargazers:0Issues:0Issues:0
License:AGPL-3.0Stargazers:0Issues:0Issues:0

clone-voice

一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Language:PythonStargazers:0Issues:0Issues:0

cutword

一个简单快速的分词、命名实体识别工具

License:Apache-2.0Stargazers:0Issues:0Issues:0

deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

License:NOASSERTIONStargazers:0Issues:0Issues:0

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Stargazers:0Issues:0Issues:0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

License:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

License:MITStargazers:0Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell

License:NOASSERTIONStargazers:0Issues:0Issues:0

pesto

Self-supervised learning for fast pitch estimation

Language:PythonLicense:LGPL-3.0Stargazers:0Issues:0Issues:0

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pheme

VALL-E style models

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0

PitchSqueezer

A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

pretty-midi

Utility functions for handling MIDI data in a nice/intuitive way.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

SECap

音频情感标注

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

tts-frontend-dataset

TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

UTMOS

UT-Sarulab MOS prediction system using SSL models

License:MITStargazers:0Issues:0Issues:0

vallex

代码美化

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vid2densepose

Convert your videos to densepose and use it on MagicAnimate

License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

zhihu-tfm-llm-gpt

:books: 知乎大语言模型、ChatGPT、Transformers问答

Stargazers:0Issues:0Issues:0