Yuhang's repositories

SpeechAlgorithms

Speech Algorithms

Language:CLicense:Apache-2.0Stargazers:1Issues:0Issues:0

APNet2

Source code of APNet2, a vocoder

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome

😎 Awesome lists about all kinds of interesting topics

License:CC0-1.0Stargazers:0Issues:0Issues:0

Awesome-GPT-Store

A collection of major GPTS available in public

License:MITStargazers:0Issues:0Issues:0

ChatWaifu-marai

About Combined ChatGPT with Moegoe TTS to create a Chatting Waifu for Marai

License:MITStargazers:0Issues:0Issues:0

CyberWaifu

GPT + Tacotron2/VITS + Live2D = CyberWaifu

License:MITStargazers:0Issues:0Issues:0

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

License:MITStargazers:0Issues:0Issues:0

Free-Certifications

A curated list of free courses & certifications.

License:MITStargazers:0Issues:0Issues:0

g2p-zh-en

Chinese and English Bilinguish G2P

License:NOASSERTIONStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

GPT-vup

GPT-vup BIliBili | 抖音 | AI | 虚拟主播

Stargazers:0Issues:0Issues:0

hackingtool

ALL IN ONE Hacking Tool For Hackers

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

IP_LAP

CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LiveWhisper

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

megatts2

Unoffical implement of Megatts2

License:MITStargazers:0Issues:0Issues:0

mustango

Mustango: Toward Controllable Text-to-Music Generation

License:MITStargazers:0Issues:0Issues:0

OpenPhonemizer

Permissively licensed, open sourced, local IPA Phonemizer (G2P) powered by deep learning.

License:BSD-3-Clause-ClearStargazers:0Issues:0Issues:0

RefAudioEmoTagger

一种基于Emotion2Vec的批量音频情感自动标注脚本

License:GPL-3.0Stargazers:0Issues:0Issues:0

roop

one-click face swap

License:GPL-3.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

speech-synthesis-paper

List of speech synthesis papers.

License:MITStargazers:0Issues:0Issues:0

speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

SpEx_Plus

SpEx+(tied) source code

License:MITStargazers:0Issues:0Issues:0

stable-audio-tools

Generative models for conditional audio generation

License:MITStargazers:0Issues:0Issues:0

stable-speech

Reproduction of Stability AI's Text-to-Speech model.

License:Apache-2.0Stargazers:0Issues:0Issues:0

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

License:MITStargazers:0Issues:0Issues:0

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Language:PythonStargazers:0Issues:0Issues:0

w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

License:MITStargazers:0Issues:0Issues:0

wukong-robot

🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。

License:MITStargazers:0Issues:0Issues:0