Beast code in Giters

jwang1993's starred repositories

Firefly-VQ-GAN

based on fishaudio/vocoder to train firefly-vq-gan

Language:PythonMIT300

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonApache-2.0282600

DDPM-demo

pytorch ddpm demo

Language:Python6100

CharsiuG2P

Multilingual G2P in 100 languages

Language:Jupyter NotebookMIT26800

fish-speech

Brand new TTS solution

Language:PythonNOASSERTION652800

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonApache-2.0195500

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language:PythonMIT256600

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonApache-2.0290500

DIG-In

This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.

Language:PythonNOASSERTION1900

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Language:PythonMIT37900

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookNOASSERTION727300

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language:PythonMIT420200

ChatTTS

A generative speech model for daily dialogue.

Language:PythonAGPL-3.02832400

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language:Jupyter NotebookMIT43300

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT185400

contentvec

speech self-supervised representations

Language:PythonMIT43800

TransferTTS

TransferTTS (Zero-Shot learning of VITS)

Language:PythonMIT8200

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonMIT653300

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT746300

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonApache-2.0465700

notes

:books: 所看所学所记，Python，Go，后端/架构技术，数据分析，机器学习。持续学习中

Language:Python900

ar-vits

text to speech using autoregressive transformer and VITS

Language:PythonMIT21600

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT2987700

open_clip

An open source implementation of CLIP.

Language:PythonNOASSERTION934900

DeepLearing-Interview-Awesome-2024

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓，同时包含工作和科研过程中的新想法、新问题、新资源与新项目

128400

StableStudio

Community interface for generative AI

Language:TypeScriptMIT853500

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonApache-2.0113100

stable-audio-tools

Generative models for conditional audio generation

Language:PythonMIT236800

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03228600

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookApache-2.01254900