jwang1993's starred repositories

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:4303Issues:0Issues:0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonLicense:Apache-2.0Stargazers:1926Issues:0Issues:0

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language:PythonLicense:MITStargazers:2552Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:2860Issues:0Issues:0

DIG-In

This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.

Language:PythonLicense:NOASSERTIONStargazers:19Issues:0Issues:0

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Language:PythonLicense:MITStargazers:354Issues:0Issues:0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7222Issues:0Issues:0

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language:PythonLicense:MITStargazers:4180Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:NOASSERTIONStargazers:27122Issues:0Issues:0

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language:Jupyter NotebookLicense:MITStargazers:418Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:1825Issues:0Issues:0

contentvec

speech self-supervised representations

Language:PythonLicense:MITStargazers:434Issues:0Issues:0

TransferTTS

TransferTTS (Zero-Shot learning of VITS)

Language:PythonLicense:MITStargazers:82Issues:0Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonLicense:MITStargazers:6480Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:7415Issues:0Issues:0

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonLicense:Apache-2.0Stargazers:4635Issues:0Issues:0

notes

:books: 所看所学所记,Python,Go,后端/架构技术,数据分析,机器学习。持续学习中

Language:PythonStargazers:8Issues:0Issues:0

ar-vits

text to speech using autoregressive transformer and VITS

Language:PythonLicense:MITStargazers:210Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:28656Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9117Issues:0Issues:0

DeepLearing-Interview-Awesome-2024

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目

Stargazers:1168Issues:0Issues:0

StableStudio

Community interface for generative AI

Language:TypeScriptLicense:MITStargazers:8500Issues:0Issues:0

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonLicense:Apache-2.0Stargazers:1103Issues:0Issues:0

stable-audio-tools

Generative models for conditional audio generation

Language:PythonLicense:MITStargazers:2263Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:31817Issues:0Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:12354Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38312Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20258Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:Apache-2.0Stargazers:10881Issues:0Issues:0

Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

License:MITStargazers:1970Issues:0Issues:0