jwang1993's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38549 | Issues: 383 | Issues: 1647

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 33121 | Issues: 277 | Issues: 1090

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 31951 | Issues: 195 | Issues: 1147

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 30235 | Issues: 172 | Issues: 493

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 21484 | Issues: 179 | Issues: 460

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12800 | Issues: 168 | Issues: 510

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 11202 | Issues: 159 | Issues: 274

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | Stargazers: 9669 | Issues: 78 | Issues: 461

StableStudio

Community interface for generative AI

Language: TypeScript | License: MIT | Stargazers: 8568 | Issues: 115 | Issues: 75

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python | License: MIT | Stargazers: 7531 | Issues: 81 | Issues: 151

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 7427 | Issues: 88 | Issues: 122

fish-speech

Brand new TTS solution

Language: Python | License: NOASSERTION | Stargazers: 7335 | Issues: 62 | Issues: 326

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python | License: MIT | Stargazers: 6625 | Issues: 54 | Issues: 205

VITS-fast-fine-tuning

This repo provides a VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion

Language: Python | License: Apache-2.0 | Stargazers: 4685 | Issues: 40 | Issues: 565

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language: Python | License: MIT | Stargazers: 4245 | Issues: 43 | Issues: 101

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | Stargazers: 4060 | Issues: 55 | Issues: 88

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language: Python | License: MIT | Stargazers: 2603 | Issues: 29 | Issues: 163

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | Stargazers: 2474 | Issues: 43 | Issues: 81

Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | Stargazers: 1977 | Issues: 49 | Issues: 126

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | License: MIT | Stargazers: 1881 | Issues: 32 | Issues: 162

DeepLearing-Interview-Awesome-2024

A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, also including new ideas, new problems, new resources, and new projects from work and research

minisora

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Language: Python | License: Apache-2.0 | Stargazers: 1149 | Issues: 18 | Issues: 62

contentvec

speech self-supervised representations

Language: Python | License: MIT | Stargazers: 449 | Issues: 11 | Issues: 30

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language: Jupyter Notebook | License: MIT | Stargazers: 449 | Issues: 13 | Issues: 14

audioseal

Localized watermarking for AI-generated speech audio, with SOTA robustness and a very fast detector

Language: Python | License: MIT | Stargazers: 401 | Issues: 15 | Issues: 21

ar-vits

Text-to-speech using an autoregressive transformer and VITS

Language: Python | License: MIT | Stargazers: 221 | Issues: 15 | Issues: 4

TransferTTS

TransferTTS (Zero-Shot learning of VITS)

Language: Python | License: MIT | Stargazers: 83 | Issues: 5 | Issues: 2

DIG-In

This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.

Language: Python | License: NOASSERTION | Stargazers: 19 | Issues: 3 | Issues: 0

notes

:books: Notes on what I have read and learned: Python, Go, backend/architecture technologies, data analysis, machine learning. Still learning.