Beast code in Giters

jwang1993's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | 38549 383 1647

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 33121 277 1090

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | 31951 195 1147

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | 30235 172 493

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | 21484 179 460

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | 12800 168 510

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model). We hope the open source community will contribute to this project.

Language: Python | License: MIT | 11202 159 274

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | 9669 78 461

StableStudio

Community interface for generative AI

Language: TypeScript | License: MIT | 8568 115 75

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python | License: MIT | 7531 81 151

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook | License: NOASSERTION | 7427 88 122

fish-speech

Brand new TTS solution

Language: Python | License: NOASSERTION | 7335 62 326

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python | License: MIT | 6625 54 205

VITS-fast-fine-tuning

This repo is a VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion

Language: Python | License: Apache-2.0 | 4685 40 565

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language: Python | License: MIT | 4245 43 101

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | 4060 55 88

whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language: Python | License: MIT | 2603 29 163

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | 2474 43 81

Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

License: MIT | 2063 72 7

vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | 1977 49 126

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | License: MIT | 1881 32 162

DeepLearing-Interview-Awesome-2024

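A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, which also includes new ideas, new problems, new resources, and new projects from work and research.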

1469 240

minisora

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Language: Python | License: Apache-2.0 | 1149 18 62

contentvec

speech self-supervised representations

Language: Python | License: MIT | 449 11 30

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language: Jupyter Notebook | License: MIT | 449 13 14

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Language: Python | License: MIT | 401 15 21

ar-vits

Text-to-speech using an autoregressive transformer and VITS

Language: Python | License: MIT | 221 15 4

TransferTTS

TransferTTS (Zero-Shot learning of VITS)

Language: Python | License: MIT | 83 5 2

DIG-In

This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.

Language: Python | License: NOASSERTION | 19 30

notes

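:books: Notes on what I have read and learned: Python, Go, backend/architecture technologies, data analysis, and machine learning. Continuously learning.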

Language: Python | 9 2 119