zceng

followers

following

stars

miHoYo

Shanghai

Zhen Zeng's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION66109 556 697

awesome-chatgpt-prompts-zh

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

MIT51028 356 92

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT33318 308 418

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT29549 423 4151

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language:PythonNOASSERTION13880 285 319

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookMIT10868 97 333

awesome-chatgpt-zh

ChatGPT 中文指南🔥，ChatGPT 中文调教指南，指令指南，应用开发指南，精选资源清单，更好的使用 chatGPT 让你的生产力 up up up! 🚀

Language:PythonMIT9945 105 13

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonApache-2.06286 68 500

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language:PythonApache-2.06140 43 81

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language:Jupyter NotebookMIT5475 76 213

pedalboard

🎛 🔊 A Python library for audio.

Language:C++GPL-3.04933 58 166

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

HarvestText

文本挖掘和预处理工具（文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等），无监督或弱监督方法

Language:PythonMIT2329 55 46

MoeGoe

Executable file for VITS inference

Language:PythonMIT2300 16 41

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

speech-synthesis-paper

List of speech synthesis papers.

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonMIT891 11 104

zhvoice

Chinese voice corpus. 中文语音语料，语音更加清晰自然，包含8个开源数据集，3200个说话人，900小时语音，1300万字。

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

WeTextProcessing

Text Normalization & Inverse Text Normalization

Language:PythonApache-2.0385 10 91

torchcrepe

Pytorch implementation of the CREPE pitch tracker

Language:PythonMIT381 9 26

charsiu

Charsiu: A neural phonetic aligner.

Language:Jupyter NotebookMIT256 8 17

CharsiuG2P

Multilingual G2P in 100 languages

Language:Jupyter NotebookMIT256 10 10

bddm

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

Language:PythonApache-2.0215 9 6

Awesome-Speech-Pretraining

Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.

DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 (Oral)

Language:PythonMIT184 7 3

CPED

CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | 中文个性情感对话数据集

Language:PythonApache-2.0184 4 6

Automatic-Prosody-Annotation

Language:Python109 3 5

DiffWave-Vocoder

Pytorch Reimplementation of DiffWave Vocoder: a high quality, fast, and small neural vocoder.

Language:PythonMIT85 50

WaveODE

An ODE-based generative neural vocoder using Rectified Flow

Language:Python54 8 4