Zhen Zeng (zceng)

Company: miHoYo

Location: Shanghai

Zhen Zeng's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 66109 | Issues: 556 | Issues: 697

awesome-chatgpt-prompts-zh

A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you ask.

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | Stargazers: 33318 | Issues: 308 | Issues: 418

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 29549 | Issues: 423 | Issues: 4151

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language: Python | License: NOASSERTION | Stargazers: 13880 | Issues: 285 | Issues: 319

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Jupyter Notebook | License: MIT | Stargazers: 10868 | Issues: 97 | Issues: 333

awesome-chatgpt-zh

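ChatGPT Chinese guide 🔥: a Chinese-language ChatGPT prompting guide, instruction guide, application development guide, and curated resource list. Use ChatGPT better and boost your productivity! 🚀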

Language: Python | License: MIT | Stargazers: 9945 | Issues: 105 | Issues: 13

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python | License: Apache-2.0 | Stargazers: 6286 | Issues: 68 | Issues: 500

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 6140 | Issues: 43 | Issues: 81

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language: Jupyter Notebook | License: MIT | Stargazers: 5475 | Issues: 76 | Issues: 213

pedalboard

🎛 🔊 A Python library for audio.

Language: C++ | License: GPL-3.0 | Stargazers: 4933 | Issues: 58 | Issues: 166

MNBVC

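MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus that aims to match the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.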

HarvestText

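A text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.) based on unsupervised or weakly supervised methods.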

Language: Python | License: MIT | Stargazers: 2329 | Issues: 55 | Issues: 46

MoeGoe

Executable file for VITS inference

Language: Python | License: MIT | Stargazers: 2300 | Issues: 16 | Issues: 41

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

speech-synthesis-paper

List of speech synthesis papers.

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language: Python | License: MIT | Stargazers: 891 | Issues: 11 | Issues: 104

zhvoice

Chinese voice corpus. Clearer, more natural-sounding Chinese speech data drawn from 8 open-source datasets: 3,200 speakers, 900 hours of speech, and 13 million characters.

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

WeTextProcessing

Text Normalization & Inverse Text Normalization

Language: Python | License: Apache-2.0 | Stargazers: 385 | Issues: 10 | Issues: 91

torchcrepe

PyTorch implementation of the CREPE pitch tracker

Language: Python | License: MIT | Stargazers: 381 | Issues: 9 | Issues: 26

charsiu

Charsiu: A neural phonetic aligner.

Language: Jupyter Notebook | License: MIT | Stargazers: 256 | Issues: 8 | Issues: 17

CharsiuG2P

Multilingual G2P in 100 languages

Language: Jupyter Notebook | License: MIT | Stargazers: 256 | Issues: 10 | Issues: 10

bddm

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

Language: Python | License: Apache-2.0 | Stargazers: 215 | Issues: 9 | Issues: 6

Awesome-Speech-Pretraining

Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.

DailyTalk

Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 (Oral)

Language: Python | License: MIT | Stargazers: 184 | Issues: 7 | Issues: 3

CPED

CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI

Language: Python | License: Apache-2.0 | Stargazers: 184 | Issues: 4 | Issues: 6

DiffWave-Vocoder

PyTorch reimplementation of the DiffWave vocoder: a high-quality, fast, and small neural vocoder.

Language: Python | License: MIT | Stargazers: 85 | Issues: 5 | Issues: 0

WaveODE

An ODE-based generative neural vocoder using Rectified Flow