Rui Wang's repositories
AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
audio-dataset
Audio Dataset for training CLAP and other models
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
chatgpt_academic
科研工作专用ChatGPT拓展,特别优化学术Paper润色体验,支持自定义快捷按钮,支持自定义函数插件,支持markdown表格显示,Tex公式双显示,代码显示功能完善,新增本地Python/C++/Go项目树剖析功能/项目源代码自译解能力,新增PDF和Word文献批量总结功能/PDF论文全文翻译功能
crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
cs-self-learning
计算机自学指南
DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
espnet
End-to-End Speech Processing Toolkit
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
diff-svc
Singing Voice Conversion via diffusion model
dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
gender-audio-classification
A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.
gomin
GOMIN; Gaudio Open Mel-spectrogram Inversion Network
gpt-vits
text to speech using decoder-only transformer and VITS
gpt4free
decentralising the Ai Industry, free gpt-4/3.5 scripts through several reverse engineered api's ( poe.com, phind.com, chat.openai.com, phind.com, writesonic.com, sqlchat.ai, t3nsor.com, you.com etc...)
Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
NeMo
NeMo: a toolkit for conversational AI
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
NYU-DLSP21
NYU Deep Learning Spring 2021
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
timeismylife.github.io
never的个人网站
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch