
Rui Wang's repositories

AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

Language: TypeScript | License: GPL-3.0 | 0 0 0

audio-dataset

Audio Dataset for training CLAP and other models

Language: Python | 0 0 0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language: Python | License: MIT | 0 0 0

BELLE

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

Language: HTML | License: Apache-2.0 | 0 0 0

chatgpt_academic

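A ChatGPT extension built for scientific research work, specially optimized for polishing academic papers. Supports custom shortcut buttons, custom function plugins, markdown table rendering, dual display of TeX formulas, and full-featured code display. Adds local Python/C++/Go project tree analysis and project source-code self-explanation, plus batch summarization of PDF and Word documents and full-text translation of PDF papers.

Language: Python | License: GPL-3.0 | 0 0 0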

crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Language: Python | License: MIT | 0 0 0

CS-Books

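🔥🔥 More than 1,000 classic computer science books, personal notes, and the resources referenced in the author's articles published on various platforms. Book topics include C/C++, Java, Python, Go, data structures and algorithms, operating systems, backend architecture, computer systems, databases, computer networking, design patterns, front end, assembly, and interview write-ups for both campus and experienced-hire recruiting.

0 0 0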

DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Language: Python | License: MIT | 0 0 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | 0 0 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

License: Apache-2.0 | 0 0 0

diff-svc

Singing Voice Conversion via diffusion model

License: AGPL-3.0 | 0 0 0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

License: Apache-2.0 | 0 0 0

gender-audio-classification

A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.

License: GPL-3.0 | 0 0 0

gomin

GOMIN; Gaudio Open Mel-spectrogram Inversion Network

License: MIT | 0 0 0

gpt-vits

text to speech using decoder-only transformer and VITS

License: MIT | 0 0 0

gpt4free

Decentralising the AI industry: free GPT-4/3.5 scripts through several reverse-engineered APIs (poe.com, phind.com, chat.openai.com, writesonic.com, sqlchat.ai, t3nsor.com, you.com, etc.)

License: GPL-3.0 | 0 0 0

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

0 0 0

learngitthehardway

0 0 0

NeMo

NeMo: a toolkit for conversational AI

License: Apache-2.0 | 0 0 0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

License: BSD-3-Clause | 0 0 0

NYU-DLSP21

NYU Deep Learning Spring 2021

0 0 0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

License: Apache-2.0 | 0 0 0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

License: MIT | 0 0 0

timeismylife.github.io

never's personal website

Language: HTML | 0 1 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

License: Apache-2.0 | 0 0 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

License: Apache-2.0 | 0 0 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 0 0 0

voicebox-pytorch

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

License: MIT | 0 0 0

timeismylife

Repository index: AgentGPT, audio-dataset, Auto-GPT, BELLE, chatgpt_academic, crank, CS-Books, cs-self-learning, DiffGAN-TTS, espnet, DeepSpeed, diff-svc, Diffusion-SVC, dolly, gender-audio-classification, gomin, gpt-vits, gpt4free, Large-Audio-Models, learngitthehardway, NeMo, NExT-GPT, NYU-DLSP21, PaddleSpeech, pyannote-audio, timeismylife.github.io, tortoise-tts, transformers, TTS, voicebox-pytorch