Essen Wilson's repositories
ChatGPT-Virtual-Live
ChatGPT虚拟主播、支持B站、抖音、视频号
ColossalAI
Making large AI models cheaper, faster and more accessible
FastChat
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
MLE-LLaMA
Multi-language Enhanced LLaMA
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
neural_renderer
A PyTorch port of the Neural 3D Mesh Renderer
nuwa-pytorch
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
OpenVoice
Instant voice cloning by MyShell.
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
pdfGPT
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The only open source solution to turn your pdf files in a chatbot!
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
so-vits-svc
SoftVC VITS Singing Voice Conversion
Speech-Resources
语音方向实验室/公司/资源/实习等,欢迎推荐或自荐(排名不分先后)
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
tiny-cuda-nn
Lightning fast C++/CUDA neural network framework
transducer
A Fast Sequence Transducer Implementation with PyTorch Bindings
vid2avatar
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)
VITS-BigVGAN-SpanPSP-Chinese
基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。
voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
VoiceConversionLab
Collect Voice Conversion researches
VPGTrans
Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.