wangtao's starred repositories
prompt-in-context-learning
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
Awesome-instruction-tuning
A curated list of awesome instruction tuning datasets, models, papers and repositories.
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
Hypo2Trans
Single-blind supplementary materials for NeurIPS 2023 submission
metavoice-src
Foundational model for human-like, expressive TTS
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
audio-dataset
Audio Dataset for training CLAP and other models
tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
ACL2023-Retrieval-LM.github.io
https://acl2023-retrieval-lm.github.io/
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
TTS-Portuguese-Corpus
Open Source Text-To-Speech Portuguese Dataset
bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.