Beast code in Giters

wangtao's starred repositories

prompt-in-context-learning

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

Language:Jupyter NotebookMIT143800

Awesome-instruction-tuning

A curated list of awesome instruction tuning datasets, models, papers and repositories.

Language:PythonApache-2.028100

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonApache-2.0300600

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonBSD-3-Clause364100

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT2744800

Whispering-LLaMA

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Language:Jupyter NotebookMIT20900

Hypo2Trans

Single-blind supplementary materials for NeurIPS 2023 submission

Language:PythonMIT5200

metavoice-src

Foundational model for human-like, expressive TTS

Language:PythonApache-2.0354600

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT2026300

audio-dataset

Audio Dataset for training CLAP and other models

Language:Python60500

tortoise-tts-fast

Fast TorToiSe inference (5x or your money back!)

Language:Jupyter NotebookAGPL-3.075900

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03215500

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT745400

StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Language:PythonMIT47100

SpecVQGAN

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

Language:Jupyter NotebookMIT33400

TiCodec

Language:Python3600

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonMIT233100

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT3390900

UniAudio

The official source code of UniAudio

Language:Python7800

llama

Inference code for Llama models

Language:PythonNOASSERTION5432700

ACL2023-Retrieval-LM.github.io

https://acl2023-retrieval-lm.github.io/

Language:JavaScript14900

BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonApache-2.0652400

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Language:PythonApache-2.060100

PyTSMod

An open-source Python library for audio time-scale modification.

Language:PythonGPL-3.018700

Brazilian-Portuguese

300

RHVoice

a free and open source speech synthesizer for Russian and other languages

Language:C++GPL-2.0147800

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookMIT1116600

TTS-Portuguese-Corpus

Open Source Text-To-Speech Portuguese Dataset

CC-BY-4.014700

bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

Language:PythonMIT61800

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonNOASSERTION856200

hairuo55