wangtao (hairuo55)

hairuo55

Geek Repo

Company:NLPR

Github PK Tool:Github PK Tool

wangtao's starred repositories

prompt-in-context-learning

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

Language:Jupyter NotebookLicense:MITStargazers:1438Issues:0Issues:0

Awesome-instruction-tuning

A curated list of awesome instruction tuning datasets, models, papers and repositories.

Language:PythonLicense:Apache-2.0Stargazers:281Issues:0Issues:0

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:3006Issues:0Issues:0

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonLicense:BSD-3-ClauseStargazers:3641Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27448Issues:0Issues:0

Whispering-LLaMA

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Language:Jupyter NotebookLicense:MITStargazers:209Issues:0Issues:0

Hypo2Trans

Single-blind supplementary materials for NeurIPS 2023 submission

Language:PythonLicense:MITStargazers:52Issues:0Issues:0

metavoice-src

Foundational model for human-like, expressive TTS

Language:PythonLicense:Apache-2.0Stargazers:3546Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20263Issues:0Issues:0

audio-dataset

Audio Dataset for training CLAP and other models

Language:PythonStargazers:605Issues:0Issues:0

tortoise-tts-fast

Fast TorToiSe inference (5x or your money back!)

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:759Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32155Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:7454Issues:0Issues:0

StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Language:PythonLicense:MITStargazers:471Issues:0Issues:0

SpecVQGAN

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

Language:Jupyter NotebookLicense:MITStargazers:334Issues:0Issues:0
Language:PythonStargazers:36Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:2331Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33909Issues:0Issues:0

UniAudio

The official source code of UniAudio

Language:PythonStargazers:78Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54327Issues:0Issues:0

ACL2023-Retrieval-LM.github.io

https://acl2023-retrieval-lm.github.io/

Language:JavaScriptStargazers:149Issues:0Issues:0

BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonLicense:Apache-2.0Stargazers:6524Issues:0Issues:0

wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Language:PythonLicense:Apache-2.0Stargazers:601Issues:0Issues:0

PyTSMod

An open-source Python library for audio time-scale modification.

Language:PythonLicense:GPL-3.0Stargazers:187Issues:0Issues:0

RHVoice

a free and open source speech synthesizer for Russian and other languages

Language:C++License:GPL-2.0Stargazers:1478Issues:0Issues:0

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:11166Issues:0Issues:0

TTS-Portuguese-Corpus

Open Source Text-To-Speech Portuguese Dataset

License:CC-BY-4.0Stargazers:147Issues:0Issues:0

bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

Language:PythonLicense:MITStargazers:618Issues:0Issues:0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonLicense:NOASSERTIONStargazers:8562Issues:0Issues:0