indiejoseph

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT4072 54 116

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language:PythonMIT3861 39 120

Style-Transfer-in-Text

Paper List for Style Transfer in Text

1594 73 13

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据！

Language:PythonApache-2.01319 12 118

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookApache-2.01212 9 115

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonApache-2.0857 11 26

wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.

Language:PythonMIT741 18 37

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Language:PythonMIT703 8 20

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Language:PythonAGPL-3.0565 14 88

LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language:PythonMIT528 9 33

cosmopedia

Language:PythonApache-2.0280 11 5

QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

Language:PythonMIT202 22 19

ChatAlpaca

A Multi-Turn Dialogue Corpus based on Alpaca Instructions

Language:PythonApache-2.0153 4 2

LoRD

Low-Rank adapter extraction for fine-tuned transformers model

Language:Jupyter NotebookApache-2.0147 20

amber-train

Pre-training code for Amber 7B LLM

Language:PythonApache-2.0138 8 5

DDDM-VC

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

Language:Python131 15 12

honcho

Platform for building personalized AI applications

Language:PythonAGPL-3.0106 2 1

MiniMA

Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"

Language:PythonApache-2.088 3 5

hf-rvc

Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.

Language:PythonMIT55 6 5

fastbm25

The fast python bm25 algorithm implemented with reverted index

Language:PythonApache-2.035 1 1

Noise-Contrastive-Alignment

Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards"

MIT1400

EncT5

Implementation of EncT5 (https://arxiv.org/abs/2110.08426)

Language:PythonApache-2.0600

tts_rvc

A system that integrates Microsoft's Edge text-to-speech (TTS) engine with Retrieval-Based Voice Conversion (Voice Cloning) technology for generating unique voices.

Language:PythonMIT100