WhiteFu

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonApache-2.01433 26 24

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

Language:TypeScriptMIT1402 28 179

lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language:PythonApache-2.0968 12 74

tortoise-tts-fast

Fast TorToiSe inference (5x or your money back!)

Language:Jupyter NotebookAGPL-3.0741 27 121

language-detection

This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)

Language:Java715 42 94

LongNet

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Language:PythonApache-2.0656 18 21

bark.cpp

Suno AI's Bark model in C/C++ for fast text-to-speech

Language:C++MIT605 34 72

UltraSinger

AI based tool to convert vocals lyrics and pitch from music to autogenerate Ultrastar Deluxe, Midi and notes. It automatic tapping, adding text, pitch vocals and creates karaoke files.

Language:PythonMIT219 18 83

DL-Art-School

TorToiSe fine-tuning with DLAS

Language:PythonAGPL-3.0205 15 61

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

MIT182 8 1

Easy-Translate

Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible for beginners and as seamlesscustomizable and as possible for advanced users.

Language:PythonApache-2.0168 9 8

RecAlgorithm

主流推荐系统Rank算法的实现

Language:PythonBSD-2-Clause153 6 3

SC_VALL-E

Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E

Language:PythonMIT132 7 1

UnitSpeech

An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"

Language:Jupyter NotebookNOASSERTION122 11 8

tortoise-tts-fastest

Faster Tortoise inference then Tortoise Fast Fork

Language:Jupyter NotebookAGPL-3.0116 2 10

PolyLangVITS

Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)

Language:PythonMIT70 5 3

laughter-synthesis

Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" accepted by INTERSPEECH 2023.

Language:PythonMIT63 4 4