WhiteFu's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLLicense:CC0-1.0Stargazers:105814Issues:1383Issues:0

manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/

Language:PythonLicense:GPL-3.0Stargazers:4496Issues:41Issues:479

LLaMA-Efficient-Tuning

Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)

Language:PythonLicense:Apache-2.0Stargazers:3511Issues:34Issues:696

CTranslate2

Fast inference engine for Transformer models

Language:C++License:MITStargazers:2948Issues:56Issues:646

poe-api

[UNMAINTAINED] A reverse engineered Python API wrapper for Quora's Poe, which provides free access to ChatGPT, GPT-4, and Claude.

Language:PythonLicense:GPL-3.0Stargazers:2497Issues:34Issues:200

TigerBot

TigerBot: A multi-language multi-task LLM

Language:PythonLicense:Apache-2.0Stargazers:2221Issues:31Issues:125

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonLicense:Apache-2.0Stargazers:1433Issues:26Issues:24

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

Language:TypeScriptLicense:MITStargazers:1402Issues:28Issues:179

lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language:PythonLicense:Apache-2.0Stargazers:968Issues:12Issues:74

tortoise-tts-fast

Fast TorToiSe inference (5x or your money back!)

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:741Issues:27Issues:121

language-detection

This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)

LongNet

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Language:PythonLicense:Apache-2.0Stargazers:656Issues:18Issues:21

bark.cpp

Suno AI's Bark model in C/C++ for fast text-to-speech

Language:C++License:MITStargazers:605Issues:34Issues:72

UltraSinger

AI based tool to convert vocals lyrics and pitch from music to autogenerate Ultrastar Deluxe, Midi and notes. It automatic tapping, adding text, pitch vocals and creates karaoke files.

Language:PythonLicense:MITStargazers:219Issues:18Issues:83

DL-Art-School

TorToiSe fine-tuning with DLAS

Language:PythonLicense:AGPL-3.0Stargazers:205Issues:15Issues:61

ai-audio-datasets-list

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

Easy-Translate

Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible for beginners and as seamlesscustomizable and as possible for advanced users.

Language:PythonLicense:Apache-2.0Stargazers:168Issues:9Issues:8

RecAlgorithm

主流推荐系统Rank算法的实现

Language:PythonLicense:BSD-2-ClauseStargazers:153Issues:6Issues:3

SC_VALL-E

Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E

Language:PythonLicense:MITStargazers:132Issues:7Issues:1

UnitSpeech

An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:122Issues:11Issues:8

tortoise-tts-fastest

Faster Tortoise inference then Tortoise Fast Fork

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:116Issues:2Issues:10

PolyLangVITS

Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)

Language:PythonLicense:MITStargazers:70Issues:5Issues:3

laughter-synthesis

Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" accepted by INTERSPEECH 2023.

Language:PythonLicense:MITStargazers:63Issues:4Issues:4

EasyLLM

make LLM easier to use

Language:PythonStargazers:58Issues:2Issues:0

PhoneLM

(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.

Language:Jupyter NotebookLicense:MITStargazers:45Issues:9Issues:0

NSF-BigVGAN

BigVGAN with Neural Source-Filter

Language:PythonLicense:MITStargazers:41Issues:4Issues:1
Language:PythonStargazers:29Issues:0Issues:0

CML-TTS-Dataset

CML-TTS: A Multilingual Dataset for Speech Synthesis

Language:HTMLStargazers:27Issues:2Issues:0

useful_audio_scripts

Some useful scripts for audio

Language:PythonLicense:MITStargazers:7Issues:1Issues:0