Joseph Cheng (indiejoseph)

Company: Soft Butter Studio

Location: Hong Kong

Home Page: http://josephcheng.me

Joseph Cheng's starred repositories

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 23282 | Issues: 194 | Issues: 3639

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8515 | Issues: 79 | Issues: 34

nsfwjs

NSFW detection on the client-side via TensorFlow.js

Language: TypeScript | License: MIT | Stargazers: 7717 | Issues: 83 | Issues: 183

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Python | License: Apache-2.0 | Stargazers: 7372 | Issues: 111 | Issues: 286

Bert-VITS2

vits2 backbone with multilingual-bert

Language: Python | License: AGPL-3.0 | Stargazers: 7207 | Issues: 48 | Issues: 0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language: Python | License: Apache-2.0 | Stargazers: 6547 | Issues: 57 | Issues: 141

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language: Python | License: Apache-2.0 | Stargazers: 4582 | Issues: 38 | Issues: 561

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4072 | Issues: 54 | Issues: 116

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language: Python | License: MIT | Stargazers: 3861 | Issues: 39 | Issues: 120

Style-Transfer-in-Text

Paper List for Style Transfer in Text

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷 Providing higher-quality, richer, and more "digestible" data for large language models!

Language: Python | License: Apache-2.0 | Stargazers: 1319 | Issues: 12 | Issues: 118

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1212 | Issues: 9 | Issues: 115

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language: Python | License: Apache-2.0 | Stargazers: 857 | Issues: 11 | Issues: 26

wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.

Language: Python | License: MIT | Stargazers: 741 | Issues: 18 | Issues: 37

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Language: Python | License: MIT | Stargazers: 703 | Issues: 8 | Issues: 20

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Language: Python | License: AGPL-3.0 | Stargazers: 565 | Issues: 14 | Issues: 88

LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language: Python | License: MIT | Stargazers: 528 | Issues: 9 | Issues: 33
Language: Python | License: Apache-2.0 | Stargazers: 280 | Issues: 11 | Issues: 5

QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

Language: Python | License: MIT | Stargazers: 202 | Issues: 22 | Issues: 19

ChatAlpaca

A Multi-Turn Dialogue Corpus based on Alpaca Instructions

Language: Python | License: Apache-2.0 | Stargazers: 153 | Issues: 4 | Issues: 2

LoRD

Low-rank adapter extraction for fine-tuned transformer models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 147 | Issues: 2 | Issues: 0

amber-train

Pre-training code for Amber 7B LLM

Language: Python | License: Apache-2.0 | Stargazers: 138 | Issues: 8 | Issues: 5

DDDM-VC

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

honcho

Platform for building personalized AI applications

Language: Python | License: AGPL-3.0 | Stargazers: 106 | Issues: 2 | Issues: 1

MiniMA

Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 88 | Issues: 3 | Issues: 5

hf-rvc

Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.

Language: Python | License: MIT | Stargazers: 55 | Issues: 6 | Issues: 5

fastbm25

A fast Python BM25 algorithm implemented with an inverted index

Language: Python | License: Apache-2.0 | Stargazers: 35 | Issues: 1 | Issues: 1

Noise-Contrastive-Alignment

Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards"

License: MIT | Stargazers: 14 | Issues: 0 | Issues: 0

EncT5

Implementation of EncT5 (https://arxiv.org/abs/2110.08426)

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 0 | Issues: 0

tts_rvc

A system that integrates Microsoft's Edge text-to-speech (TTS) engine with Retrieval-Based Voice Conversion (Voice Cloning) technology for generating unique voices.

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0