EricFuma

AliPay

Hangzhou, China

Fu Guanyu's starred repositories

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language: Python, License: Apache-2.0, Stars: 402

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language: Python, Stars: 649

role-play-synthetic

Synthetic Role-Play Conversation Dataset Generation

Language: Python, License: Apache-2.0, Stars: 28

Diana

DIANA Dataset (ACL 22)

Stars: 8

fuzzywuzzy

Fuzzy String Matching in Python

Language: Python, License: GPL-2.0, Stars: 9186

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Streamer-Sales

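Streamer-Sales (销冠, "Sales Champion"): a live-stream sales host LLM 🛒🎁 that, given a product's features, presents the product in a way designed to spark viewers' desire to buy. 🚀⭐ Includes a detailed data generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️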

Language: Python, License: Apache-2.0, Stars: 1996

ChatTTS

A generative speech model for daily dialogue.

Language: Python, License: AGPL-3.0, Stars: 28451

Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

Language: HTML, Stars: 231

tango

A family of diffusion models for text-to-audio generation.

Language: Python, License: NOASSERTION, Stars: 968

MoneyPrinterTurbo

Generate high-definition short videos with one click using AI large models.

Language: Python, License: MIT, Stars: 15352

StoryDiffusion

Create Magic Story!

Language: Jupyter Notebook, License: Apache-2.0, Stars: 5592

M2UGen

This is the official repository for M2UGen

Language: Jupyter Notebook, License: MIT, Stars: 427

i-Code

Language: Jupyter Notebook, License: MIT, Stars: 1656

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook, License: MIT, Stars: 13944

open_flamingo

An open-source framework for training large multimodal models.

Language: Python, License: MIT, Stars: 3598

Paper-Implementation-Template

A simple reproducible template to implement AI research papers

License: MIT, Stars: 21

mPLUG-Owl

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Language: Python, License: MIT, Stars: 2039

tts-qa

Language: Python, Stars: 61

pyannote-whisper

Language: Python, Stars: 464

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python, License: MIT, Stars: 9949

Awesome-LLM-System-Papers

PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Language: Python, License: MIT, Stars: 203

llama3

The official Meta Llama 3 GitHub site

Language: Python, License: NOASSERTION, Stars: 24948

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook, License: Apache-2.0, Stars: 12563

ai-audio-startups

Community list of startups working with AI in audio and music technology

License: Apache-2.0, Stars: 1504

SpeechDatasetSplitter

A simple waveform segmentator using OpenAI's Whisper

Language: Python, License: MIT, Stars: 4

pyvideotrans

Translate a video from one language to another and add dubbing.

Language: Python, License: GPL-3.0, Stars: 8383

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language: Python, License: MIT, Stars: 290

mamba

Mamba SSM architecture

Language: Python, License: Apache-2.0, Stars: 11963