Fu Guanyu (EricFuma)

Company: AliPay

Location: Hangzhou, China

Fu Guanyu's starred repositories

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language: Python | License: Apache-2.0 | Stargazers: 402 | Issues: 0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language: Python | Stargazers: 649 | Issues: 0

role-play-synthetic

Synthetic Role-Play Conversation Dataset Generation

Language: Python | License: Apache-2.0 | Stargazers: 28 | Issues: 0

Diana

DIANA Dataset (ACL 22)

Stargazers: 8 | Issues: 0

fuzzywuzzy

Fuzzy String Matching in Python

Language: Python | License: GPL-2.0 | Stargazers: 9186 | Issues: 0

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Stargazers: 478 | Issues: 0

Streamer-Sales

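Streamer-Sales (Sales Champion): a livestream sales-host LLM 🛒🎁 that, given a product's features, presents the product in a way designed to spark viewers' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that searches the web for real-time information 🌐, and ASR (speech-to-text) 🎙️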

Language: Python | License: Apache-2.0 | Stargazers: 1996 | Issues: 0

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 28451 | Issues: 0

Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

Language: HTML | Stargazers: 231 | Issues: 0

tango

A family of diffusion models for text-to-audio generation.

Language: Python | License: NOASSERTION | Stargazers: 968 | Issues: 0

MoneyPrinterTurbo

Generate high-definition short videos with one click using large AI models.

Language: Python | License: MIT | Stargazers: 15352 | Issues: 0

StoryDiffusion

Create Magic Story!

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5592 | Issues: 0

M2UGen

This is the official repository for M2UGen

Language: Jupyter Notebook | License: MIT | Stargazers: 427 | Issues: 0

Language: Jupyter Notebook | License: MIT | Stargazers: 1656 | Issues: 0

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 13944 | Issues: 0

open_flamingo

An open-source framework for training large multimodal models.

Language: Python | License: MIT | Stargazers: 3598 | Issues: 0

Paper-Implementation-Template

A simple reproducible template to implement AI research papers

License: MIT | Stargazers: 21 | Issues: 0

mPLUG-Owl

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Language: Python | License: MIT | Stargazers: 2039 | Issues: 0

Language: Python | Stargazers: 61 | Issues: 0

Language: Python | Stargazers: 464 | Issues: 0

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python | License: MIT | Stargazers: 9949 | Issues: 0

PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Language: Python | License: MIT | Stargazers: 203 | Issues: 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 24948 | Issues: 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 12563 | Issues: 0

ai-audio-startups

Community list of startups working with AI in audio and music technology

License: Apache-2.0 | Stargazers: 1504 | Issues: 0

SpeechDatasetSplitter

A simple waveform segmentator using OpenAI's Whisper

Language: Python | License: MIT | Stargazers: 4 | Issues: 0

pyvideotrans

Translate a video from one language to another and add dubbing.

Language: Python | License: GPL-3.0 | Stargazers: 8383 | Issues: 0

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language: Python | License: MIT | Stargazers: 290 | Issues: 0

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 11963 | Issues: 0