Yuan-Man (Yuan-ManX)

Yuan-ManX

Geek Repo

Location:Shanghai, China

Home Page:ym1076302261@163.com

Github PK Tool:Github PK Tool

Yuan-Man's repositories

ai-audio-datasets

AI Audio Datasets 🎵. A list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

ai-game-devtools

Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥

audio-development-tools

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

License:MITStargazers:254Issues:10Issues:0

ai-agent-roadmap

Explore the latest AI Agent Framework!

License:MITStargazers:30Issues:5Issues:0

ComfyUI-Tools-Roadmap

Here we will track the latest development tools for ComfyUI, including Image, Mesh, Texture, Animation, Video, Audio, 3D Model, and more!🔥

License:MITStargazers:22Issues:3Issues:0

ai-multimodal-timeline

Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D content. 🔥

License:MITStargazers:8Issues:0Issues:0

ai-audio-startups

Community list of startups working with AI in audio and music technology

License:Apache-2.0Stargazers:3Issues:1Issues:0

Awesome-ChatTTS

Awesome-ChatTTS 整理和汇总了 ChatTTS 项目的常见问题和相关资源,是 ChatTTS 的最佳入门指南。

License:NOASSERTIONStargazers:2Issues:0Issues:0

ai-voice-agents

AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧

License:MITStargazers:1Issues:0Issues:0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0
License:NOASSERTIONStargazers:1Issues:0Issues:0
License:MITStargazers:1Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

License:NOASSERTIONStargazers:1Issues:0Issues:0
License:MITStargazers:1Issues:0Issues:0

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

License:NOASSERTIONStargazers:1Issues:0Issues:0

Omost

Your image is almost there!

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

AudioLLM

Audio Large Language Models

Stargazers:0Issues:0Issues:0

awesome-ssm-ml

Reading list for research topics in state-space models

License:MITStargazers:0Issues:0Issues:0

ComfyUI_examples

Examples of ComfyUI workflows

Language:HTMLStargazers:0Issues:0Issues:0

friendly-stable-audio-tools

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

License:MITStargazers:0Issues:0Issues:0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

License:MITStargazers:0Issues:0Issues:0

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

ragoon

Improve large language models (LLM) retrieval using dynamic web-search based on blazingly fast query generation from Groq chips ⚡

License:Apache-2.0Stargazers:0Issues:0Issues:0

Scrapegraph-ai

Python scraper based on AI

License:MITStargazers:0Issues:0Issues:0

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language:PythonStargazers:0Issues:0Issues:0

XTTSv2

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0