Yuan-Man (Yuan-ManX)

Yuan-ManX

Geek Repo

Location:Shanghai, China

Home Page:ym1076302261@163.com

Github PK Tool:Github PK Tool

Yuan-Man's repositories

ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

audio-development-tools

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

License:MITStargazers:264Issues:11Issues:0

SouPyX

SouPyX: An Audio Exploration Space.🪐

Language:PythonLicense:MITStargazers:31Issues:2Issues:2

audio-ai-agent

Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.

License:MITStargazers:11Issues:2Issues:0

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0

artnex

ArtNex is a deep learning framework exploring the innovative fusion of art and technology.

Language:PythonLicense:MITStargazers:2Issues:3Issues:0

riffusion

Stable diffusion for real-time music generation

Language:PythonLicense:MITStargazers:2Issues:1Issues:0

audio-preprocess

Preprocess Audio for training

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

game-engine

Explore Game Engine Tools! 🚀

License:MITStargazers:1Issues:2Issues:0

multi-clip

Connecting text, images, audio, and video!

License:MITStargazers:1Issues:2Issues:0

ollama

Get up and running with Llama 2, Mistral, and other large language models locally.

Language:GoLicense:MITStargazers:1Issues:1Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

speechtoolkit

[EARLY PUBLIC ALPHA] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activity detection, and more!

Language:PythonStargazers:1Issues:1Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

GPT-SoVITS-GUI

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
License:MITStargazers:0Issues:2Issues:0

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

metavoice-src

AI for human-level speech intelligence

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

NexEngine

NexEngine Game Engine 🚀

License:MITStargazers:0Issues:2Issues:0

open-interpreter

A natural language interface for computers

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

Open-Sora-Plan

This project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:1Issues:0

ppgs

High-Fidelity Neural Phonetic Posteriorgrams

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
License:MITStargazers:0Issues:2Issues:0

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

Stargazers:0Issues:1Issues:0

spleeter

Deezer source separation library including pretrained models.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

VisionProTeleop

VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.

Language:SwiftLicense:MITStargazers:0Issues:1Issues:0