731why's repositories
IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
insightface
State-of-the-art 2D and 3D Face Analysis Project
Micro-Wheeled_leg-Robot
全球最小的桌面级双轮腿机器人!
mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
richdreamer
Live Demo:https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点对商品进行解说并激发用户的购买意愿的卖货主播模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊
act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
ai-collection
The Generative AI Landscape - A Collection of Awesome Generative AI Applications
AI-GAL
AL GAL是专门为Galgame场景设计的程序,旨在让得每一名用户都能享受到独一无二的剧情。程序基于renpy框架开发
ai2apps
Setup AI2Apps at local system so you can use your own OpenAI key or make more back-end features.
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
ChatTTS
ChatTTS is a generative speech model for daily dialogue.
DOLY-DIY
DIY Doly project
IDM-VTON
IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
InstructAvatar
Official implementation of the paper 'InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation'
open-parse
Improved file parsing for LLM’s
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
suno-api
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
TAICHI-flet
基于flet的一款windows桌面应用,实现了浏览图片、音乐、小说、漫画、各种资源的功能。
V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision