yumianhuli1

yumianhuli's repositories

adetailer

Auto detecting, masking and inpainting with detection model.

Language:PythonAGPL-3.0000

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT000

Awesome-Interaction-Aware-Trajectory-Prediction

A selection of state-of-the-art research materials on trajectory prediction

Language:TeXMIT000

ChatDev-QingHua

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Apache-2.0000

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

MIT000

Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.

GPL-3.0000

FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

000

FollowYourPose

[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"

MIT000

HandRefiner

MIT000

HybrIK

Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021

MIT000

InternVideo

InternVideo: General Video Foundation Models via Generative and Discriminative Learning (https://arxiv.org/abs/2212.03191)

Apache-2.0000

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Apache-2.0000

LLMUnity

Integrate LLM models in Unity!

MIT000

LWM

Apache-2.0000

MaterialSearch

AI语义搜索本地素材。以图搜图、查找本地素材、根据文字描述匹配画面、视频帧搜索、根据画面描述搜索视频。Semantic search. Search local photos and videos through natural language.

GPL-3.0000

momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Language:PythonMIT000

MoneyPrinterV2

Automate the process of making money online.

AGPL-3.0000

Open-Sora-Plan

This project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

NOASSERTION000