Beast code in Giters

xunnew's repositories

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.

Language: Python

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language: Python | License: Apache-2.0

AnimateDiff

Official implementation of AnimateDiff.

Language: Python | License: Apache-2.0

anything-llm

Open-source ChatGPT experience for LLMs, embedders, and vector databases. Unlimited documents, messages, and concurrent users with permission management in one app.

Language: JavaScript | License: MIT

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Language: Python | License: NOASSERTION

ComfyUI-Diffusers

A custom node for ComfyUI that lets you use the Hugging Face Diffusers module with ComfyUI. Stream Diffusion is also available.

Language: Python | License: MIT

DigiHuman

Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques

License: GPL-3.0

DynamiCrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

License: Apache-2.0

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

License: GPL-3.0

Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.

License: GPL-3.0

FFmpeg

Mirror of https://git.ffmpeg.org/ffmpeg.git

Language: C | License: NOASSERTION

Fooocus

Focus on prompting and generating

Language: Python | License: GPL-3.0

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language: Python | License: NOASSERTION

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

License: NOASSERTION

IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild


lerobot

🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in PyTorch

Language: Python | License: Apache-2.0

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

License: Apache-2.0

MeloTTS

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.

License: MIT

metahuman-stream

Real-time streaming digital human based on NeRF

License: MIT

oms-Diffusion

License: NOASSERTION

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

License: NOASSERTION

OpenPromptStudio

🥣 Visual prompt editor for AIGC | OPS | Open Prompt Studio


Orion

Orion-14B is a family of models that includes a multilingual foundation LLM with 14 billion parameters and a series of derived models: a chat model, a long-context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model.

License: Apache-2.0

xunnew's repositories

V-Express

3D-Speaker

AnimateDiff

anything-llm

audio2photoreal

ComfyUI-Diffusers

DiffSynth-Studio

DigiHuman

DynamiCrafter

edge-tts

Fay

FFmpeg

Fooocus

gaussian-splatting

GPT-SoVITS

HunyuanDiT

IDM-VTON

lerobot

MagicTime

MeloTTS

metahuman-stream

oms-Diffusion

OOTDiffusion

OpenPromptStudio

Orion

pytorch3d

StableCascade

StoryDiffusion

unstructured-api

VirtualWife