yanhn

followers

following

stars

MOMO Tech

Beijing

double4tar's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLCC0-1.0107642 13970

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION66917 555 705

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.024321 192 3836

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonNOASSERTION11340 149 811

pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

Language:PythonGPL-3.08384 56 458

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language:PythonMIT7660 32 284

point-e

Point cloud diffusion for 3D model synthesis

Language:PythonMIT6425 224 85

instruct-pix2pix

Language:PythonNOASSERTION6155 70 116

sd-scripts

Language:PythonApache-2.04599 50 847

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language:PythonApache-2.04237 58 143

stable-diffusion

Latent Text-to-Image Diffusion

Language:Jupyter NotebookNOASSERTION3773 60 41

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookMIT2627 33 96

DWPose

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Language:PythonApache-2.02083 28 86

sdfstudio

A Unified Framework for Surface Reconstruction

Language:PythonApache-2.01917 30 270

DreamCraft3D

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Language:PythonMIT1907 119 62

custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Language:PythonNOASSERTION1823 31 93

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonMIT1618 26 174

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language:PythonApache-2.01614 29 70

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language:PythonMIT1478 27 45

chinese_speech_pretrain

chinese speech pretrained models

Language:Shell975 10 54

PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA，你的个性化图像动画生成器，利用文本提示将图像变为奇妙的动画

Language:PythonApache-2.0836 19 40

MeshDiffusion

Official implementation of "MeshDiffusion: Score-based Generative 3D Mesh Modeling" (ICLR 2023 Spotlight)

Language:PythonMIT759 18 36

ICT-FaceKit

ICT's Vision and Graphics Lab's morphable face model and toolkit

Language:PythonMIT634 35 14

lora-svc

singing voice change based on whisper, and lora for singing voice clone

Language:PythonMIT610 24 69

NeRO

[SIGGRAPH2023] NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images

Language:PythonMIT522 11 34

ReVersion

ReVersion: Diffusion-Based Relation Inversion from Images

Language:PythonNOASSERTION443 20 7

ddib

Dual Diffusion Implicit Bridges for Image-to-Image Translation. ICLR 2023.

Language:PythonMIT338 4 18

ultrapose

Official repository for the ICCV 2021 paper: UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model.

Language:Jupyter NotebookMIT101 14 9

KnowledgeVL-Reading

AugmentationTutorial

some basic data augmentation method

Language:Python2 10