double4tar (yanhn)

yanhn

Company: MOMO Tech

Location: Beijing

double4tar's starred repositories

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language: HTML | License: CC0-1.0 | Stargazers: 107642 | Issues: 1397 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 66917 | Issues: 555 | Issues: 705

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 24321 | Issues: 192 | Issues: 3836

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | Stargazers: 11340 | Issues: 149 | Issues: 811

pyvideotrans

Translate a video from one language to another and add dubbing.

Language: Python | License: GPL-3.0 | Stargazers: 8384 | Issues: 56 | Issues: 458

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

Language: Python | License: MIT | Stargazers: 7660 | Issues: 32 | Issues: 284

point-e

Point cloud diffusion for 3D model synthesis

Language: Python | License: MIT | Stargazers: 6425 | Issues: 224 | Issues: 85

Language: Python | License: NOASSERTION | Stargazers: 6155 | Issues: 70 | Issues: 116

Language: Python | License: Apache-2.0 | Stargazers: 4599 | Issues: 50 | Issues: 847

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language: Python | License: Apache-2.0 | Stargazers: 4237 | Issues: 58 | Issues: 143

stable-diffusion

Latent Text-to-Image Diffusion

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 3773 | Issues: 60 | Issues: 41

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language: Jupyter Notebook | License: MIT | Stargazers: 2627 | Issues: 33 | Issues: 96

DWPose

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Language: Python | License: Apache-2.0 | Stargazers: 2083 | Issues: 28 | Issues: 86

sdfstudio

A Unified Framework for Surface Reconstruction

Language: Python | License: Apache-2.0 | Stargazers: 1917 | Issues: 30 | Issues: 270

DreamCraft3D

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Language: Python | License: MIT | Stargazers: 1907 | Issues: 119 | Issues: 62

custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Language: Python | License: NOASSERTION | Stargazers: 1823 | Issues: 31 | Issues: 93

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language: Python | License: MIT | Stargazers: 1618 | Issues: 26 | Issues: 174

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language: Python | License: Apache-2.0 | Stargazers: 1614 | Issues: 29 | Issues: 70

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language: Python | License: MIT | Stargazers: 1478 | Issues: 27 | Issues: 45

chinese_speech_pretrain

Chinese speech pretrained models

PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth, achieving stunning videos.

Language: Python | License: Apache-2.0 | Stargazers: 836 | Issues: 19 | Issues: 40

MeshDiffusion

Official implementation of "MeshDiffusion: Score-based Generative 3D Mesh Modeling" (ICLR 2023 Spotlight)

Language: Python | License: MIT | Stargazers: 759 | Issues: 18 | Issues: 36

ICT-FaceKit

ICT's Vision and Graphics Lab's morphable face model and toolkit

Language: Python | License: MIT | Stargazers: 634 | Issues: 35 | Issues: 14

lora-svc

Singing voice conversion based on Whisper, with LoRA for singing voice cloning

Language: Python | License: MIT | Stargazers: 610 | Issues: 24 | Issues: 69

NeRO

[SIGGRAPH 2023] NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images

Language: Python | License: MIT | Stargazers: 522 | Issues: 11 | Issues: 34

ReVersion

ReVersion: Diffusion-Based Relation Inversion from Images

Language: Python | License: NOASSERTION | Stargazers: 443 | Issues: 20 | Issues: 7

ddib

Dual Diffusion Implicit Bridges for Image-to-Image Translation. ICLR 2023.

Language: Python | License: MIT | Stargazers: 338 | Issues: 4 | Issues: 18

ultrapose

Official repository for the ICCV 2021 paper: UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model.

Language: Jupyter Notebook | License: MIT | Stargazers: 101 | Issues: 14 | Issues: 9

AugmentationTutorial

Some basic data augmentation methods

Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0