sunhd96

sunhd96's starred repositories

generative-models

Generative Models by Stability AI

Language: Python | License: MIT | Stargazers: 23597 | Issues: 0

awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

Language: TeX | License: MIT | Stargazers: 280 | Issues: 0

awesome-video-generation

A collection of awesome video generation studies.

Language: TeX | License: MIT | Stargazers: 187 | Issues: 0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python | License: NOASSERTION | Stargazers: 5789 | Issues: 0

ControlNet-for-Diffusers

Transfer the ControlNet with any basemodel in diffusers🔥

Language: Python | License: MIT | Stargazers: 787 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9321 | Issues: 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 4520 | Issues: 0

T2I-Adapter

T2I-Adapter

Language: Python | Stargazers: 3338 | Issues: 0

dpm-solver

Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)

Language: Python | License: MIT | Stargazers: 1480 | Issues: 0

Language: Python | License: MIT | Stargazers: 5939 | Issues: 0

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language: Python | License: NOASSERTION | Stargazers: 35613 | Issues: 0

License: CC-BY-4.0 | Stargazers: 799 | Issues: 0

BiT

[CVPR2023] Blur Interpolation Transformer for Real-World Motion from Blur

Language: Python | License: MIT | Stargazers: 207 | Issues: 0

Awesome-Deblurring

A curated list of resources for Image and Video Deblurring

Stargazers: 2342 | Issues: 0

DiffIR

Official implementation of "DiffIR: Efficient Diffusion Model for Image Restoration" (ICCV 2023)

Language: Jupyter Notebook | Stargazers: 416 | Issues: 0

HI-Diff

PyTorch code for our NeurIPS 2023 paper "Hierarchical Integration Diffusion Model for Realistic Image Deblurring"

Language: Python | License: Apache-2.0 | Stargazers: 144 | Issues: 0

LivePortrait

Bring portraits to life!

Language: Python | License: NOASSERTION | Stargazers: 8945 | Issues: 0

glide

An image loading and caching library for Android focused on smooth scrolling

Language: Java | License: NOASSERTION | Stargazers: 34502 | Issues: 0

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language: Python | License: Apache-2.0 | Stargazers: 1583 | Issues: 0

llm-action

This project aims to share the technical principles of large models along with hands-on experience in applying them.

Language: HTML | License: Apache-2.0 | Stargazers: 8224 | Issues: 0

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language: Jupyter Notebook | License: MIT | Stargazers: 5599 | Issues: 0

EDGE

Official PyTorch Implementation of EDGE (CVPR 2023)

Language: Python | License: MIT | Stargazers: 421 | Issues: 0

DisCo

[CVPR2024] DisCo: Referring Human Dance Generation in Real World

Language: Python | License: Apache-2.0 | Stargazers: 1027 | Issues: 0

HoT

[CVPR 2024 🔥] Official implementation of the paper "⏳ Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation"

Language: Python | License: MIT | Stargazers: 148 | Issues: 0

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language: Python | License: Apache-2.0 | Stargazers: 2986 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8173 | Issues: 0

sort

Simple, online, and realtime tracking of multiple objects in a video sequence.

Language: Python | License: GPL-3.0 | Stargazers: 3852 | Issues: 0

deep-learning-for-image-processing

Deep learning for image processing, including classification, object detection, etc.

Language: Python | License: GPL-3.0 | Stargazers: 21971 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9836 | Issues: 0

ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"

Language: Python | License: Apache-2.0 | Stargazers: 1277 | Issues: 0