Xiaohu Huang (OliverHxh)

OliverHxh

Geek Repo

Company:huangxiaohu@connect.hku.hk

Github PK Tool:Github PK Tool

Xiaohu Huang's starred repositories

fucking-algorithm

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:46424Issues:304Issues:658

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:10176Issues:103Issues:343

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Language:PythonLicense:Apache-2.0Stargazers:4393Issues:55Issues:122

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonLicense:Apache-2.0Stargazers:2756Issues:30Issues:107

Video-Swin-Transformer

This is an official implementation for "Video Swin Transformers".

Language:PythonLicense:Apache-2.0Stargazers:1395Issues:9Issues:93

VideoX

VideoX: a collection of video cross-modal models

Language:PythonLicense:NOASSERTIONStargazers:961Issues:21Issues:111

pyskl

A toolbox for skeleton-based action recognition.

Language:PythonLicense:Apache-2.0Stargazers:929Issues:12Issues:217

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language:PythonLicense:NOASSERTIONStargazers:844Issues:40Issues:42

OpenGait

A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.

MIMDet

[ICCV 2023] You Only Look at One Partial Sequence

Language:PythonLicense:MITStargazers:335Issues:10Issues:28

SAN

Open-vocabulary Semantic Segmentation

Language:PythonLicense:MITStargazers:295Issues:6Issues:57

ddfnet

The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks

Language:PythonLicense:MITStargazers:211Issues:8Issues:36

TubeDETR

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

Language:PythonLicense:Apache-2.0Stargazers:166Issues:3Issues:21

DASR

Official implementation of the paper 'Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution' in ECCV 2022

Language:PythonLicense:Apache-2.0Stargazers:130Issues:4Issues:27

infogcn

Official implementation for "InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition"

CrosSCLR

The Official PyTorch implementation of "3D Human Action Representation Learning via Cross-View Consistency Pursuit" in CVPR 2021

Language:PythonLicense:BSD-2-ClauseStargazers:64Issues:5Issues:7

FROSTER

The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

Language:PythonLicense:NOASSERTIONStargazers:52Issues:4Issues:4

CSTL

ICCV 2021 PAPER

SkeletonGCL

The repository is the implementation of ICLR 2023 paper "Graph Contrastive Learning for Skeleton-based Action Recognition".

Language:PythonLicense:NOASSERTIONStargazers:43Issues:6Issues:5

ChangeViT

The officical code of 'ChangeViT: Unleashing Plain Vision Transformers for Change Detection'.

Language:PythonLicense:NOASSERTIONStargazers:31Issues:4Issues:4

LLM4VPR

Can multimodal LLM help visual place recognition?

Language:PythonStargazers:28Issues:1Issues:0

SPTNet

The official repository for ICLR2024 paper "SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning"

Language:PythonLicense:NOASSERTIONStargazers:22Issues:2Issues:9

rankseg

RankSEG: A consistent ranking-based framework for segmentation

Language:Jupyter NotebookLicense:MITStargazers:18Issues:1Issues:0

RegionDrag

The official repository for ECCV2024 paper "RegionDrag: Fast Region-Based Image Editing with Diffusion Models"

Language:PythonStargazers:16Issues:0Issues:0

CAG

The official repository for paper "Condition-Adaptive Graph Convolution Learning for Skeleton-Based Gait Recognition"

Stargazers:4Issues:0Issues:0