OliverHxh

Xiaohu Huang's starred repositories

fucking-algorithm

刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.046424 304 658

CPlusPlusThings

C++那些事

Language:C++38629 548 221

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonApache-2.010176 103 343

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Language:PythonApache-2.04393 55 122

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3063 127 18

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:Python2863 33 128

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonApache-2.02756 30 107

Video-Swin-Transformer

This is an official implementation for "Video Swin Transformers".

Language:PythonApache-2.01395 9 93

VideoX

VideoX: a collection of video cross-modal models

Language:PythonNOASSERTION961 21 111

pyskl

A toolbox for skeleton-based action recognition.

Language:PythonApache-2.0929 12 217

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language:PythonNOASSERTION844 40 42

OpenGait

A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.

Language:Python703 19 203

MIMDet

[ICCV 2023] You Only Look at One Partial Sequence

Language:PythonMIT335 10 28

SAN

Open-vocabulary Semantic Segmentation

Language:PythonMIT295 6 57

ddfnet

The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks

Language:PythonMIT211 8 36

TubeDETR

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

Language:PythonApache-2.0166 3 21

DASR

Official implementation of the paper 'Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution' in ECCV 2022

Language:PythonApache-2.0130 4 27

infogcn

Official implementation for "InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition"

Language:Python111 2 21

Open-VCLIP

Language:Python100 2 14

CrosSCLR

The Official PyTorch implementation of "3D Human Action Representation Learning via Cross-View Consistency Pursuit" in CVPR 2021

Language:PythonBSD-2-Clause64 5 7

FROSTER

The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

Language:PythonNOASSERTION52 4 4

CSTL

ICCV 2021 PAPER

Language:Python43 1 8

SkeletonGCL

The repository is the implementation of ICLR 2023 paper "Graph Contrastive Learning for Skeleton-based Action Recognition".

Language:PythonNOASSERTION43 6 5

ChangeViT

The officical code of 'ChangeViT: Unleashing Plain Vision Transformers for Change Detection'.

Language:PythonNOASSERTION31 4 4

LLM4VPR

Can multimodal LLM help visual place recognition?

Language:Python28 10

SPTNet

The official repository for ICLR2024 paper "SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning"

Language:PythonNOASSERTION22 2 9

rankseg

RankSEG: A consistent ranking-based framework for segmentation

Language:Jupyter NotebookMIT18 10

RegionDrag

The official repository for ECCV2024 paper "RegionDrag: Fast Region-Based Image Editing with Diffusion Models"

Language:Python1600

CAG

The official repository for paper "Condition-Adaptive Graph Convolution Learning for Skeleton-Based Gait Recognition"

400