Beast code in Giters

Kunpeng Li's starred repositories

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookMIT23972 316 388

sd-webui-controlnet

WebUI extension for ControlNet

Language:PythonGPL-3.016609 149 1477

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language:PythonApache-2.04161 49 95

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

3317 105 41

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

2928 124 18

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookApache-2.02734 48 87

pytorch-meta

A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch

Language:PythonMIT1960 44 141

Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Language:Jupyter NotebookMIT1732 21 61

Versatile-Diffusion

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Language:PythonMIT1301 28 34

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language:Jupyter NotebookMIT1080 14 33

Awesome-Image-Colorization

:books: A collection of Deep Learning based Image Colorization and Video Colorization papers.

960 41 16

MultiDiffusion

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Language:Jupyter Notebook953 36 25

video_analyst

A series of basic algorithms that are useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS) and so on.

Language:PythonMIT821 29 131

Prompt-Free-Diffusion

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Language:PythonMIT717 12 25

ov-seg

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Language:Jupyter NotebookNOASSERTION666 13 30

UniControl

Unified Controllable Visual Generation Model

Language:PythonApache-2.0594 19 27

unbiased-teacher

PyTorch code for ICLR 2021 paper Unbiased Teacher for Semi-Supervised Object Detection

Language:PythonMIT410 18 80

EMA-VFI

[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio

Language:PythonApache-2.0339 2 26

CFBI

The official implementation of CFBI(+): Collaborative Video Object Segmentation by (Multi-scale) Foreground-Background Integration.

Language:PythonBSD-3-Clause322 20 58

VSRN

PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"

Language:Python285 8 25

GloRe

Global Reasoning module for visual recognition

Language:PythonMIT206 10 16

CVPR21Chal-SLR

This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

Language:PythonCC0-1.0205 3 32

vse_infty

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

Language:PythonMIT152 4 10

FCViT

A Close Look at Spatial Modeling: From Attention to Convolution

Language:PythonApache-2.089 3 6

TERAN

Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

Language:PythonApache-2.074 2 6

OTTER

This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.

Language:PythonMIT64 5 1

Generative_MLZSL

[TPAMI 2023] Generative Multi-Label Zero-Shot Learning

Language:PythonGPL-3.048 5 16

Efficient_Graph_Similarity_Computation

[NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation

Language:PythonMIT38 2 1

ego-topo

Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)

Language:PythonNOASSERTION29 7 3

PsTuts

PyTorch code for the CVPR'2020 paper "Screencast Tutorial Video Understanding"

Language:Jupyter Notebook4 20