Kunpeng Li (KunpengLi1994)

Location: Boston, USA

Home Page: https://kunpengli1994.github.io/

Kunpeng Li's starred repositories

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 23972 | Issues: 316 | Issues: 388

sd-webui-controlnet

WebUI extension for ControlNet

Language: Python | License: GPL-3.0 | Stargazers: 16609 | Issues: 149 | Issues: 1477

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 4161 | Issues: 49 | Issues: 95

Awesome-Visual-Transformer

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2734 | Issues: 48 | Issues: 87

pytorch-meta

A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch

Language: Python | License: MIT | Stargazers: 1960 | Issues: 44 | Issues: 141

Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Language: Jupyter Notebook | License: MIT | Stargazers: 1732 | Issues: 21 | Issues: 61

Versatile-Diffusion

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Language: Python | License: MIT | Stargazers: 1301 | Issues: 28 | Issues: 34

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language: Jupyter Notebook | License: MIT | Stargazers: 1080 | Issues: 14 | Issues: 33

Awesome-Image-Colorization

A collection of Deep Learning based Image Colorization and Video Colorization papers.

MultiDiffusion

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Language: Jupyter Notebook | Stargazers: 953 | Issues: 36 | Issues: 25

video_analyst

A series of basic algorithms that are useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS) and so on.

Language: Python | License: MIT | Stargazers: 821 | Issues: 29 | Issues: 131

Prompt-Free-Diffusion

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arXiv 2023 / CVPR 2024

Language: Python | License: MIT | Stargazers: 717 | Issues: 12 | Issues: 25

ov-seg

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 666 | Issues: 13 | Issues: 30

UniControl

Unified Controllable Visual Generation Model

Language: Python | License: Apache-2.0 | Stargazers: 594 | Issues: 19 | Issues: 27

unbiased-teacher

PyTorch code for ICLR 2021 paper Unbiased Teacher for Semi-Supervised Object Detection

Language: Python | License: MIT | Stargazers: 410 | Issues: 18 | Issues: 80

EMA-VFI

[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Language: Python | License: Apache-2.0 | Stargazers: 339 | Issues: 2 | Issues: 26

CFBI

The official implementation of CFBI(+): Collaborative Video Object Segmentation by (Multi-scale) Foreground-Background Integration.

Language: Python | License: BSD-3-Clause | Stargazers: 322 | Issues: 20 | Issues: 58

VSRN

PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"

GloRe

Global Reasoning module for visual recognition

Language: Python | License: MIT | Stargazers: 206 | Issues: 10 | Issues: 16

CVPR21Chal-SLR

This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

Language: Python | License: CC0-1.0 | Stargazers: 205 | Issues: 3 | Issues: 32

vse_infty

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

Language: Python | License: MIT | Stargazers: 152 | Issues: 4 | Issues: 10

FCViT

A Close Look at Spatial Modeling: From Attention to Convolution

Language: Python | License: Apache-2.0 | Stargazers: 89 | Issues: 3 | Issues: 6

TERAN

Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

Language: Python | License: Apache-2.0 | Stargazers: 74 | Issues: 2 | Issues: 6

OTTER

This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.

Language: Python | License: MIT | Stargazers: 64 | Issues: 5 | Issues: 1

Generative_MLZSL

[TPAMI 2023] Generative Multi-Label Zero-Shot Learning

Language: Python | License: GPL-3.0 | Stargazers: 48 | Issues: 5 | Issues: 16

Efficient_Graph_Similarity_Computation

[NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation

Language: Python | License: MIT | Stargazers: 38 | Issues: 2 | Issues: 1

ego-topo

Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)

Language: Python | License: NOASSERTION | Stargazers: 29 | Issues: 7 | Issues: 3

PsTuts

PyTorch code for the CVPR'2020 paper "Screencast Tutorial Video Understanding"

Language: Jupyter Notebook | Stargazers: 4 | Issues: 2 | Issues: 0