Yiyun Chen (yiyunchen)

Location: Shenzhen, China

Yiyun Chen's starred repositories

VAR-CLIP

Implements VAR+CLIP for image generation

Language: Python, Stargazers: 67, Issues: 0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python, License: MIT, Stargazers: 4024, Issues: 0

Deep-Live-Cam

Real-time face swap and one-click video deepfake using only a single image

Language: Python, License: AGPL-3.0, Stargazers: 36827, Issues: 0

ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Language: Python, License: Apache-2.0, Stargazers: 1283, Issues: 0

VmambaIR

This is the official implementation of "VmambaIR: Visual State Space Model for Image Restoration"

Language: Python, Stargazers: 166, Issues: 0

SEA-RAFT

[ECCV2024 Oral] SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow

Language: Python, License: BSD-3-Clause, Stargazers: 250, Issues: 0

Language: JavaScript, License: MIT, Stargazers: 134, Issues: 0

FineDiving

FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment

Language: Python, License: MIT, Stargazers: 106, Issues: 0

CogVideo

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language: Python, License: Apache-2.0, Stargazers: 7709, Issues: 0

COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

Language: Python, Stargazers: 194, Issues: 0

IDM-VTON-training

IDM-VTON-training: Unofficial training code for IDM-VTON

Language: Python, Stargazers: 55, Issues: 0

DMT

Deficiency-Aware Masked Transformer for Video Inpainting

License: Apache-2.0, Stargazers: 51, Issues: 0

FuseFormer

Official PyTorch implementation of the ICCV 2021 paper "FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting".

Language: Python, Stargazers: 110, Issues: 0

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 5051, Issues: 0

Sports-QA

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

Stargazers: 27, Issues: 0

mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language: Python, License: Apache-2.0, Stargazers: 4191, Issues: 0

SportsHHI

[CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos

Language: Python, Stargazers: 11, Issues: 0

MultiSports

[ICCV 2021] MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions

Language: Python, License: NOASSERTION, Stargazers: 107, Issues: 0

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language: Python, License: Apache-2.0, Stargazers: 30066, Issues: 0

IDM-VTON

[ECCV2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language: Python, Stargazers: 3663, Issues: 0

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 6374, Issues: 0

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

Language: C++, License: Apache-2.0, Stargazers: 1825, Issues: 0

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language: Python, License: MIT, Stargazers: 4354, Issues: 0

TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

Language: Python, License: Apache-2.0, Stargazers: 225, Issues: 0

EssentialMC2

EssentialMC2 Video Understanding.

Language: Python, License: MIT, Stargazers: 114, Issues: 0

PaddleVideo

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB- and skeleton-based action recognition models, and practical applications for video tagging and sport action detection.

Language: Python, License: Apache-2.0, Stargazers: 1506, Issues: 0

Language: Python, License: Apache-2.0, Stargazers: 98, Issues: 0

promptbench

A unified evaluation framework for large language models

Language: Python, License: MIT, Stargazers: 2391, Issues: 0

Language: C++, License: Apache-2.0, Stargazers: 1576, Issues: 0