Shoufa Chen's repositories
DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
AdaptFormer
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
Awesome-Diffusion-Transformers
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
clone-anonymous4open
Clone or download code from https://anonymous.4open.science/
Grounded-Segment-Anything-patch
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
accelerate-patch
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Awesome-Anything-patch
AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
diffusers-dev
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
pytorch-grad-cam
Many Class Activation Map methods implemented in PyTorch for CNNs and Vision Transformers. Examples for classification, object detection, segmentation, embedding networks and more. Including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM.
Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
detr-patch
End-to-End Object Detection with Transformers
DiffDock-patch
Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
langchain-patch
⚡ Building applications with LLMs through composability ⚡
minisora-patch
The Mini Sora project aims to explore the implementation path and future development direction of Sora.
waymo-open-dataset
Waymo Open Dataset