Jacob's repositories
Video2Commonsense
Video captioning baseline models on the Video2Commonsense dataset.
maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
SparseR-CNN
End-to-End Object Detection with Learnable Proposals, CVPR 2021
all-in-one
[arXiv 2022] All in One: Exploring Unified Video-Language Pre-training
ASU-Thesis-Format
ASU Thesis Format
color-aware-style-transfer
Reference code for the paper CAMS: Color-Aware Multi-Style Transfer.
Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
info-ground
Learning phrase grounding from captioned images through an InfoNCE bound on mutual information
LocalizingMoments
GitHub repository for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
markdown-content
Markdown content for the www.aerobatic.io website
PaddleSeg
Easy-to-use image segmentation library with an awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
pytorchvideo
A deep learning library for video understanding research.
video-swin-transformer-pytorch
Video Swin Transformer - PyTorch