Jacob (jacobswan1)

jacobswan1

Geek Repo

Company: Amazon Alexa AI.

Location:San Jose

Github PK Tool:Github PK Tool

Jacob's repositories

Video2Commonsense

Video captioning baseline models on Video2Commonsense Dataset.

ViTCAP

Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".

maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

SparseR-CNN

End-to-End Object Detection with Learnable Proposal, CVPR2021

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

all-in-one

[Arxiv2022] All in One: Exploring Unified Video-Language Pre-training

Language:PythonStargazers:0Issues:1Issues:0

ASU-Thesis-Format

ASU Thesis Format

Language:TeXStargazers:0Issues:1Issues:0

botocore

The low-level, core functionality of boto 3.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

CMC

Contrastive Multiview Coding

Language:PythonStargazers:0Issues:2Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:1Issues:0

color-aware-style-transfer

Reference code for the paper CAMS: Color-Aware Multi-Style Transfer.

Language:Jupyter NotebookStargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:2Issues:0

denseflow

Extracting optical flow and frames

Language:C++License:MITStargazers:0Issues:1Issues:0

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

IMRAM

code for our CVPR2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"

Language:PythonStargazers:0Issues:1Issues:0

info-ground

Learning phrase grounding from captioned images through InfoNCE bound on mutual information

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

LocalizingMoments

Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"

Stargazers:0Issues:0Issues:0

markdown-content

Markdown content for the www.aerobatic.io website

Stargazers:0Issues:1Issues:0

Oscar

Oscar and VinVL

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PaddleSeg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

License:Apache-2.0Stargazers:0Issues:0Issues:0

pytorchvideo

A deep learning library for video understanding research.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

SwinBERT

Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

video-swin-transformer-pytorch

Video Swin Transformer - PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:ShellLicense:CC0-1.0Stargazers:0Issues:1Issues:0