Arka Sadhu (TheShadow29)

TheShadow29

Geek Repo

Company:University of Southern California

Location:Los Angeles, CA, USA

Home Page:https://theshadow29.github.io

Github PK Tool:Github PK Tool

ezoic increase your site revenue

Arka Sadhu's repositories

awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

research-advice-list

A compilation of research advice.

vognet-pytorch

[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)

Language:PythonLicense:MITStargazers:65Issues:4Issues:7

zsgnet-pytorch

Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.07129)

Language:PythonLicense:MITStargazers:63Issues:3Issues:10

VidSitu

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

Language:PythonLicense:MITStargazers:39Issues:2Issues:16

Video-QAP

Repository for the paper Video Question Answering with Phrases via Semantic Roles

Language:PythonLicense:MITStargazers:4Issues:2Issues:1

ALBEF

Code for ALBEF: a new vision-language pre-training method

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

alfred

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

Language:CLicense:MITStargazers:0Issues:1Issues:0

bert_score

BERT score for text generation

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

bpycv

Computer vision utils for Blender (generate instance annoatation, depth and 6D pose by one line code)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Conference-Acceptance-Rate

Statistics of acceptance rate for the main AI conference

Stargazers:0Issues:0Issues:0

coval

A coreference evaluation package for the CoNLL and ARRAU datasets

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dotfiles

Some of my config files

Language:Emacs LispStargazers:0Issues:2Issues:0

DownloadConceptualCaptions

Reliably download millions of images efficiently

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

GMED

Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020

Language:PythonStargazers:0Issues:0Issues:0

grounded-video-description

Video Grounding and Captioning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

manim

A community-maintained Python framework for creating mathematical animations.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

mnemonics

PyTorch implementation of "Mnemonics Training: Multi-Class Incremental Learning without Forgetting" (CVPR2020 Oral)

License:MITStargazers:0Issues:2Issues:0

neptune-mlflow

Neptune integration with MLflow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pycls

Codebase for Image Classification Research, written in PyTorch.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pytorchvideo

A deep learning library for video understanding research.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

USCthesis

a LaTeX style for theses and dissertations at USC

Language:TeXStargazers:0Issues:0Issues:0

VisTR

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0