TheShadow29

Arka Sadhu's repositories

awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

research-advice-list

A compilation of research advice.

zsgnet-pytorch

Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.07129)

Language:PythonMIT69 4 11

vognet-pytorch

[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)

Language:PythonMIT67 4 8

VidSitu

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

Language:PythonMIT57 3 21

Video-QAP

Repository for the paper Video Question Answering with Phrases via Semantic Roles

Language:PythonMIT4 3 1

ALBEF

Code for ALBEF: a new vision-language pre-training method

Language:PythonBSD-3-Clause010

bert_score

BERT score for text generation

Language:Jupyter NotebookMIT020

bpycv

Computer vision utils for Blender (generate instance annoatation, depth and 6D pose by one line code)

Language:PythonMIT010

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language:PythonMIT010

coco-caption

Language:Jupyter NotebookNOASSERTION020

coval

A coreference evaluation package for the CoNLL and ARRAU datasets

Language:PythonMIT010

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonApache-2.0010

dotfiles

Some of my config files

Language:Shell020

DownloadConceptualCaptions

Reliably download millions of images efficiently

Language:Jupyter NotebookMIT010

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT010

fast-stable-diffusion

fast-stable-diffusion, +25-50% speed increase + memory efficient + DreamBooth

Language:PythonMIT010

GMED

Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020

Language:Python010

manim

A community-maintained Python framework for creating mathematical animations.

Language:PythonMIT010

manim-pptx

Language:PythonMIT010

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonNOASSERTION010

mnemonics

PyTorch implementation of "Mnemonics Training: Multi-Class Incremental Learning without Forgetting" (CVPR2020 Oral)

MIT020

neptune-mlflow

Neptune integration with MLflow

Language:PythonApache-2.0010

pycls

Codebase for Image Classification Research, written in PyTorch.

Language:PythonMIT020

pytorchvideo

A deep learning library for video understanding research.

Language:PythonApache-2.0010

raiv-task

Repository to hold dataset for RAIV task

MIT010

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonApache-2.0010

tabular_dae

Language:PythonApache-2.0010

USCthesis

a LaTeX style for theses and dissertations at USC

Language:TeX010

VisTR

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Language:PythonApache-2.0010