Arka Sadhu (TheShadow29)

Company: Meta

Location: Sunnyvale, CA, USA

Home Page: https://theshadow29.github.io

Twitter: @ArkaSadhu29

Arka Sadhu's repositories

awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

research-advice-list

A compilation of research advice.

zsgnet-pytorch

Official implementation of the ICCV19 oral paper "Zero-Shot Grounding of Objects from Natural Language Queries" (https://arxiv.org/abs/1908.07129)

Language: Python | License: MIT | Stargazers: 69 | Issues: 4 | Issues: 11

vognet-pytorch

[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)

Language: Python | License: MIT | Stargazers: 67 | Issues: 4 | Issues: 8

VidSitu

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

Language: Python | License: MIT | Stargazers: 57 | Issues: 3 | Issues: 21

Video-QAP

Repository for the paper "Video Question Answering with Phrases via Semantic Roles"

Language: Python | License: MIT | Stargazers: 4 | Issues: 3 | Issues: 1

ALBEF

Code for ALBEF: a new vision-language pre-training method

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

bert_score

BERT score for text generation

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

bpycv

Computer vision utils for Blender (generate instance annotation, depth and 6D pose with one line of code)

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

coval

A coreference evaluation package for the CoNLL and ARRAU datasets

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

dotfiles

Some of my config files

Language: Shell | Stargazers: 0 | Issues: 2 | Issues: 0

DownloadConceptualCaptions

Reliably download millions of images efficiently

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

fast-stable-diffusion

fast-stable-diffusion, +25-50% speed increase + memory efficient + DreamBooth

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

GMED

Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

manim

A community-maintained Python framework for creating mathematical animations.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

mnemonics

PyTorch implementation of "Mnemonics Training: Multi-Class Incremental Learning without Forgetting" (CVPR2020 Oral)

License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

neptune-mlflow

Neptune integration with MLflow

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

pycls

Codebase for Image Classification Research, written in PyTorch.

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

pytorchvideo

A deep learning library for video understanding research.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

raiv-task

Repository to hold the dataset for the RAIV task

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

USCthesis

A LaTeX style for theses and dissertations at USC

Language: TeX | Stargazers: 0 | Issues: 1 | Issues: 0

VisTR

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0