Risa Shinoda's starred repositories

vila

Incorporating VIsual LAyout Structures for Scientific Text Classification

Language:PythonLicense:Apache-2.0Stargazers:161Issues:0Issues:0

layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis

Language:PythonLicense:Apache-2.0Stargazers:4644Issues:0Issues:0

pdffigures2

Given a scholarly PDF, extract figures, tables, captions, and section titles.

Language:ScalaLicense:Apache-2.0Stargazers:568Issues:0Issues:0

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:28554Issues:0Issues:0

Pets-Face-Recognition

Animal identification using face recognition based methods

Language:PythonLicense:Apache-2.0Stargazers:60Issues:0Issues:0

STIP

Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection.

Language:PythonLicense:NOASSERTIONStargazers:44Issues:0Issues:0

pytorch-i3d-feature-extraction

Code for I3D Feature Extraction

Language:PythonLicense:Apache-2.0Stargazers:132Issues:0Issues:0

BlendFace

[ICCV 2023] BlendFace: Re-designing Identity Encoders for Face-Swapping https://arxiv.org/abs/2307.10854

Language:PythonLicense:NOASSERTIONStargazers:162Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:954Issues:0Issues:0

kinetics_i3d_pytorch

Inflated i3d network with inception backbone, weights transfered from tensorflow

Language:PythonLicense:MITStargazers:520Issues:0Issues:0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:4469Issues:0Issues:0

lk_demo

Lucas-Kanade tracking demo

Language:C++Stargazers:5Issues:0Issues:0

sam-pt

SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

Language:PythonLicense:Apache-2.0Stargazers:932Issues:0Issues:0