Shoubin (Yui010206)

Yui010206

Geek Repo

Company:UNC, Chapel Hill

Location:Chapel Hill

Home Page:https://yui010206.github.io/

Github PK Tool:Github PK Tool

Shoubin's repositories

SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Language:PythonLicense:BSD-3-ClauseStargazers:165Issues:3Issues:24

CREMA

☕️ CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion

Language:PythonLicense:BSD-3-ClauseStargazers:19Issues:2Issues:0

MoPRL

[TCSVT] Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection

Language:PythonStargazers:10Issues:1Issues:0

AIART_Website

an image style translatiton website

Stargazers:0Issues:2Issues:0

AlphaPose

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

arunmallya.github.io

my public website

Language:JavaScriptStargazers:0Issues:1Issues:0

awesome-anomaly-detection

A curated list of awesome anomaly detection resources

Stargazers:0Issues:1Issues:0

awesome-vln

A curated list of research papers in Vision-Language Navigation (VLN)

License:MITStargazers:0Issues:1Issues:0

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

grid-feats-vqa

Grid features pre-training code for visual question answering

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

HOI-Learning-List

A list of Human-Object Interaction Learning.

Stargazers:0Issues:1Issues:0

just-ask

[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0
Stargazers:0Issues:2Issues:0

magenta

Magenta: Music and Art Generation with Machine Intelligence

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

merlot_reserve

Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

n2nmn

Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017

Language:SourcePawnLicense:BSD-2-ClauseStargazers:0Issues:1Issues:0

Person-Search-with-Natural-Language-Description

Person Search with Natural Language Description

Language:LuaStargazers:0Issues:1Issues:0

Research

novel deep learning research works with PaddlePaddle

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Scene-Graph-Benchmark.pytorch

A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training CVPR 2020”

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:1Issues:0

seg2vid

Video Generation from Single Semantic Label Map

Language:PythonStargazers:0Issues:1Issues:0

SJTUThesis

Shanghai Jiao Tong University XeLaTeX Thesis Template

Language:TeXLicense:Apache-2.0Stargazers:0Issues:1Issues:0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

transformer-time-series-prediction

proof of concept for a transformer-based time series prediction model

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

VGT

Video Graph Transformer for Video Question Answering (ECCV'22)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

video-swin-transformer-pytorch

Video Swin Transformer - PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

video_feature_extractor

Easy to use video deep features extractor

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ViLT

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:SCSSLicense:MITStargazers:0Issues:0Issues:0