Linjie Li (linjieli222)

linjieli222

Geek Repo

Company:University of Washington; Microsoft

Location:Seattle, WA

Github PK Tool:Github PK Tool

Linjie Li's repositories

HERO

Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"

Language:PythonLicense:MITStargazers:228Issues:7Issues:46

VQA_ReGAT

Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"

Language:PythonLicense:MITStargazers:176Issues:6Issues:41

HERO_Video_Feature_Extractor

Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"

Language:PythonLicense:MITStargazers:91Issues:3Issues:8

VALUE

Video And Language Understanding Evaluation

Language:PythonStargazers:2Issues:3Issues:0

attrEXP

attractiveness experiments on Amazon MTurk

Language:JavaScriptStargazers:0Issues:2Issues:0

bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

cc

Creative Commons copyright license files

Language:HTMLStargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

merlot-1

MERLOT: Multimodal Neural Script Knowledge Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MIL-NCE_HowTo100M

PyTorch GPU distributed training code for MIL-NCE HowTo100M

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pythia

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

seada-vqa

A pytorch implemetation of data augmentation method for visual question answering

License:MITStargazers:0Issues:0Issues:0
Language:MatlabStargazers:0Issues:4Issues:0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

TVRetrieval

PyTorch implementation of XML on TVR dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
License:CC0-1.0Stargazers:0Issues:0Issues:0

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language:PythonLicense:MITStargazers:0Issues:0Issues:0