Vincent (forrestsz)

Location: Shenzhen

Vincent's starred repositories

Language: Lua | Stargazers: 6 | Issues: 0

clip_it

CLIP-It! Language-Guided Video Summarization

Stargazers: 72 | Issues: 0

charades-algorithms

Activity Recognition Algorithms for the Charades Dataset

Language: Lua | Stargazers: 201 | Issues: 0

actor-observer

ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018

Language: Python | License: GPL-3.0 | Stargazers: 76 | Issues: 0
Language: Python | License: MIT | Stargazers: 67 | Issues: 0

AVT

Code release for ICCV 2021 paper "Anticipative Video Transformer"

Language: Python | License: Apache-2.0 | Stargazers: 151 | Issues: 0

all-in-one

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training

Language: Python | Stargazers: 279 | Issues: 0

webui-aria2

The aim of this project is to create the world's best and hottest interface for interacting with aria2. It is very simple to use: just download it and open index.html in any web browser.

Language: JavaScript | License: MIT | Stargazers: 9901 | Issues: 0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language: Python | License: MIT | Stargazers: 848 | Issues: 0

Awesome-Mixture-of-Experts-Papers

A curated reading list of research on Mixture-of-Experts (MoE).

License: Apache-2.0 | Stargazers: 524 | Issues: 0
Language: Python | License: MIT | Stargazers: 184 | Issues: 0
Language: Python | License: MIT | Stargazers: 103 | Issues: 0

Efficient-PyTorch

My best practices for training on large datasets with PyTorch.

Language: Python | Stargazers: 1080 | Issues: 0

Ego4d

Ego4d dataset repository. Download the dataset, visualize it, extract features, and see example usage of the dataset.

Language: Jupyter Notebook | License: MIT | Stargazers: 343 | Issues: 0

howto100m

Code for the HowTo100M paper

Language: Python | License: Apache-2.0 | Stargazers: 250 | Issues: 0

HERO_Video_Feature_Extractor

Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"

Language: Python | License: MIT | Stargazers: 95 | Issues: 0

video_features

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

Language: Python | License: MIT | Stargazers: 502 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 27 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 30195 | Issues: 0

evit

Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations

Language: Python | License: Apache-2.0 | Stargazers: 168 | Issues: 0

slowfast_feature_extractor

Feature Extractor module for videos using the PySlowFast framework

Language: Python | License: MIT | Stargazers: 76 | Issues: 0

Balanced-DataParallel

An improved version of PyTorch's DataParallel that balances GPU memory usage on the first GPU.

Language: Python | Stargazers: 230 | Issues: 0

VideoX

VideoX: a collection of video cross-modal models

Language: Python | License: NOASSERTION | Stargazers: 968 | Issues: 0

Ego-Exo

Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)

Language: Python | License: NOASSERTION | Stargazers: 33 | Issues: 0

video_feature_extractor

Easy-to-use deep video feature extractor

Language: Python | License: Apache-2.0 | Stargazers: 306 | Issues: 0

YouCook2-Leaderboard

A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.

Stargazers: 38 | Issues: 0

epic-kitchens-download-scripts

Download scripts for EPIC-KITCHENS

Language: Python | Stargazers: 121 | Issues: 0

ProcNets-YouCook2

Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"

Language: Lua | License: MIT | Stargazers: 34 | Issues: 0

20bn-something-something-label-hierarchies

Metadata for the Something-Something dataset

Stargazers: 8 | Issues: 0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Language: Python | License: MIT | Stargazers: 19700 | Issues: 0