stdKonjac

stdKonjac

Geek Repo

Company:Tsinghua University

Location:Shenzhen, Guangdong, China

Home Page:https://www.stdkonjac.icu/

Twitter:@stdKonjac

Github PK Tool:Github PK Tool

stdKonjac's starred repositories

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonLicense:NOASSERTIONStargazers:6828Issues:57Issues:183

DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Language:C++License:Apache-2.0Stargazers:4931Issues:93Issues:1555

nccl

Optimized primitives for collective multi-GPU communication

Language:C++License:NOASSERTIONStargazers:2864Issues:150Issues:1109

RepDistiller

[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods

Language:PythonLicense:BSD-2-ClauseStargazers:2095Issues:17Issues:56

fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Language:PythonLicense:Apache-2.0Stargazers:1899Issues:41Issues:77

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

Language:C++License:Apache-2.0Stargazers:1663Issues:30Issues:236

V2Ray-Desktop

最优雅的跨平台代理客户端,支持Shadowsocks(R),V2Ray和Trojan协议。The most elegant cross-platform proxy GUI client that supports Shadowsocks(R), V2Ray, and Trojan. Built with Qt5 and QML2.

Language:QMLLicense:GPL-3.0Stargazers:1515Issues:28Issues:99

PaddleVideo

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.

Language:PythonLicense:Apache-2.0Stargazers:1433Issues:38Issues:311

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1227Issues:17Issues:116

moco-v3

PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057

Language:PythonLicense:NOASSERTIONStargazers:1158Issues:17Issues:34

VideoX

VideoX: a collection of video cross-modal models

Language:PythonLicense:NOASSERTIONStargazers:935Issues:22Issues:109

VL-BERT

Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".

Language:Jupyter NotebookLicense:MITStargazers:734Issues:14Issues:83

omnivore

Omnivore: A Single Model for Many Visual Modalities

Language:PythonLicense:NOASSERTIONStargazers:546Issues:19Issues:31

ActionCLIP

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Language:PythonLicense:MITStargazers:467Issues:4Issues:50

UniCL

[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"

Language:PythonLicense:MITStargazers:369Issues:20Issues:11

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language:PythonLicense:MITStargazers:329Issues:11Issues:45

bit-diffusion

Implementation of Bit Diffusion, Hinton's group's attempt at discrete denoising diffusion, in Pytorch

Language:PythonLicense:MITStargazers:312Issues:5Issues:8

all-in-one

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training

pims

Python Image Sequence: Load video and sequential images in many formats with a simple, consistent interface.

Language:PythonLicense:NOASSERTIONStargazers:258Issues:14Issues:208

merlot

MERLOT: Multimodal Neural Script Knowledge Models

Language:PythonLicense:MITStargazers:221Issues:14Issues:18

deffcode

A cross-platform High-performance FFmpeg based Real-time Video Frames Decoder in Pure Python 🎞️⚡

Language:PythonLicense:Apache-2.0Stargazers:166Issues:4Issues:31

BMVCTemplate

Paper template and author instructions for BMVC

merlot_reserve

Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"

Language:PythonLicense:MITStargazers:135Issues:5Issues:25

singularity

[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"

Language:PythonLicense:MITStargazers:124Issues:2Issues:30

TVLT

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)

Language:Jupyter NotebookLicense:MITStargazers:118Issues:3Issues:16

qb-norm

Cross Modal Retrieval with Querybank Normalisation

Language:PythonLicense:MITStargazers:51Issues:4Issues:6

svo_probes

The SVO-Probes Dataset for Verb Understanding

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:27Issues:4Issues:1
Language:PythonStargazers:7Issues:0Issues:0

DALI

A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications

Language:C++License:Apache-2.0Stargazers:6Issues:0Issues:0