stdKonjac

stdKonjac

Geek Repo

Company:Tsinghua University

Location:Shenzhen, Guangdong, China

Home Page:https://www.stdkonjac.icu/

Twitter:@stdKonjac

Github PK Tool:Github PK Tool

stdKonjac's starred repositories

Magpie

An all-purpose window upscaler for Windows 10/11.

Language:HLSLLicense:GPL-3.0Stargazers:7785Issues:68Issues:556

faster-rcnn.pytorch

A faster pytorch implementation of faster r-cnn

Language:PythonLicense:MITStargazers:7612Issues:91Issues:837

DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Language:C++License:Apache-2.0Stargazers:4955Issues:94Issues:1559

voxelmorph

Unsupervised Learning for Image Registration

Language:PythonLicense:Apache-2.0Stargazers:2184Issues:48Issues:441

RepDistiller

[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods

Language:PythonLicense:BSD-2-ClauseStargazers:2099Issues:17Issues:56

fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Language:PythonLicense:Apache-2.0Stargazers:1914Issues:41Issues:77

PaddleVideo

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.

Language:PythonLicense:Apache-2.0Stargazers:1438Issues:38Issues:314

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1245Issues:16Issues:118

DL-NLP-Readings

My Reading Lists of Deep Learning and Natural Language Processing

Language:TeXLicense:MITStargazers:848Issues:79Issues:1

kinetics_i3d_pytorch

Inflated i3d network with inception backbone, weights transfered from tensorflow

Language:PythonLicense:MITStargazers:518Issues:14Issues:27

BackdoorBox

The open-sourced Python toolbox for backdoor attacks and defenses.

Language:PythonLicense:GPL-2.0Stargazers:384Issues:8Issues:11

Low-rank-Multimodal-Fusion

This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018

TeViT

Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral

Language:PythonLicense:MITStargazers:234Issues:8Issues:12

merlot

MERLOT: Multimodal Neural Script Knowledge Models

Language:PythonLicense:MITStargazers:222Issues:14Issues:18

Awesome-Cross-Modal-Video-Moment-Retrieval

前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。

OGM-GE_CVPR2022

The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)

Language:PythonLicense:MITStargazers:205Issues:4Issues:43

S3D_HowTo100M

S3D Text-Video model trained on HowTo100M using MIL-NCE

Language:PythonLicense:Apache-2.0Stargazers:187Issues:10Issues:13

UMT

UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.

Language:PythonLicense:NOASSERTIONStargazers:184Issues:6Issues:53

BMVCTemplate

Paper template and author instructions for BMVC

merlot_reserve

Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"

Language:PythonLicense:MITStargazers:135Issues:5Issues:25

MCQ

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

s3d.pytorch

Spatiotemporal-separable 3D convolution network.

Language:PythonLicense:MITStargazers:117Issues:4Issues:10

HC-STVG

The HC-STVG Dataset

ReLoCLNet

Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)

Language:PythonLicense:MITStargazers:51Issues:1Issues:7

DRN

Dense Regression Network for Video Grounding (CVPR2020)

CONQUER

[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval

mTVRetrieval

[ACL 2021] mTVR: Multilingual Video Moment Retrieval

WWW22-HCQ

The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22).

Language:PythonStargazers:7Issues:0Issues:0