stdKonjac

stdKonjac's starred repositories

Magpie

An all-purpose window upscaler for Windows 10/11.

Language:HLSLGPL-3.07785 68 556

faster-rcnn.pytorch

A faster pytorch implementation of faster r-cnn

Language:PythonMIT7612 91 837

DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Language:C++Apache-2.04955 94 1559

voxelmorph

Unsupervised Learning for Image Registration

Language:PythonApache-2.02184 48 441

RepDistiller

[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods

Language:PythonBSD-2-Clause2099 17 56

fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Language:PythonApache-2.01914 41 77

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.

Language:PythonApache-2.01438 38 314

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonNOASSERTION1245 16 118

DL-NLP-Readings

My Reading Lists of Deep Learning and Natural Language Processing

Language:TeXMIT848 79 1

kinetics_i3d_pytorch

Inflated i3d network with inception backbone, weights transfered from tensorflow

Language:PythonMIT518 14 27

BackdoorBox

The open-sourced Python toolbox for backdoor attacks and defenses.

Language:PythonGPL-2.0384 8 11

Low-rank-Multimodal-Fusion

This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018

Language:Python242 8 26

TeViT

Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral

Language:PythonMIT234 8 12

merlot

MERLOT: Multimodal Neural Script Knowledge Models

Language:PythonMIT222 14 18

Awesome-Cross-Modal-Video-Moment-Retrieval

前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。

210 11 3

OGM-GE_CVPR2022

The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)

Language:PythonMIT205 4 43

S3D_HowTo100M

S3D Text-Video model trained on HowTo100M using MIL-NCE

Language:PythonApache-2.0187 10 13

UMT

UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.

Language:PythonNOASSERTION184 6 53