Beast code in Giters

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language:PythonMIT010

colorfromlanguage

Code base of the paper : Learning to Color from Language

Language:OpenEdge ABL010

colorization

Automatic colorization using deep neural networks. "Colorful Image Colorization." In ECCV, 2016.

Language:PythonBSD-2-Clause010

DeepLabV3Plus-Pytorch

DeepLabv3, DeepLabv3+ and pretrained weights on VOC & Cityscapes

Language:PythonMIT010

dotfiles

Personal Configuration Files

Language:ShellMIT020

DRAW

Knet implementation of DRAW: A Recurrent Neural Network For Image Generation

Language:Jupyter NotebookMIT05 1

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language:PythonMIT000

ilkerkesen.github.io

Personal Website

Language:JavaScriptMIT010

MCQ

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

Language:Python000

mPLUG-2

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

Language:PythonApache-2.0000

pytorch-deeplab-xception

DeepLab v3+ model in PyTorch. Support different backbones.

Language:PythonMIT010

singularity

[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"

Language:PythonMIT000

UVR-NMT

Neural Machine Translation with universal Visual Representation (ICLR 2020)

Language:Python010

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonBSD-3-Clause000

VideoCLIP

VideoCLIP and VLM implementations for custom benchmark (originally it's fairseq).

Language:PythonMIT010

VindLU

Language:PythonMIT000