İlker Kesen (ilkerkesen)

Company: KUIS AI Center

Location: İstanbul

Home Page: http://ilkerkesen.github.io

Twitter: @ilker_kesen


Organizations
ai-ku
ITURO
OTOKON
ozgurlukicin

İlker Kesen's repositories

frozen

A PyTorch implementation of Multimodal Few-Shot Learning with Frozen Language Models with OPT.

Language: Jupyter Notebook | License: MIT | Stargazers: 40 | Issues: 3 | Issues: 2

GAN

Generative Adversarial Networks in Knet

Language: Julia | License: MIT | Stargazers: 13 | Issues: 2 | Issues: 1

ViLMA

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)

Language: Python | License: MIT | Stargazers: 12 | Issues: 5 | Issues: 1

bvpr

[MULA Workshop @ CVPR 2022] Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters

Language: Jupyter Notebook | License: MIT | Stargazers: 4 | Issues: 4 | Issues: 0

euphemism

Official Implementation of "Detecting Euphemisms with Literal Descriptions and Visual Imagery"

Language: Python | License: MIT | Stargazers: 3 | Issues: 2 | Issues: 0

.emacs.d

Personal Emacs Configuration

Language: Emacs Lisp | License: MIT | Stargazers: 2 | Issues: 2 | Issues: 0

adapter-transformers

Huggingface Transformers + Adapters = ❤️

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

ilkerkesen

Repository for my bio

Language: Python | License: MIT | Stargazers: 1 | Issues: 5 | Issues: 0

RAM

Recurrent Models of Visual Attention implementation in Julia/Knet

Language: Julia | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0

Sloth.jl

Lazy bums' Knet package

Language: Julia | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 1

Ask-Anything

[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Awesome-Referring-Image-Segmentation

:books: A collection of papers about Referring Image Segmentation.

Stargazers: 0 | Issues: 0 | Issues: 0

caption_metrics

Evaluation Metrics for Image Captioning

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

ClipBERT

[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

colorfromlanguage

Code base of the paper: Learning to Color from Language

Language: OpenEdge ABL | Stargazers: 0 | Issues: 1 | Issues: 0

colorization

Automatic colorization using deep neural networks. "Colorful Image Colorization." In ECCV, 2016.

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

DeepLabV3Plus-Pytorch

DeepLabv3, DeepLabv3+ and pretrained weights on VOC & Cityscapes

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

dotfiles

Personal Configuration Files

Language: Shell | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

DRAW

Knet implementation of DRAW: A Recurrent Neural Network For Image Generation

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 5 | Issues: 1

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ilkerkesen.github.io

Personal Website

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

MCQ

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

mPLUG-2

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-deeplab-xception

DeepLab v3+ model in PyTorch. Support different backbones.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

singularity

[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

UVR-NMT

Neural Machine Translation with universal Visual Representation (ICLR 2020)

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

VideoCLIP

VideoCLIP and VLM implementations for a custom benchmark (originally from fairseq).

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0