Lim Geun Taek's repositories
Awesome-Multimodal-Large-Language-Models
Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
VLM_survey
Collection of AWESOME vision-language models for vision tasks
detr
End-to-End Object Detection with Transformers
annotated_deep_learning_paper_implementations
60 Implementations/tutorials of deep learning papers with side-by-side notes; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), reinforcement learning (ppo, dqn), capsnet, distillation, ...
Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of the paper "Unbiased Scene Graph Generation from Biased Training" (CVPR 2020).
ML-YouTube-Courses
Discover the latest machine learning / AI courses on YouTube.
SHG-VQA
Learning Situation Hyper-Graphs for Video Question Answering
VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
SogCLR
Official implementation of the paper "Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance", ICML2022.
ECCV2022-DELU
[ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
ai-tech-interview
AI engineer technical interview study group (1k+ stars)
VStates
Video Event Extraction via Tracking Visual States of Arguments (AAAI 2023)
AvatarCLIP
[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
SceneSegmentation-SCRL
Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"
FAME
Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)
Deep-Learning-with-Pytorch
PyTorch study
Self-Contained-Video-Entity-Discovery
This is the official implementation and benchmark of the "Self-Contained Entity Discovery in Captioned Videos" paper.
HiCo
CVPR 2022: Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
SceneSeg
Codebase for the CVPR 2020 paper "A Local-to-Global Approach to Multi-modal Movie Scene Segmentation"
CoLA
[CVPR2021] CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
Misc-Cheatsheet
Small but handy coding tips used during graduate school life (Linux commands, etc.)