Lim Geun Taek's repositories
ai-tech-interview
๐ฉโ๐ป๐จโ๐ป AI ์์ง๋์ด ๊ธฐ์ ๋ฉด์ ์คํฐ๋ (โญ๏ธ 1k+)
annotated_deep_learning_paper_implementations
๐งโ๐ซ 60 Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐ง
AvatarCLIP
[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
CoLA
[CVPR2021] CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
Deep-Learning-with-Pytorch
pytorch study
FAME
Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)
HiCo
CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
CoCLR
[NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.
detr
End-to-End Object Detection with Transformers
ECCV2022-DELU
[ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
Misc-Cheatsheet
๋ํ์ ์ํ์ ํ๋ฉฐ ์ฌ์ฉํ๋ ์๊ณ ์์คํ ์ฝ๋ฉํ (linux ๋ช ๋ น์ด ๋ฑ)
ML-YouTube-Courses
๐บ Discover the latest machine learning / AI courses on YouTube.
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Scene-Graph-Benchmark.pytorch
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper โUnbiased Scene Graph Generation from Biased Training CVPR 2020โ
SceneSeg
Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
SceneSegmentation-SCRL
Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"
Self-Contained-Video-Entity-Discovery
This is the official implementation and benchmark of the "Self-Contained Entity Discovery in Captioned Videos" paper.paper
SHG-VQA
Learning Situation Hyper-Graphs for Video Question Answering
SogCLR
Official implementation of the paper "Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance", ICML2022.
VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
VLM_survey
Collection of AWESOME vision-language models for vision tasks
VStates
Video Evnet Extraction via Tracking Visual States of Arguments (AAAI2023)