KimSeHyung's starred repositories
Video-of-Thought
Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"
TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
FreeMan_API
Official Repository for FreeMan dataset
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
FusionFormer
FusionFormer: A Concise Unified Feature Fusion Transformer for 3D Pose Estimation
ContextAware-PoseFormer
The project is an official implementation of our paper "A Single 2D Pose With Context is Worth Hundreds for 3D Human Pose Estimation".