Wooyoung Kang (edwin.kang)'s repositories
Language:Jupyter Notebook000
Charades
Charades_Ego Baseline
Language:PythonGPL-3.0000
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:PythonApache-2.0000
mPLUG-Owl
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
Language:PythonMIT000
pycocoevalcap
Python 3 support for the MS COCO caption evaluation tools
Language:PythonNOASSERTION000
TextguidedATT
The implementation of Text-guided Attention Model for Image Captioning
Language:Jupyter NotebookNOASSERTION000
Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
Language:PythonApache-2.0000