There are 0 repository under visual-instruction-tuning topic.
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)
A Video Chat Agent with Temporal Prior
Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey