Learning to Use Tools for Creating Multimodal Agents -- LLaVA-Plus (Large Language and Vision Assistants that Plug and Learn to Use Skills)
https://llava-vl.github.io/llava-plus/