There are 2 repositories under the fuyu topic.
Official code for the paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024]
Caption images across your datasets with state-of-the-art models from Hugging Face and Replicate!
Fuyu multi-modal language model for use with Autodistill.
Hands-on experiments with some multimodal models
Testing NVIDIA machine learning API models