foundation-multimodal-models's repositories
World2Code
Official PyTorch Implementation of World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering
Organization data from Github https://github.com/foundation-multimodal-models
Dedicated to contributing advanced and foundation multimodal models to the open source
Official PyTorch Implementation of World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering