foundation-multimodal-models

foundation-multimodal-models

Organization data from Github https://github.com/foundation-multimodal-models

Dedicated to contributing advanced and foundation multimodal models to the open source

GitHub:@foundation-multimodal-models

foundation-multimodal-models's repositories

CAL

[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

Language:PythonLicense:Apache-2.0Stargazers:57Issues:0Issues:5
Language:PythonLicense:Apache-2.0Stargazers:51Issues:2Issues:9

ConBench

[NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models".

Language:PythonLicense:Apache-2.0Stargazers:38Issues:1Issues:1

World2Code

Official PyTorch Implementation of World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering

Language:ShellStargazers:3Issues:1Issues:0