Meshford / fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Generation".

Grounding Language Models to Images for Multimodal Inputs and Outputs

git clone https://github.com/Meshford/fromage.git  
cd fromage

docker build -t fromage_image .

Run container. Container runs tests, and you'll see result of condacted experiment. (Result should be True):

docker run fromage_image

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Generation".

Apache License 2.0

Language:Jupyter Notebook 95.1%Language:Python 4.9%Language:Dockerfile 0.1%