Meshford / fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Generation".

Home Page:https://jykoh.com/fromage

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Grounding Language Models to Images for Multimodal Inputs and Outputs

Setup instructions

  1. Clone the repo:
git clone https://github.com/Meshford/fromage.git  
cd fromage  
  1. Make docker image (It's can take some minutes due to dependencies):
docker build -t fromage_image .  
  1. Run container. Container runs tests, and you'll see result of condacted experiment. (Result should be True):
docker run fromage_image

About

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Generation".

https://jykoh.com/fromage

License:Apache License 2.0


Languages

Language:Jupyter Notebook 95.1%Language:Python 4.9%Language:Dockerfile 0.1%