marianna13 / caption-the-internet

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PythonVersion tests Code style: black PRs Welcome

Caption the internet

Objective

Generate synthetic captions for images, audio and videos to create better datasets.

Usage

Benchmark different captioner model to select best ones: benchmark.sh

You can examine how many resources (gpu memory, number of gpu hrs) you would need to caption images using set of models and how good those captioners are (e.g. using ClipScore). See benchmark_config.json for details.

image

About

License:MIT License


Languages

Language:Jupyter Notebook 78.7%Language:Python 20.7%Language:Dockerfile 0.3%Language:Shell 0.2%Language:Makefile 0.1%