kamikaze / dataset-format-benchmark

Benchmarks how different image formats affect AI model performance

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

dataset-format-benchmark

This package runs different image format benchmarks for dataset ML tasks

Installation

Make sure you have some system deps installed:

sudo apt install pkg-config libhdf5-dev
python3.11 -m venv venv --upgrade-deps
source venv/bin/activate
python -m pip install -U -r requirements_dev.txt

# For running on Nvidia GPU:
python -m pip install -U torch torchvision

# For running on CPU:
python -m pip install -U torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cpu

# For some reason h5py fails to install Cython while it needs it
python -m pip install -U Cython

python setup.py develop

Running dataset format benchmark

python -m dataset_format_benchmark --data-root /path/to/datasets/

About

Benchmarks how different image formats affect AI model performance

License:GNU General Public License v3.0


Languages

Language:Python 98.7%Language:Dockerfile 1.3%