Borda / kaggle_image-classify

Various Kaggle image classification challenges solutions

Home Page:https://borda.github.io/kaggle_plant-pathology

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Question about dataset

albertoceballos opened this issue · comments

Is the plant pathology 2021 dataset used in the Pytorch Lightning notebook a resized version of the dataset? There are some commands that are commented out:

# ! apt-get install -qq -y imagemagick
# ! mogrify -resize 960 train_images/*.jpg

Were these commands used to resize the dataset before running the notebook?

Yes, we generated a downscaled version of the dataset offline due to a bottleneck issue with loading large images with pil.