Common Visual Data Foundation's repositories
open-images-dataset
Open Images is a dataset of ~9 million images that have been annotated with image-level labels and bounding boxes spanning thousands of classes.
google-landmark
Dataset with 5 million images depicting human-made and natural landmarks spanning 200 thousand classes.
ava-dataset
The AVA dataset densely annotates 80 atomic visual actions in 351k movie clips with actions localized in space and time, resulting in 1.65M action labels with multiple labels per human occurring frequently.