Ohio University Big Data Club, DeepCats

Our models are simple modifications of the YT8M-2017 winning solution: https://github.com/antoine77340/Youtube-8M-WILLOW.

Our best model is LightVLAD, which ranked 18th out of 394 in YT8M-2018. The training parameters are:

We found several patterns:

Increasing moe_num_mixtures to 4 (default is 2) can improve the score by around 0.003, and the model size increases by only 50M.
Training for too many iterations (> 200K) decreased the score.
Adding dropout didn't increase the score.
Adding more layers didn't increase the score.
Switching the activation to ReLU, ReLU6, or Leaky ReLU didn't increase the score.
NetVLAD is the second-best model, after LightVLAD.
In the LightVLAD model, the number of hidden neurons matters more than netvlad_cluster_size.
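
To give a feel for why raising moe_num_mixtures grows the model only moderately, here is a rough sketch of how the classifier's weight count scales. It assumes the mixture-of-experts layout used in the YouTube-8M starter code (a gating layer with `num_mixtures + 1` outputs per class and an expert layer with `num_mixtures` outputs per class) and ignores biases. The function name `moe_weight_count` and the input size of 1024 are hypothetical and chosen only for illustration; they are not taken from our job files.

```python
def moe_weight_count(input_dim, vocab_size, num_mixtures):
    """Approximate number of weights in a mixture-of-experts classifier.

    Assumed layout (not verified against this repo's code):
      - gating layer:  input_dim x (vocab_size * (num_mixtures + 1))
      - expert layer:  input_dim x (vocab_size * num_mixtures)
    Biases are ignored.
    """
    gate_weights = input_dim * vocab_size * (num_mixtures + 1)
    expert_weights = input_dim * vocab_size * num_mixtures
    return gate_weights + expert_weights


if __name__ == "__main__":
    input_dim = 1024    # hypothetical size of the features fed to the classifier
    vocab_size = 3862   # number of classes in the YT8M-2018 vocabulary
    for mixtures in (2, 4):
        count = moe_weight_count(input_dim, vocab_size, mixtures)
        print(f"moe_num_mixtures={mixtures}: {count:,} weights")
```

Under these assumptions each additional mixture adds `2 * input_dim * vocab_size` weights, so the classifier grows linearly with moe_num_mixtures while the rest of the network stays the same size.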

To see the actual command, check out the job files in each folder.

MIT License

Language:Python 100.0%