Kajiyu / LLLNet

Keras Implementation of "Look, Listen and Learn" Model

LLLNet

About

This is a Keras implementation of the "Look, Listen and Learn" model based on the research by R. Arandjelović and A. Zisserman at DeepMind. The model learns cross-modal features shared between audio and images.

Core Concept

Audio-visual correspondence (AVC) task: the network is trained to predict whether a video frame and a short audio clip come from the same video.
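As a rough illustration of how AVC training pairs can be built (a minimal sketch, not the repo's actual data pipeline; function and variable names are invented for this example), positives pair a frame with audio from the same video, while negatives draw the audio from a different video:

```python
import numpy as np

def sample_avc_pairs(frames, audios, n, rng):
    """Return (frame, audio, label) triples: label 1 when the audio
    comes from the same video as the frame, 0 otherwise."""
    n_videos = len(frames)
    triples = []
    for _ in range(n):
        v = rng.integers(n_videos)
        if rng.random() < 0.5:
            # corresponding pair: frame and audio from the same video
            triples.append((frames[v], audios[v], 1))
        else:
            # mismatched pair: audio from any *other* video
            other = (v + 1 + rng.integers(n_videos - 1)) % n_videos
            triples.append((frames[v], audios[other], 0))
    return triples

# Toy stand-ins for real frame images and audio spectrograms
rng = np.random.default_rng(0)
frames = [f"frame_{i}" for i in range(5)]
audios = [f"audio_{i}" for i in range(5)]
pairs = sample_avc_pairs(frames, audios, 100, rng)
```

A binary classifier trained on such pairs learns aligned audio and visual embeddings as a side effect, which is the core idea of the paper.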

Differences from the Original Model

  • SqueezeNet is used for the visual CNN.

Model Figure
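SqueezeNet's building block is the "fire module": a 1×1 "squeeze" convolution cuts the channel count, then parallel 1×1 and 3×3 "expand" convolutions are concatenated. The NumPy sketch below shows just that channel arithmetic (illustrative sizes, not the repo's actual layers; the real implementation would use Keras convolution layers):

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0)

def conv2d(x, w):
    # x: (H, W, Cin); w: (k, k, Cin, Cout); 'same' padding, stride 1
    k = w.shape[0]
    p = k // 2
    xp = np.pad(x, ((p, p), (p, p), (0, 0)))
    H, W, _ = x.shape
    out = np.zeros((H, W, w.shape[-1]))
    for i in range(H):
        for j in range(W):
            patch = xp[i:i + k, j:j + k, :]
            out[i, j] = np.tensordot(patch, w, axes=([0, 1, 2], [0, 1, 2]))
    return out

def fire_module(x, w_squeeze, w_e1, w_e3):
    # squeeze reduces channels; the two expand branches are concatenated
    s = relu(conv2d(x, w_squeeze))
    return np.concatenate([relu(conv2d(s, w_e1)),
                           relu(conv2d(s, w_e3))], axis=-1)

rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8, 16))
w_s = rng.standard_normal((1, 1, 16, 4)) * 0.1   # squeeze 16 -> 4 channels
w_e1 = rng.standard_normal((1, 1, 4, 8)) * 0.1   # expand 1x1 -> 8 channels
w_e3 = rng.standard_normal((3, 3, 4, 8)) * 0.1   # expand 3x3 -> 8 channels
y = fire_module(x, w_s, w_e1, w_e3)
print(y.shape)  # (8, 8, 16): the two expand branches concatenated
```

The squeeze step is what keeps SqueezeNet small, which makes it a lighter substitute for the original paper's VGG-style visual subnetwork.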

Languages

Jupyter Notebook 86.7%, Python 13.3%