A system to process visual input on timed frames to produce sensible audio aid in accordance with human information processing limits, using image captioning, semantic text comparison and text-to-speech modules.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool