raybenchen / RecipeWatch

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DISCLAIMER

We are currently in the process of cleaning/documenting the code hence we advise everyone to wait for the relase. For the brave ones, the code is here. In order to use it you also need ModalDB https://github.com/ozansener/ModalDB and push all your data and features into it.

RecipeWatch

RecipeWatch is exploring concepts from youtube videos with no supervision. Project is composed of following sub-modules:

  • src/Crawler: A python module which can search youtube and wikihow and download videos with their subtitles if available.

  • src/Settings: Set of parameters and settings used by the entire project. It also includes ModalDB schema etc.

  • Scripts: Python scripts mostly for data crunching/processing.

  • Doc: Documents about the project including presentation, 5 minutes pitch and papers.

  • PreProcessing: A preprocessing pipeline

  • Vision Module: An unsupervised object clustering, feature extraction system.

  • ML Module: An unsupervised representational learning system.

#Pre Processing

#####Get the subtitles

pipeLine_extractSrt.py          Read the SRT/*.srt files and crete python dict

#####Extract frames

pipeLine_frames.py              #Extract frames from video files
pipeLine_md5summary.py          #Get the md5 summary of all frames

#####Process saceled ones

pipeLine_scaledSrt.py           #Convert srt to scaled file srt

#####Extract srt

pipeLine_histLang.py            #Get the histograms of the language

#####Resize

pipeLine_scale.py               #Resize frames for cpmc
pipeLine_cpmc.py                #Create the folder structure for cpmc code
pipeLine_hist.py                #Create .mat file with the list of frames

#Vision Module

collectHistogram.m              #Run after cpmc is done to compute histograms
cpmc_base/                      #Constrained parametric min-cut for object proposals
CoProposals/                    #Code to generate co-object proposals. For the details look at the documentation

#ML Module

#Doc

ProjectLog/                      #Beamer presentation about the project
ProjectPitch/                    #5 mins pitch of the project

About


Languages

Language:JavaScript 33.1%Language:HTML 27.3%Language:C 10.3%Language:CSS 7.8%Language:Fortran 6.1%Language:MATLAB 5.7%Language:C++ 5.4%Language:Python 1.3%Language:Makefile 0.7%Language:Groff 0.7%Language:TeX 0.5%Language:Java 0.3%Language:M 0.3%Language:Objective-C 0.2%Language:Shell 0.1%Language:Stata 0.1%Language:Clean 0.1%Language:Lex 0.0%Language:Prolog 0.0%