jim-schwoebel / allie

🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.

Create cleaning scripts for text, image, video, and csv files

jim-schwoebel opened this issue

Specify cleaning scripts for each of these.

  • text - if the file has an abnormal number of characters, do not process it
  • image - if the image is too large (e.g. above a size limit)
  • video - if the video is too large (e.g. only keep videos <1 min)
  • csv - if the csv has too many columns or contains values that are not purely numerical or categorical, do something
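
As a starting point for discussion, here is a minimal sketch of what such checks could look like using only the Python standard library. It is not Allie's actual implementation: the function names, the thresholds, and the `max_categories` rule are all hypothetical, since the issue does not specify any values. File size stands in for "too big" for both image and video files; checking video duration would need an extra library and is left out.

```python
import csv
import io
import os

# Hypothetical thresholds; the issue does not specify any values.
MIN_TEXT_CHARS = 1
MAX_TEXT_CHARS = 100_000
MAX_FILE_BYTES = 50 * 1024 * 1024
MAX_CSV_COLUMNS = 500


def text_is_clean(text, min_chars=MIN_TEXT_CHARS, max_chars=MAX_TEXT_CHARS):
    """Return True if the stripped text length is within the allowed range."""
    return min_chars <= len(text.strip()) <= max_chars


def file_is_small_enough(path, max_bytes=MAX_FILE_BYTES):
    """Return True if the file (image or video) is at most max_bytes."""
    return os.path.getsize(path) <= max_bytes


def _is_number(value):
    """Return True if the string can be parsed as a float."""
    try:
        float(value)
        return True
    except ValueError:
        return False


def csv_is_clean(csv_text, max_columns=MAX_CSV_COLUMNS, max_categories=20):
    """Return True if the CSV is narrow enough and each column is either
    fully numeric or has few enough distinct values to be categorical."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    if len(rows) < 2:  # need a header and at least one data row
        return False
    header, data = rows[0], rows[1:]
    if len(header) > max_columns:
        return False
    if any(len(row) != len(header) for row in data):  # ragged rows
        return False
    for column in zip(*data):
        numeric = all(_is_number(value) for value in column)
        categorical = len(set(column)) <= max_categories
        if not (numeric or categorical):
            return False
    return True
```

Each function returns a boolean so the caller can decide what "do something" means (skip, delete, or move the file), which the issue leaves open.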