aficionadoai / outliernmf

Outlier Detection for Text Data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

outliernmf

This is the matlab implementation for the paper titled "Outlier Detection for Text Data"

  • Generate a labelled dataset with 0 as non-outliers and 1 as outliers.
  • Call the textoutliers function with the sparse bag of words matrix, low rank k, parameters alpha and beta.
  • It will return three matrices Z,W,H.
  • Use L2 score of every column of Z as the outlier score for every document.
  • Call ROC curve module with L2 score of every column of Z against the labels from step 1.
  • The code is tested with MATLAB R2011b (MATLAB 7.13). It should also work with any later versions. It must also work fine with octave

Citation

  • Ramakrishnan Kannan, Hyenkyun Woo, Charu C. Aggarwal and Haesun Park. "Outlier Detection for Text Data", Accepted at 2017 International Conference on SIAM Data Mining

About

Outlier Detection for Text Data


Languages

Language:MATLAB 100.0%