phung025 / orion

This repository contains the implmentation of a cluster-based outlier detection algorithm called Orion in Java.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Orion

Orion is a cluster-based outlier detection algorithm that was introduced by Le Gruenwald and Eleazar Leal from the University of Oklahoma and Shiblee Sadik from Amazon Web Services. Orion uses an evolutionary algorithm and a clustering algorithm to detect anomalies within a dataset. Using an initial set of dimensions of a dataset, Orion uses an evolutionary algorithm to evolve the population set to find a dimension that minimizes the stream density of a data point. Using the stream density and another metric called k-integral, Orion performs co-clustering to assign the data point into a cluster. Based on the cluster, Orion decides whether the data point is an outlier or not.

This repository contains the implmentation of the algorithm in Java. This is part of the research project to re-implement the algorithm to analyze the performance of Orion against several different outlier detection algorithms that have been introduced in the past as well as come up with different approaches to improve the performance of Orion.

References:

In Pursuit of Outliers in Multi-dimensional Data Streams

About

This repository contains the implmentation of a cluster-based outlier detection algorithm called Orion in Java.


Languages

Language:Java 97.4%Language:HTML 2.6%