night18 / ChiMerge

Implement ChiMerge [1992] via python

Home Page:https://chunwei.org/#chimerge

ChiMerge

ChiMerge was introduced by Randy Kerber in 1992. It is an algorithm that uses the χ² statistic to discretize numeric attributes, which is a good way to reduce the computational effort before data mining.

ChiMerge consists of 5 basic steps:

  1. Sort the data in ascending order.
  2. Define initial intervals so that every value is in a separate interval.
  3. Calculate the χ² of every pair of adjacent intervals.
  4. Find the smallest χ² and merge the two intervals that produce it.
  5. Repeat steps 3 and 4 until every χ² is larger than the threshold value.
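
The steps above can be sketched in a few lines of Python. This is a minimal illustration, not the code from this repository: the function names `chi2` and `chimerge`, the use of `Counter` for per-class counts, and the choice to skip classes with a zero expected count are all assumptions made for the example.

```python
from collections import Counter


def chi2(a, b, classes):
    """Chi-square statistic for two adjacent intervals.

    a and b are Counters mapping class label -> number of samples.
    """
    total = sum(a.values()) + sum(b.values())
    stat = 0.0
    for row in (a, b):
        row_total = sum(row.values())
        for c in classes:
            col_total = a[c] + b[c]
            expected = row_total * col_total / total
            # Simplification (assumption): ignore classes absent from both intervals.
            if expected > 0:
                stat += (row[c] - expected) ** 2 / expected
    return stat


def chimerge(values, labels, threshold):
    """Return the lower bound of each interval left after merging."""
    classes = sorted(set(labels))
    # Steps 1 and 2: sort, then start with one interval per distinct value.
    counts = {}
    for v, y in zip(values, labels):
        counts.setdefault(v, Counter())[y] += 1
    intervals = [[v, counts[v]] for v in sorted(counts)]

    while len(intervals) > 1:
        # Step 3: chi-square of every pair of adjacent intervals.
        stats = [chi2(intervals[i][1], intervals[i + 1][1], classes)
                 for i in range(len(intervals) - 1)]
        # Step 4: find the pair with the smallest chi-square.
        i = min(range(len(stats)), key=stats.__getitem__)
        # Step 5: stop once every chi-square exceeds the threshold.
        if stats[i] > threshold:
            break
        intervals[i][1] = intervals[i][1] + intervals[i + 1][1]
        del intervals[i + 1]
    return [lower for lower, _ in intervals]


cut_points = chimerge([1, 2, 3, 4, 5, 6],
                      ["a", "a", "a", "b", "b", "b"],
                      threshold=2.7)
print(cut_points)  # [1, 4]
```

In this toy input the two classes separate cleanly between 3 and 4, so the result is two intervals starting at 1 and 4. The threshold 2.7 is only an illustrative value; in practice it is usually taken from a χ² table for the chosen significance level.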

The project uses the iris data set as an example. You can get the data from https://archive.ics.uci.edu/ml/machine-learning-databases/iris/
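
As a rough sketch of how one attribute could be pulled out of that data for discretization, the snippet below parses comma-separated rows with four numeric columns followed by a class name. The column layout and the helper name `load_column` are assumptions for illustration, and the three rows in `sample` are only example rows in that layout, not a substitute for the downloaded file.

```python
import csv
import io


def load_column(text, column):
    """Return (values, labels) for one numeric column of iris-style text."""
    values, labels = [], []
    for row in csv.reader(io.StringIO(text)):
        if not row:  # skip blank lines
            continue
        values.append(float(row[column]))
        labels.append(row[-1])  # assumed: class name is the last field
    return values, labels


# Illustrative rows in the assumed layout.
sample = """5.1,3.5,1.4,0.2,Iris-setosa
7.0,3.2,4.7,1.4,Iris-versicolor
6.3,3.3,6.0,2.5,Iris-virginica
"""

values, labels = load_column(sample, 0)
print(values)  # [5.1, 7.0, 6.3]
```

The resulting `values` and `labels` lists are the kind of input a ChiMerge routine needs: one numeric attribute plus the class of each sample.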
