laurita / groupit

Web interface for scluster

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

groupit

Web interface for a simple customized string clustering application. Try it at http://groupit-ruby.herokuapp.com.

Uses a scluster gem for string clustering. Before the clustering is performed, data cleaning is done. It includes separation of people and companies based on a custom regexp, emoving postcodes, company types and performing the clustering on the set of company strings.

The cluster names are predicted based on the longest common substring of all the strings in the cluster.

About

Web interface for scluster


Languages

Language:Ruby 89.6%Language:JavaScript 9.1%Language:CoffeeScript 1.4%