anunciado / IMD0030-K-Means

A clustering problem solution using k-means for UFRN IMD class IMD0030: Programming Language I.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

K-Means

A clustering problem solution using k-means. K-means clustering is a type of unsupervised learning, which is used when you have unlabeled data (i.e., data without defined categories or groups). The goal of this algorithm is to find groups in the data, with the number of groups represented by the variable K. The algorithm works iteratively to assign each data point to one of K groups based on the features that are provided. Data points are clustered based on feature similarity. In short, in this project, given a file like the one below, the program will be able to find the groups of lines with similar values in the columns:

Feature 1 Feature 2 Feature 3 Feature 4 Feature 5 Feature 6 ...
1 1 3 2 2 739 ...
2 3 2 33 1 628 ...
2 4 3 3 3 563 ...
... ... ... ... ... ... ...

Prerequisites

You will need to install the modules below to run the program:

Running

To run this software, we gonna compile in the terminal:

  1. Enter the folder and compile all the .cpp files with the following command:
$ make
  1. Enter the bin folder and run the following command:
$ ./labfinal <dataframe>

An example would be:

$ ./labfinal ../input/iris.csv
  1. From this it only interacts with the system.

Built With

  • Atom - A code editor used

Authors

Developers:

Project Advisor:

See also the list of contributors who participated in this project.

Refference

License

This project is licensed under the GPL 3.0 - see the LICENSE file for details

About

A clustering problem solution using k-means for UFRN IMD class IMD0030: Programming Language I.

License:GNU General Public License v3.0


Languages

Language:C++ 96.7%Language:Makefile 3.3%