yao5461 / DKmeans

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Distribute K-Means 
=================
Try to implement a distribute K-Means.
The first idea is to implement it by using MapReduce base Hadoop.
The second idea is to use C/S model base socket communication.
This project is for the second idea.

Detail in this project:
I try to implement K-Means in distribute way. A simply solution is to create several calculate processes and one controller process. The controller process is responsible for sending message and data to calculate process and receive the result. There are mulptiple threads in controller process. All calculate processes are the same except for the port number. It is similiar with C/S(I think so). I think it is a solution. I will change this if there are something wrong.

In this project:
controller process --- Controller
calculate process --- Mapper

The project run in Linux.
coding language: C++
using TCP/IP protocol, socket (UC Berkeley), pthread


Author: yao 

About


Languages

Language:C++ 91.2%Language:C 8.8%