philbinj / fastcluster

A library for fast, distributed clustering

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

---------------------------------------------------------------------
| FASTCLUSTER: A library for fast, distributed clustering           |
|                                                                   |
| James Philbin <philbinj@gmail.com>                                |
---------------------------------------------------------------------
This is a python library for performing fast, distributed (using MPI)
clustering for very large datasets.

Currently we only support k-means.

---------------------------------------------------------------------
| INSTALLATION                                                      |
---------------------------------------------------------------------
Before installation make sure the following dependencies are met:
- Linux
- CMake >= 2.6.0
- Python >= 2.5 (but < 3.0)
- MPI library (OpenMPI is recommended)
- Numpy (http://numpy.scipy.org/)
- PyTables (http://www.pytables.org/)
- fastann library (http://github.com/philbinj/fastann)

Build the library to install to /usr/local/ (default is /usr/)
> PREFIX=/usr/local/ cmake . && make

Install
> make install

---------------------------------------------------------------------
| USAGE                                                             |
---------------------------------------------------------------------
See help(fastcluster.kmeans) for usage.

---------------------------------------------------------------------
| CHANGELOG                                                         |
---------------------------------------------------------------------
v0.1
    - Initial commit

---------------------------------------------------------------------
| REFERENCES                                                        |
---------------------------------------------------------------------
[1] Philbin, J. and Chum, O. and Isard, M. and Sivic, J. and 
        Zisserman, A.
    Object Retrieval with Large Vocabularies and Fast Spatial 
    Matching, CVPR 2007

About

A library for fast, distributed clustering

License:BSD 3-Clause "New" or "Revised" License


Languages

Language:C++ 66.8%Language:C 20.8%Language:Python 12.4%