José María Luna's repositories
ClusterIndices
This package contains the code for executing clustering validity indices in Spark. The package includes BD-Silhouette, BD-Dunn, Davies-Bouldin and WSSSE indices.
ExternalValidity
This package contains the code for calculating external clustering validity indices in Spark. The package includes Chi Index among others.
RandomClustersGenerator
📊 Python tool for creating datasets with clusters using a normal distribution. Customize clusters, significant columns, and add variability with dummy columns. Ideal for testing clustering algorithms.
CreateRandomDataset
This package contains the code for generating Big Data random datasets in Spark.
smallDataIndex
This package contains the code for executing clustering validity indices in Java by using K-means from Weka. The package includes the following clustering validity indices: Silhouette, Dunn, BD-Silhouette, BD-Dunn, Davies-Bouldin, Calinski-Harabasz, MaximumDiameter, SquaredDistance, AverageDistance, AverageBetweenClusterDistance, MinimumDistance.
Clustering-Datasets
This repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.
MethodComparisonsInPython
Friedman tests for comparing multiple methods across datasets in python
SeminarioDeRiquelme
Clustering