clustering-analysis

There are 4 repositories under clustering-analysis topic.

milaan9 / Clustering_Algorithms_from_Scratch
Implementing Clustering Algorithms from scratch in MATLAB and Python
cluster cluster-analysis clustering-algorithms clustering-analysis clustering-benchmark clustering-methods clustering-models machine-learning-algorithms subspace-clustering tutor-milaan9 unsupervised-learning
Language:Jupyter Notebook 202
rezacsedu / Deep-Learning-for-Clustering-in-Bioinformatics
Deep Learning-based Clustering Approaches for Bioinformatics
autoencoders bioinformatics clustering-analysis convolutional-autoencoder deep-learning lstm-neural-networks neural-networks representation-learning variational-autoencoder
Language:Jupyter Notebook 141
rohanmohapatra / hdbscan-cpp
Fast and Efficient Implementation of HDBSCAN in C++ using STL
cpp c-plus-plus machine-learning machine-learning-algorithms clustering clustering-algorithm hdbscan stl-containers clustering-analysis
Language:C++ 72
bessagroup / CRATE
CRATE: Accurate and efficient clustering-based nonlinear analysis of heterogeneous materials through computational homogenization
computational-homogenization clustering-analysis computational-mechanics material-modelling
Language:Python 41
monty-se / PINstimation
A comprehensive bundle of utilities for the estimation of probability of informed trading models: original PIN in Easley and O'Hara (1992) and Easley et al. (1996); Multilayer PIN (MPIN) in Ersan (2016); Adjusted PIN (AdjPIN) in Duarte and Young (2009); and volume-synchronized PIN (VPIN) in Easley et al. (2011, 2012). Implementations of various estimation methods suggested in the literature are included. Additional compelling features comprise posterior probabilities, an implementation of an expectation-maximization (EM) algorithm, and PIN decomposition into layers, and into bad/good components. Versatile data simulation tools, and trade classification algorithms are among the supplementary utilities. The package provides fast, compact, and precise utilities to tackle the sophisticated, error-prone, and time-consuming estimation procedure of informed trading, and this solely using the raw trade-level data.
clustering-analysis expectation-maximisation-algorithm hierarchical-clustering information-asymmetry market-microstructure maximum-likelihood-estimation mixture-distributions poisson-distribution
Language:R 39
Simon-Bertrand / Clusters-Features
The Clusters-Features package allows data science users to compute high-level linear algebra operations on any type of data set. It computes approximatively 40 internal evaluation scores such as Davies-Bouldin Index, C Index, Dunn and its Generalized Indexes and many more ! Other features are also available to evaluate the clustering quality.
clustering indices internal validation ball-hall generalized-dunn-indexes c-index banfeld-raftery davies-bouldin calinski-harabasz ray-turi xie-beni wemmert-gancarski pbm point-biserial unsupervised-learning python clusters evaluation clustering-analysis
Language:Python 33
DOH-JDJ0303 / bigbacter-nf
Bacterial surveillance pipeline.
accessory-genome bacterial-genomics clustering-analysis genome-analysis public-health public-health-surveillance snp-analysis
Language:Nextflow 26
julherest / drought_clusters
Code used to identify and analyze drought clusters from gridded data.
climate-variability clustering-analysis droughts
Language:Python 25
DRLib / CDR
Implementation of CDR - Interactive Visual Cluster Analysis by Contrastive Dimensionality Reduction
dimensionality-reduction interactive-clustering contrastive-loss pytorch-implementation multidimensional-projection clustering-analysis
Language:JavaScript 22
Clustering-by-Silhouette
EtzionR / Clustering-by-Silhouette
Optimize clustering labels using Silhouette Score.
clustering clustering-analysis clustering-evaluation hdbscan kmeans machine-learning meanshift silhouette
Language:Python 15
sharmaroshan / MNIST-Using-K-means
It is One of the Easiest Problems in Data Science to Detect the MNIST Numbers, Using a Classification Algorithm, Here I have used a csv File which contains the Pixels of the Numbers from 0 to 9 and we have to Classify the Numbers Accordingly. I have Used K-Means Classification Algorithm.
kmeans clustering-analysis python jupyter-notebooks unsupervised-learning machine-learning imagery-analysis tutorial beginner
Language:HTML 15
marthadais / AISclassification
A geometric-driven semi-supervised approach for fishing activity detection from AIS data.
clustering-analysis feature-augmented-clustering mobility-behavior-detection time-series-classification
Language:Jupyter Notebook 13
salar96 / MEP-Orthogonal-NMF
Clustering and resource allocation using Deterministic Annealing Approach and Orthogonal Non-negative Matrix Factorization O-(NMF)
clustering clustering-algorithm clustering-analysis nmf nmf-decomposition orthogonal outlier-detection onmf deterministic-annealing anomaly-detection resource-allocation constrained-optimization data-analysis matrix-factorization nonnegative-matrix-factorization nonnegativity-constraints sparse-representations
Language:Jupyter Notebook 11
ShuyueG / CVI_using_DSI
Cluster Validity Index Using a Distance-based Separability Measure
cluster-validity cluster-validity-index-evaluation clustering-analysis separability-measure
Language:Python 10
at-tan / Hierarchical_Clustering_of_Currencies
A clustering exercise of global currencies on three common financial market features using data from 2017 through 2019, as published in Towards Data Science on Medium.com
clustering clustering-analysis hierarchical-clustering
Language:Jupyter Notebook 9
dilettagoglia / DataMining
🔎Data Understanding, Visualization , Preparation & Cleaning - Clustering algorithms (unsupervised learning) - Classification algorithms (supervised learning) - Sequential Pattern Mining
datamining data data-visualization data-analysis data-mining data-processing data-cleaning customer-profile clustering-analysis clustering clustering-algorithm classification correlations assessing-data-quality classification-algorithm supervised-learning unsupervised-learning sequential-patterns
Language:Jupyter Notebook 9
pajaskowiak / clusterConfusion
Clustering validation with ROC Curves
clustering clustering-algorithm clustering-analysis clustering-evaluation roc-analysis roc-auc roc-auc-curve roc-plot auc-score clustering-validation hierarchical-clustering k-means k-means-clustering roc-auc-score roc-curve roc-curves
Language:R 7
zcebeci / fcvalid
Internal Validity Indexes for Fuzzy and Possibilistic Clustering
clustering clustering-analysis validation validity-indices fuzzy-internal-indexes fuzzy-cmeans-clustering fuzzy-clustering-analyses fuzzy-possibilistic-cmeans possibilistic-clustering-algorithms validate fcm pcm unsupervised-learning unsupervised-machine-learning unsupervised-clustering cluster-analysis number-of-clusters clustering-evaluation clustering-validation clustering-benchmarks
Language:R 7
BayoAdejare / lightning-containers
Docker powered starter for geospatial analysis of lightning atmospheric data.
clustering-analysis csv-files data-engineer data-engineering-pipeline data-warehouse databases docker jupyter machine-learning-algorithms noaa-weather orchestrator pandas python3 spatialite sqlite streamlit-dashboard
Language:Python 6
KaikeWesleyReis / kaggle
Solutions for different datasets in Kaggle Website
clustering-analysis feature-engineering kaggle npl predictive-analysis
Language:Jupyter Notebook 6
danustc / Image_toolbox
This is my toolbox for image processing and downstream analysis of calcium imaging data.
calcium-imaging clustering-analysis pipeline neurons scikit-learn
Language:Jupyter Notebook 5
EtzionR / generate-Convex-Hull-SHP-from-HDBSCAN-clustering-probabilities
Defines a boundary around cluster centers in a given point-layer shapefile.
clustering clustering-analysis convex-hull coordinate-systems esri geographical-information-system geometry gis hdbscan machine-learning shp
Language:Python 5
ArtemKovera / clust
a few different clustering algorithms with python libraries for data science
clustering clustering-algorithm cluster-analysis clustering-analysis hierarchical-clustering k-means dbscan neural-network-based-clustering
Language:Jupyter Notebook 4
caesarmario / Mall-Customers-Clustering-Analysis-using-SAS-Enterprise-Miner
This repository contains mall customers clustering analysis. This repository also uses SAS Enterprise Miner to perform clustering and identify each cluster's characteristics. Full explanations about this repository can be seen on: https://medium.com/@caesarmario/mall-customers-clustering-analysis-da594bd2718b
cluster-analysis clustering clustering-algorithm clustering-analysis clustering-evaluation clusters marketing marketing-analytics sas sas-enterprise-miner segmentation
4
MarinaMoreno / Client-Segmentation-Clustering
This repository contains an ML project that was approached with a business mindset from the beginning to the end. It addresses the problem of clustering.
business-solutions clustering clustering-algorithm clustering-analysis clustering-methods machine-learning customer-segmentation
Language:Jupyter Notebook 4
parthnan / IowaGamblingTask-Clustering
Clustering Analysis of all available research data on the Iowa Gambling Task(list of sources in readme) using R. The Scripts produce the output for the most common archetypes among the dataset of one researcher using PCA.
clustering-analysis r statistics iowagambling probabilistic-models medical-diagnosis neurology
Language:R 4
renatocorreia-rmcm / mall-customers-segmentation
Implementation of a simple clustering model.
clustering clustering-analysis machine-learning ml python segmentation-models
Language:Jupyter Notebook 4
AnFrBo / internet_censorship
Analysis of the State of Internet Censorship in the United Kingdom Using Data Provided by OONI and Blocked Project as well as Scraped URL Meta Data
blocked clustering-analysis filtering internet-censorship it-security levenshtein-distance nlp ooniprobe url-analytics wordcloud-visualization
Language:R 3
AYSE-DUMAN / Clustering-by-Business-Income-and-Expenses
load and visualize data and clusters with scatter plots; prepare data for cluster analysis; perform centroid clustering with k-means; interpret clustering results and determine the optimal number of clusters for a given dataset.
clustering-methods clustering-evaluation clustering-analysis
Language:Jupyter Notebook 3
Devanshi-Bavaria / Predictive-Modeling-for-Stock-Market-Trends
📈 Comprehensive stock price analysis, including preprocessing, clustering, correlation, and predictive modeling, to enhance investment insights and accuracy. 💡
clustering-analysis correlation-analysis eda ml permutation-test
Language:Jupyter Notebook 3
DomainTools / risky-tld-cluster-analysis
An Analysis Using DomainTools Threat Profile to Identify Risky TLDs
tld cybersecurity machine-learning clustering clustering-analysis
Language:Jupyter Notebook 3
liruijia2017 / Local-gap-density-for-clustering-high-dimensional-data-with-varying-densities
A new clustering algorithm using local gap density
clustering-algorithm clustering-analysis density-based-clustering graph-based-clustering clustering clustering-methods
Language:MATLAB 3
olivierzach / random-neighbors
Random Neighbors: Random Forest style clustering for high-dimensional data
clustering clustering-analysis high-dimensional-data dbscan data-science unsupervised-learning optimization machine-learning algorithm
Language:Python 3
paocarvajal1912 / Crypto_Clustering
Uses K-Means unsupervised machine learning algorithm and Principal Component Analysis to cluster cryptocurrencies based on performance in selected periods.
clustering-analysis cryptocurrency jupyter jupyter-notebook kmeans kmeans-clustering-algorithm principal-component-analysis-pca python
Language:Jupyter Notebook 3
Priyanshu501 / CausalGeneAnalysis
This repository contains analysis and exploration of causal and non-causal relationships between genes and phenotypes using embeddings generated from GPT-3.5. The project applies vector analysis, dimensionality reduction, and clustering techniques (K-Means, Hierarchical, and DBSCAN) to uncover potential patterns and insights into causality.
clustering-analysis dbscan hierarchical-clustering jupyter-notebook kmeans
Language:Jupyter Notebook 2
Yutianxinw / Geo-segmentation-kepler
Visualizing customer segmentation using Kepler.gl with a focus on geographic patterns and spatial clustering to uncover regional marketing insights.
clustering-analysis customer-segmentation datavisualization geo-analytics kepler-gl
Language:Jupyter Notebook 2

clustering-analysis

milaan9 / Clustering_Algorithms_from_Scratch

rezacsedu / Deep-Learning-for-Clustering-in-Bioinformatics

rohanmohapatra / hdbscan-cpp

bessagroup / CRATE

monty-se / PINstimation

Simon-Bertrand / Clusters-Features

DOH-JDJ0303 / bigbacter-nf

julherest / drought_clusters

DRLib / CDR

EtzionR / Clustering-by-Silhouette

sharmaroshan / MNIST-Using-K-means

marthadais / AISclassification

salar96 / MEP-Orthogonal-NMF

ShuyueG / CVI_using_DSI

at-tan / Hierarchical_Clustering_of_Currencies

dilettagoglia / DataMining

pajaskowiak / clusterConfusion

zcebeci / fcvalid

BayoAdejare / lightning-containers

KaikeWesleyReis / kaggle

danustc / Image_toolbox

EtzionR / generate-Convex-Hull-SHP-from-HDBSCAN-clustering-probabilities

ArtemKovera / clust

caesarmario / Mall-Customers-Clustering-Analysis-using-SAS-Enterprise-Miner

MarinaMoreno / Client-Segmentation-Clustering

parthnan / IowaGamblingTask-Clustering

renatocorreia-rmcm / mall-customers-segmentation

AnFrBo / internet_censorship

AYSE-DUMAN / Clustering-by-Business-Income-and-Expenses

Devanshi-Bavaria / Predictive-Modeling-for-Stock-Market-Trends

DomainTools / risky-tld-cluster-analysis

liruijia2017 / Local-gap-density-for-clustering-high-dimensional-data-with-varying-densities

olivierzach / random-neighbors

paocarvajal1912 / Crypto_Clustering

Priyanshu501 / CausalGeneAnalysis

Yutianxinw / Geo-segmentation-kepler