high-dimensional-data

There are 9 repositories under high-dimensional-data topic.

NVIDIA / MinkowskiEngine
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
neural-network computer-vision sparse-tensors convolutional-neural-networks semantic-segmentation auto-differentiation spatio-temporal-analysis space-time deep-learning 3d-convolutional-network 4d-convolutional-neural-network high-dimensional-data high-dimensional-inference trilateral-filter 3d-vision sparse-convolution pytorch minkowski-engine cuda sparse-tensor-network
Language:Python 2807
ContextLab / hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
data-visualization high-dimensional-data python topic-modeling text-vectorization data-wrangling visualization time-series
Language:Python 1873
vald
vdaas / vald
Vald. A Highly Scalable Distributed Vector Search Engine
vald approximate-nearest-neighbor-search kubernetes distributed-systems nearest-neighbor-search vector-search-engine similarity-search image-search image-search-engine vector anng ngt microservices golang cloud cloud-native high-performance high-dimensional-data hacktoberfest
Language:Go 1656
abess
abess-team / abess
Fast Best-Subset Selection Library
polynomial-algorithm high-dimensional-data best-subset-selection machine-learning python r scikit-learn principal-component-analysis linear-regression classification-algorithm cox-regression logistic-regression multitask-learning ordinal-regression poisson-regression robust-principal-component-analysis sparse-principal-component-analysis feature-selection sure-independence-screening
Language:C++ 489
ramhiser / datamicroarray
A collection of small-sample, high-dimensional microarray data sets to assess machine-learning algorithms and models.
cancer colon-cancer high-dimensional-data machine-learning r
Language:R 105
Tuyki / TT_RNN
high-dimensional-data tensor-train tensor-train-layer tensor-train-rnn
Language:Python 102
daleroberts / hdmedians
High-dimensional medians (medoid, geometric median, etc.). Fast implementations in Python.
high-dimensional-data machine-learning median python statistics
Language:Python 77
gdkrmr / dimRed
A Framework for Dimensionality Reduction in R
dimensionality-reduction framework high-dimensional-data manifold-learning quality-control r visualization
Language:R 73
sergiocorreia / ppmlhdfe
Poisson pseudo-likelihood regression with multiple levels of fixed effects
fixed-effects high-dimensional-data poisson-regression separation stata
Language:HTML 71
epigen / unsupervised_analysis
A general purpose Snakemake workflow and MrBiomics module to perform unsupervised analyses (dimensionality reduction & cluster analysis) and visualizations of high-dimensional data.
data-science high-dimensional-data snakemake workflow unsupervised-learning principal-component-analysis umap pca visualization clustering data-visualization dimensionality-reduction heatmap densmap cluster-analysis cluster-validation clustering-algorithm clustree leiden-algorithm
Language:Python 60
great-northern-diver / loon
A Toolkit for Interactive Statistical Data Visualization
data-analysis data-science data-visualization exploratory-analysis exploratory-data-analysis high-dimensional-data interactive-graphics interactive-visualizations loon python statistical-analysis statistical-graphics statistics tcl-extension tk
Language:Tcl 49
GuansongPang / deep-outlier-detection
Deep distance-based outlier detection published in KDD18: Learning representations specifically for distance-based outlier detection. Few-shot outlier detection
outlier-detection few-shot-learning high-dimensional-data representation-learning deep-learning
Language:Python 48
lightonai / newma
Implementation of NEWMA: a new method for scalable model-free online change-point detection
change-point-detection machine-learning hardware-acceleration python paper high-dimensional-data timeseries
Language:Python 46
VarIr / scikit-hubness
A Python package for hubness analysis and high-dimensional data mining
hubness machine-learning data-science data-mining high-dimensional-data nearest-neighbor-search approximate-nearest-neighbor-search
Language:Python 45
nanxstats / hdnom
🔮 Benchmarking and visualization toolkit for penalized Cox models
high-dimensional-data survival-analysis benchmark penalized-cox-models linear-regression nomogram-visualization
Language:R 44
ejohnson643 / EMBEDR
Statistical quality evaluation of dimensionality reduction algorithms
dimension-reduction dimensionality-reduction high-dimension-visualization high-dimensional-data scrna-seq-analysis
Language:Jupyter Notebook 29
mariaderrico / DPA
The DPA package is the scikit-learn compatible implementation of the Density Peaks Advanced clustering algorithm. The algorithm provides robust and visual information about the clusters, their statistical reliability and their hierarchical organization.
clustering-algorithm python scikit-learn high-dimensional-data hierarchy-visualization non-parametric-density-estimation
Language:Jupyter Notebook 29
NLeSC / DiVE
An interactive 3D web viewer of up to million points on one screen that represent data. Provides interaction for viewing high-dimensional data that has been previously embedded in 3D or 2D. Based on graphosaurus.js and three.js. For a Linux release of a complete embedding+visualization pipeline please visit https://github.com/sonjageorgievska/Embed-Dive.
interactive-visualizations 3d-data embedded-data web-application manifold-learning non-linear-dimensionality-reduction high-dimensional-data
Language:HTML 26
JoshEngels / FLINNG
A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing
nearest-neighbor-search group-testing locality-sensitive-hashing high-dimensional-data
Language:C++ 23
brian-lau / highdim
Statistics for high-dimensional data (homogeneity, sphericity, independence, spherical uniformity)
circular-statistics high-dimensional-data independence matlab statistics
Language:Matlab 19
mackelab / CorBinian
CorBinian: A toolbox for modelling and simulating high-dimensional binary and count-data with correlations
binary multivariate correlation mcmc entropy ising-model neurons maxent high-dimensional-data maximum-likelihood dichotomized-gaussian k-pairwise iterative-scaling gibbs-sampling specific-heat criticality heat-capacity bernoulli count maximum-entropy
Language:MATLAB 19
MNoorFawi / lshashing
python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data
nearest-neighbor-search locality-sensitive-hashing python k-nearest-neighbours high-dimensional-data
Language:Python 19
OFAI / hub-toolbox-python3
Hubness analysis and removal functions
machine-learning high-dimensional-data hubness data-mining
Language:Python 19
angeloschatzimparmpas / t-viSNE
t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections
interpretable-tsne dimensionality-reduction high-dimensional-data explainable-machine-learning visualization
Language:JavaScript 18
SuperXiang / High-Dimensional-Feature-Selection-of-Medical-Data
Feature Selection by Optimized LASSO algorithm
feature-selection high-dimensional-data lasso machine-learning data-mining
Language:MATLAB 16
huangdonghere / SRCFS
MATLAB code for Unsupervised Feature Selection with Multi-Subspace Randomization and Collaboration (SRCFS) (KBS 2019)
ensemble-learning feature-selection high-dimensional-data random-subspaces unsupervised-feature-selection
Language:MATLAB 15
ivan-pi / fortran-flann
Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.
approximate-nearest-neighbor-search hierarchical-clustering high-dimensional-data kdtree kmeans-clustering nearest-neighbor-search spatial-search
Language:Fortran 15
KChen-lab / SCMarker
Marker gene selection from scRNA-seq data
single-cell-rna-seq feature-selection high-dimensional-data statistical-methods
Language:HTML 15
pedbrgs / PyCCEA
A Python package of cooperative co-evolutionary algorithms for feature selection in high-dimensional data.
feature-selection cooperative-coevolution machine-learning supervised-learning evolutionary-algorithms high-dimensional-data classification-tasks regression-tasks python
Language:Python 15
ramhiser / sparsediscrim
Sparse and Regularized Discriminant Analysis in R
classifier high-dimensional-data machine-learning r
Language:R 14
0xshreyash / tsne-lib
A simple library for t-SNE animation and a zoom-in feature to apply t-SNE in that region
visualization t-sne machinelearning deeplearning manifold-learning machine-learning sklearn high-dimensional-data dimensionality-reduction tsne tsne-animation
Language:Python 13
nanxstats / msaenet
🧲 Multi-step adaptive estimation for reducing false positive selection in sparse regressions
high-dimensional-data variable-selection linear-regression machine-learning false-positive-control
Language:R 13
wangxb96 / MEL
Code for “MEL: Efficient Multi-Task Evolutionary Learning for High-Dimensional Feature Selection“--[IEEE Transactions on Knowledge and Data Engineering (TKDE 24)]
evolutionary-algorithms feature-selection high-dimensional-data multi-task-learning particle-swarm-optimization
Language:MATLAB 13
astro-informatics / QuantifAI
PyTorch-based radio-interferometric imaging reconstruction package with scalable Bayesian uncertainty quantification relying on data-driven (learned) priors
machine-learning radio-interferometry uncertainty-quantification high-dimensional-data pytorch
Language:Jupyter Notebook 12
MChatzakis / DARTH
[SIGMOD 2026] DARTH: Declarative Recall Through Early Termination for Approximate Nearest Neighbor Search.
declarative-workflows early-termination faiss-vector-database gbdt high-dimensional-data vector-data-management vector-database vector-search
Language:C++ 12
the-fang / Hybrid-K-means-Pso
An advanced version of K-Means using Particle swarm optimization for clustering of high dimensional data sets, which converges faster to the optimal solution.
matlab matlab-gui kmeans-clustering particle-swarm-optimization high-dimensional-data optimization
Language:MATLAB 12