There are 5 repositories under high-dimensional-data topic.
A Python toolbox for gaining geometric insights into high-dimensional data
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Vald. A Highly Scalable Distributed Vector Search Engine
Fast Best-Subset Selection Library
A collection of small-sample, high-dimensional microarray data sets to assess machine-learning algorithms and models.
A Framework for Dimensionality Reduction in R
High-dimensional medians (medoid, geometric median, etc.). Fast implementations in Python.
A Toolkit for Interactive Statistical Data Visualization
Implementation of NEWMA: a new method for scalable model-free online change-point detection
Poisson pseudo-likelihood regression with multiple levels of fixed effects
Deep distance-based outlier detection published in KDD18: Learning representations specifically for distance-based outlier detection. Few-shot outlier detection
A Python package for hubness analysis and high-dimensional data mining
Benchmarking and Visualization Toolkit for Penalized Cox Models
An interactive 3D web viewer of up to million points on one screen that represent data. Provides interaction for viewing high-dimensional data that has been previously embedded in 3D or 2D. Based on graphosaurus.js and three.js. For a Linux release of a complete embedding+visualization pipeline please visit https://github.com/sonjageorgievska/Embed-Dive.
Statistical quality evaluation of dimensionality reduction algorithms
Hubness analysis and removal functions
Statistics for high-dimensional data (homogeneity, sphericity, independence, spherical uniformity)
The DPA package is the scikit-learn compatible implementation of the Density Peaks Advanced clustering algorithm. The algorithm provides robust and visual information about the clusters, their statistical reliability and their hierarchical organization.
Marker gene selection from scRNA-seq data
CorBinian: A toolbox for modelling and simulating high-dimensional binary and count-data with correlations
Sparse and Regularized Discriminant Analysis in R
🧲 Multi-step adaptive estimation for reducing false positive selection in sparse regressions
MATLAB code for Unsupervised Feature Selection with Multi-Subspace Randomization and Collaboration (SRCFS) (KBS 2019)
jQuery plugin to easily browse and highlight your JSON
locus R package - Large-scale variational inference for variable selection in sparse multiple-response regression
A simple library for t-SNE animation and a zoom-in feature to apply t-SNE in that region
t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections
Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.
An R package for testing high-dimensional covariance matrices
A Decomposition-based Canonical Correlation Analysis for High-dimensional Datasets (JASA-20 paper)
Feature Selection by Optimized LASSO algorithm
A R package for multi-dimensional data visualization
Course Webpage for DS503 being taught at IIT Bhilai
A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing
python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data