scikit-learn scikitlearn-machine-learning scikit-learn-python sklearn sklearn-library dimensionality-reduction pca tsne

Dimensionality Reduction Techniques

Dimensionality reduction refers to the phenomenon of reducing the dimensionality of the data. Dimensionality is also known as input variables, features, or simply columns of the dataset. Dimensionality reduction is used commonly in many machine learning models since many machine learning algorithms don't function well with higher dimensionality. However, dimensionality reduction has an important use case in replacing values with confidential data with non-interpretable values, for instance, in the case of a dataset containing banking or health data.

Principal Component Analysis (PCA)

PCA is limited to linear data. Refer to the API Reference Guide here: https://scikit-learn.org/stable/modules/generated/sklearn.decomposition.PCA.html

T-distributed Stochastic Neighbor Embedding (t-SNE)

t-SNE also works with non-linear data. Refer to the API Reference Guide here: https://scikit-learn.org/stable/modules/generated/sklearn.manifold.TSNE.html

About

This repository demonstrates reducing the dimensions of the dataset using Scikit-learn's Principal Component Analysis (PCA) and T-distributed Stochastic Neighbor Embedding (t-SNE).

scikit-learn scikitlearn-machine-learning scikit-learn-python sklearn sklearn-library dimensionality-reduction pca tsne

MIT License

Languages

Language:Python 100.0%