ChienKangLu / Principal-Component-Analysis

Dimension reduction

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Principal-Component-Analysis

Principal Component Analysis (PCA) is a technique which is used for dimension reduction.

Purpose

  • Dimension reduction, keep the most important dimenstions
  • Find principle component
  • Project data a new space

Develop tools and techniques

  • Python
  • Pycharm

Skill

  • Linear algebra: eigen decomposition
  • Vector projection

Implementation Step

  1. Create data matrix which contains a lot of row vectors
  2. Normalize the matrix, which transforms all dimensions to the distribution of zero mean
  3. Calculate the covariance matrix, image
  4. Do eigen decomposition and get the eigen values and eigen vectors
  5. Verify whether the decomposition is right or not, image
  6. Project original data to the new space by the eigen vectors,

  7. Visualization

Result

image

About

Dimension reduction


Languages

Language:Python 100.0%