shv07 / Data_Science_Project

Various sampling methods to speed up clustering

CS328 - Introduction to Data Science

Sampling Techniques for Better Performance of the KMeans Algorithm

We have implemented the following sampling algorithms with reasonable success:

  • Uniform Sampling
  • Leverage Score based Sampling
  • Volume Sampling
  • Light Weight Coresets based Sampling
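
To make the list above concrete, here is a minimal sketch of three of these samplers using NumPy. It is not taken from the project's notebooks: the function names (`uniform_sample`, `leverage_score_sample`, `lightweight_coreset`), the `(points, weights)` return convention, and the choice to sample with replacement are all assumptions made for illustration. Volume sampling is left out because it needs a more involved procedure.

```python
import numpy as np


def uniform_sample(X, m, rng):
    """Draw m rows uniformly at random; each sampled row gets weight n / m."""
    n = X.shape[0]
    idx = rng.choice(n, size=m, replace=True)
    return X[idx], np.full(m, n / m)


def leverage_score_sample(X, m, k, rng):
    """Draw m rows with probability proportional to rank-k leverage scores.

    Assumption: the leverage score of row i is the squared norm of row i
    of the top-k left singular vectors of X.
    """
    U, _, _ = np.linalg.svd(X, full_matrices=False)
    scores = (U[:, :k] ** 2).sum(axis=1)
    p = scores / scores.sum()
    idx = rng.choice(X.shape[0], size=m, replace=True, p=p)
    # Importance weights keep weighted sums unbiased.
    return X[idx], 1.0 / (m * p[idx])


def lightweight_coreset(X, m, rng):
    """Draw m rows from a mix of uniform and distance-to-mean sampling.

    Assumption: q(x) = 1/(2n) + d(x, mean)^2 / (2 * sum of d(x', mean)^2).
    """
    n = X.shape[0]
    sq_dist = ((X - X.mean(axis=0)) ** 2).sum(axis=1)
    total = sq_dist.sum()
    if total == 0.0:
        # All points coincide, so fall back to uniform probabilities.
        q = np.full(n, 1.0 / n)
    else:
        q = 0.5 / n + 0.5 * sq_dist / total
    idx = rng.choice(n, size=m, replace=True, p=q)
    return X[idx], 1.0 / (m * q[idx])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(1000, 5))  # synthetic data, for illustration only
    for name, (S, w) in {
        "uniform": uniform_sample(X, 50, rng),
        "leverage": leverage_score_sample(X, 50, 3, rng),
        "lightweight coreset": lightweight_coreset(X, 50, rng),
    }.items():
        print(name, S.shape, w.shape)
```

Each function returns the sampled points together with importance weights, so a weighted clustering routine can run on the small sample instead of the full data. With scikit-learn, for example, the weights could be passed as `sample_weight` to `KMeans.fit`.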

Languages

Jupyter Notebook 97.9%, Python 2.1%