georgosgeorgos / data-mining

Some Data Mining topics

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DataMining

  • Inverted Index for retrieval and ranking

  • Recommender Engines for the Book-Crossing Dataset

    • Content-Based
    • Collaborative Filtering (with hierarchical clustering and dimensionality reduction)
    • SVD approach
  • Evaluation metrics for a search engine (R-precision, modified nDCG) and Fagin's algorithm for Top-k queries retrieval (MG4J & Python)

  • Recommendation Systems for movies using a personalized PageRank algorithm

  • Classifiers (spam/no-spam) using k-NN & SVM with RBF kernel

  • Web Crawler with priority for pages with the highest partial PageRank

  • Toy Lucene search engine

About

Some Data Mining topics

License:MIT License


Languages

Language:Python 52.8%Language:Java 19.4%Language:Jupyter Notebook 18.7%Language:Shell 9.2%