rgualan / soton-dm-google-books

Data mining Coursework 2

Data mining coursework: Understanding data

This repository contains the scripts used to extract, explore, and cluster 24 digitized documents from the Google Books Library Project. The goal is to understand the data and uncover the underlying relationships between the documents. The main tools used are Python and MongoDB.
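As a rough illustration of how relationships between documents might be explored, the sketch below weights terms with TF-IDF and compares documents by cosine similarity. This is an assumption made for illustration, not necessarily the method the repository's scripts use. The sample texts and the function names (`tokenize`, `tfidf_vectors`, `cosine`, `most_similar`) are hypothetical, and the MongoDB storage step is left out so the example runs with the standard library alone.

```python
import math
import re
from collections import Counter

# Hypothetical placeholder texts; they are NOT taken from the 24 documents.
SAMPLE_DOCS = [
    "The Roman empire and its emperors",
    "Emperors of the Roman empire",
    "Gardening with roses and tulips",
    "Roses, tulips and other garden flowers",
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (a dict of term -> weight) per document."""
    counts = [Counter(tokenize(doc)) for doc in docs]
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for count in counts:
        doc_freq.update(count.keys())
    vectors = []
    for count in counts:
        total = sum(count.values()) or 1
        vectors.append({
            # Smoothed IDF; a term present in every document gets weight 0.
            term: (freq / total) * math.log((1 + n_docs) / (1 + doc_freq[term]))
            for term, freq in count.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(docs):
    """For each document, return the index of its most similar other document."""
    vectors = tfidf_vectors(docs)
    nearest = []
    for i, vec in enumerate(vectors):
        scores = [(cosine(vec, other), j)
                  for j, other in enumerate(vectors) if j != i]
        nearest.append(max(scores)[1])
    return nearest


print(most_similar(SAMPLE_DOCS))
```

With these placeholder texts, the two history-themed documents pair with each other, as do the two gardening-themed ones. A pairwise similarity matrix built the same way could feed a clustering step such as hierarchical clustering.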

Languages

Python 98.3%, R 1.5%, Shell 0.2%