- Project done for the faculty course Hidden Knowledge.
- The project covered:
- Scraping the Discogs site for data about albums, songs, and artists from Yugoslavia and Serbia.
- Analysis of the scraped data through queries and plots.
- Unsupervised clustering of scraped data.
- Install Python 3.7
- Install Docker
- Install the required modules with: pip install -r requirements.txt
- Start the database in Docker: docker-compose -f docker-compose.yml up -d
- From the project root, change into the src directory: cd src
- Run the project with: python main.py
- Scraping time: 72h
- Albums scraped: 65573
- Artists scraped: 62025
- Songs scraped: 435107
- Genre counts, top 6:
- Song count, grouped by song length:
- Album count, grouped by decade:
- Album count, grouped by whether the name is written in Cyrillic:
- Album count, grouped by number of genres per album:
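
The groupings listed above can be sketched with the Python standard library alone. This is only an illustration under stated assumptions, not the code used in the project: the sample records, the field names (`name`, `year`, `genres`), the one-minute song-length buckets, and the helper functions `decade` and `is_cyrillic` are all hypothetical, since the real schema of the scraped data is not shown here.

```python
import unicodedata
from collections import Counter

# Hypothetical sample records; the real scraped schema may differ.
albums = [
    {"name": "Дискотека", "year": 1978, "genres": ["Pop"]},
    {"name": "Kad bi bio bijelo dugme", "year": 1974, "genres": ["Rock"]},
    {"name": "Mrtva priroda", "year": 1981, "genres": ["Rock", "Pop"]},
    {"name": "Уживо", "year": 1995, "genres": ["Folk, World, & Country"]},
]
# Hypothetical song durations in seconds.
song_lengths_s = [185, 242, 59, 601]


def decade(year):
    """Map a release year to the first year of its decade, e.g. 1978 -> 1970."""
    return year // 10 * 10


def is_cyrillic(text):
    """Return True if text has letters and all of them are Cyrillic."""
    letters = [ch for ch in text if ch.isalpha()]
    return bool(letters) and all(
        "CYRILLIC" in unicodedata.name(ch, "") for ch in letters
    )


# Genre counts, top 6 (an album with several genres counts once per genre).
top_genres = Counter(g for a in albums for g in a["genres"]).most_common(6)

# Song count grouped by length, using whole-minute buckets (an assumption).
songs_per_minute = Counter(s // 60 for s in song_lengths_s)

# Album count grouped by decade.
albums_per_decade = Counter(decade(a["year"]) for a in albums)

# Album count grouped by whether the name is written in Cyrillic.
albums_by_script = Counter(is_cyrillic(a["name"]) for a in albums)

# Album count grouped by the number of genres on the album.
albums_by_genre_count = Counter(len(a["genres"]) for a in albums)

print(top_genres)
print(sorted(songs_per_minute.items()))
print(sorted(albums_per_decade.items()))
print(dict(albums_by_script))
print(sorted(albums_by_genre_count.items()))
```

Each `Counter` maps a group key to a count, so the results can be passed directly to a plotting library as bar-chart data. The Cyrillic check treats a name as Cyrillic only if every letter is; names mixing scripts would need a different rule.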