AndresMpa / big_data

Notes about big data

Big data

This repository contains notes in notes and some toy examples in code. Each section of code uses its own virtual environment and may use a dataset from dataset. Every requirements.txt lists the dependencies for its section; to install them, follow these steps:

On Linux:
python -m venv env
source env/bin/activate
pip install -r requirements.txt
On Windows:
python -m venv env
env\Scripts\activate
pip install -r requirements.txt
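
The same steps can also be scripted so that one command works on both platforms. The following is a minimal sketch, not part of the repository: the function names `venv_python` and `setup_env` are hypothetical, and it assumes the section folder holds its own requirements.txt. It uses only the standard library `venv` and `subprocess` modules.

```python
import subprocess
import sys
import venv
from pathlib import Path


def venv_python(env_dir: Path) -> Path:
    """Return the interpreter path inside a virtual environment.

    Windows keeps executables under Scripts; Linux and macOS use bin.
    """
    if sys.platform == "win32":
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"


def setup_env(section_dir: Path, env_name: str = "env") -> Path:
    """Create the environment and install requirements.txt if it exists.

    section_dir is assumed to be one section folder that contains its own
    requirements.txt (hypothetical layout; adjust to the actual folder).
    """
    env_dir = section_dir / env_name
    venv.create(env_dir, with_pip=True)  # same effect as: python -m venv env
    python = venv_python(env_dir)
    requirements = section_dir / "requirements.txt"
    if requirements.exists():
        # Calling pip through the environment's interpreter avoids the
        # need to activate the environment first.
        subprocess.run(
            [str(python), "-m", "pip", "install", "-r", str(requirements)],
            check=True,
        )
    return python
```

Calling `setup_env` with the path of a section folder returns the path of the interpreter inside the new environment, which can then run that section's scripts directly.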

Laboratory #1

An exploratory analysis of this dataset, which contains the top songs on Spotify.

Laboratory #2

An exploratory analysis of the Anime list 2023 dataset.
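
A common first step in an exploratory analysis is a quick structural summary of the file. The sketch below is an illustration only, not code from the laboratories: it assumes the dataset is a UTF-8 CSV file with a header row, and `summarize_csv` is a hypothetical helper name. It reports the row count, the column names, and how many empty values each column has.

```python
import csv
from collections import Counter
from pathlib import Path


def summarize_csv(path: Path) -> dict:
    """Return row count, column names, and per-column missing-value counts.

    Assumes a UTF-8 CSV file whose first row is the header.
    """
    missing = Counter()
    rows = 0
    with path.open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        columns = list(reader.fieldnames or [])
        for row in reader:
            rows += 1
            for column in columns:
                value = row.get(column)
                # Short rows yield None; empty cells yield "".
                if value is None or value.strip() == "":
                    missing[column] += 1
    return {
        "rows": rows,
        "columns": columns,
        "missing": {column: missing[column] for column in columns},
    }
```

Columns with many missing values are usually worth inspecting before any aggregation or plotting.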

License:GNU General Public License v3.0


Languages

Language:Python 100.0%