Pratik Jagtap's repositories
data-reporting
Repo uses docker-compose file to create docker container to run hadoop, hive and spark. Notes.txt file has details about implementation. This repo covers end-to-end simple flow, reading parquet files to spark-df and loading it to csv-file and run unit-test-cases.
Language:Python000
Data-visuals-using-Python
Python being the favorite language for coding, tried to implement various visuals using python libraries such as Matplotlib, Seaborn, Folium. This is going to be a jupyter notebook file along with some dataset files including excel and csv
docker-basics
Sample orchestration of docker implementations