-
This is my experiment for my big-data assignment. I am working on a Covid dataset by running word-count algorithms coded in python and json.
-
Basically an analysis on the correlation between Weekly-covid-cases and deaths-weekly by counting.
- Apache-Flink
- Apache-Flink-PyFlink
- Python
- Google Collab
Locally:
- Install Python 3.8.0
choco install python --version=3.8.0
- Check for version in cmd
python --version
- Install Flink in Python using pip command.
python -m pip install apache-flink
- If a warning is dispalyed, update pip.
python -m pip install --upgrade pip
- Store your python code in a .py file and run it using the following command:
python filename.py
- To store the output on a file:
python filename.py --output output.txt
TIPS:
Make sure all previous or advanced versions of python and flink are removed from the system so that the installations run smoothly.
Google Colab
- Install Colaboratory in Google Drive.
- Create new Colab file.
- Install Flink
!pip install apache-flink
- Connect the resources(RAM and Disk) to hosted runtime.
- Run python code cell by clicking:
enter + shift
- Kaggle Dataset on Covid-19
- Code Forked from uuboyscy
- Apache Flink Example
- Apache Flink Table Example
- Main Group Repository