- View the group project data visualization report at (https://app.luminpdf.com/viewer/5ecc6da18124240012ae0885)
- Tableau Dashboard for Philadelphia House Price Data Visualization
- Data Collection: Airflow
- Data Cleaning: Pandas
- Data Storage: MySQL
- Data Transformation (Join Table): Spark(SQL), Cluster @ Databricks
- Data Visualization: Jupyter Notebook: Matplotlib -> (heatmap), seaborn -> (pairplot, boxplot), plotly -> (3D plot); Tableau: dashboard
- Airflow
- Pandas
- MySQL / PostgreSQL
- Jupyter Notebook Report
- AWS (S3, EC2, RDS)
- Language: R
- NoSQL Database: MongoDB
- Visualization Tools: ggplot2, plotly -> (3D plot)
- In this project(LEGO dataset), I applied both R-Studio and jupyter notebook to import and export data from MongoDB, and then used ggplot2 and plotly to demonstrate data visualization, respectively.
- Classes
- Unit Testing
- OOP
- Terminal
In this repo, kafka is used for tracking the route of the designed buslines. When we run the three different busdata producers python files, you will see three different moving spots in the map.
Environment & tools:
- Python 3.70 (pykafka, flask, JSON)
- Kafka
- Javascript (Leaflet.JS)
- html