Andy Pham's repositories
banking-system
Python OOP - Banking System with MySQL database case study.
Creating-a-KPI-Dashboard-with-Tableau
Creating an interactive KPI Dashboard and other visualizations with Tableau. :bar_chart:
hadoop-mini-project
Using Hadoop to process data from an automobile tracking platform that records the history of important incidents after the initial sale of a new vehicle.
Top-Rentals-Cineplex
Applying data engineering techniques to create a data pipeline with Azure cloud computing.
airflow-mini-project1
Using Docker and Apache Airflow to orchestrate a pipeline: creating a DAG, using various operators (BashOperator, PythonOperator, etc.), and setting the order of execution of each task.
kafka-mini-project-2
Creating a streaming application that serves as a simple real-time fraud detection system, backed by Apache Kafka and using a Python client.
spark-mini-project
Using Spark transformations to solve traditional MapReduce data problems.
Statistical-Analysis
This repository focuses on statistical analysis and exploration of various data sets for personal and professional projects. :chart_with_upwards_trend:
GoodVitamins
Applying NLP and unsupervised machine learning techniques to quickly show the most representative user reviews of vitamin products from iHerb. :pill:
Data-Science--Cheat-Sheet
Cheat Sheets
EURO_CUP_2016_PostgreSQL
EURO CUP 2016 mini-project: PostgreSQL schema and table setup, with solutions
How-To-Deploy-WebApp-with-Streamlit-Heroku
A quick and intuitive tutorial for a simple WebApp deployment with Streamlit and Heroku. :computer:
kubernetes-the-hard-way
Bootstrap Kubernetes the hard way on Google Cloud Platform. No scripts.
Lessons-Learned-Data-Science-Interviews
Lessons learned the hard way through more than 30 data science interviews
Python-Challenge-Questions
This repository contains a collection of solutions to Python challenges, along with some Python tips and fundamentals.
python_data_pipeline
A Simple Pure Python Data Pipeline to process a Data Stream
spark-optimization
Hands-on experience optimizing PySpark code.
SQL-Challenge-Questions
This repository contains all the SQL solutions I have written, with intuitive explanations (including supplemental readings on the tools used).
textpack
Group thousands of similar spreadsheet or database text entries in seconds
Web-Scraping-with-BeautifulSoup-and-Pandas
A quick web scraping tutorial with BeautifulSoup and Pandas :spider: :panda_face:
Web-Scraping-with-Selenium
An intuitive tutorial of web scraping with Selenium. :spider:
WebScraping
Create a database from scratch by extracting HTML elements from a webpage