Pathairush Seeda's repositories
rdbms_to_hdfs_data_pipeline
A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).
airflow_hive_spark_sqoop
A docker using the airflow with Hadoop ecosystem (hive, spark, and sqoop)
data_engineering_nanodegree
Data Engineering Nanodegree projects
automate_with_pyautogui
Automate SAP report extraction with pyautogui
lead_generation_apps
A practical project for building the lead generation application using streamlist.
recommender_system
Practice various type of recommender system implementation.
data_streaming_nanodegree
Projects related to the Data Streaming Nanodegree
wongnai_topic_extraction
My independent study project for master degree in Applied statistics
ATM_optimization
The implementation of ATM replenishment problem with linear optimization in python.
aws_glue_data_pipeline
building a serverless data pipeline with AWS Glue and AWS Python SDK (Boto3)
data-engineer-project
Data Engineering Capstone
Data-Science--Cheat-Sheet
Cheat Sheets
data_manipulation
personal toolboxs
dlaicourse
Notebooks for learning deep learning
learn_nlp_with_bags_of_popcorn
Practicing NLP (natural language processing) techniques from the IMDB sentiment analysis dataset.
line_oa_bot
Capture webhook event from LINE platform with AWS Chalice
machine_learning_stanford
Review the machine learning contents by applying the assignment in python language.
online_course_lecture
All the lecture courses summary from my point of view
postgres_data_modeling
A repository storing Postgres data modeling projects.
shap
A unified approach to explain the output of any machine learning model.
shopee_code_league
Solution for Shopee code league data analytics path 2021
test_driven_data_analysis
A tutorial for integrating test-driven development to your data analysis workflow.