Abdul Muffid's repositories
PySpark-Tutorials
Pyspark Tutorials
College
This repository serves as a comprehensive documentation of my coursework throughout my Applied Data Science studies. Explore detailed information, source code, and analyses covering various projects, including data analysis, machine learning, image processing, and more. Feel free to browse and provide feedback. Thank you for visiting! 🙌🏼
Airflow-Docker-Parking-ETL-Pipeline
This repository is a course project for Big Data Technology and Tools that simulates the Extract, Transform, and Load (ETL) processes of customer parking data using Apache Airflow, automating data workflows to efficiently manage and process parking information. Source: https://github.com/gilomuan/Parking-ETL-Pipeline--Airflow-Docker-
DVC-Basics
In this repository, an ML-Ops task is undertaken to practice configuring and storing data using DVC on GitHub. The goal is to explore how DVC seamlessly integrates for efficient data management, enhancing reproducibility and scalability in machine learning workflows.
Streamlit-Simple-Web-to-Predict-Milk-Grades
This repository hosts an ML-Ops project creating a Streamlit web app to predict milk grades. It employs a Random Forest Classifier model, boosted by PyCaret, a low-code ML library. Predictions are based on pH, Temperature, Taste, Odor, Fat, Turbidity, and Color factors.