datasciencegeek1

datasciencegeek1

Geek Repo

Location:London, United Kingdom

Github PK Tool:Github PK Tool

datasciencegeek1's repositories

Coronary-heart-disease-prediction-using-various-ML-algorithms

Analysed the performance of different Machine Learning algorithms on Coronary heart disease dataset acquired from Kaggle. Performed EDA, Data cleansing, data pre processing and feature correlation, feature selection. Implemented Logistic regression with 10 fold cross validation, Logistic regression with GridSearchCV, Random Forest, RNN and MLP

Language:Jupyter NotebookStargazers:2Issues:0Issues:0

Azure-Machine-Learning-Pipeline-Big-Data-Cloud-Computing

An end-to-end machine learning pipeline on Microsoft Azure to predict flight delays using historical flight and weather data. The project integrated Azure Blob Storage, Data Factory, and Databricks for data ingestion, processing, and analysis, and employed various ML algorithms for model training and evaluation and visualisation on Power BI

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

ECMWF-Query-Prediction

Implemented RNN with LSTM to predict queries for retrieving weather data from the ECMWF. The study investigated data pre-processing, hyperparameter tuning, model, architecture and training. The model achieved 91% accuracy. The work will significantly reduce large data fetching latency from tape systems which are slow unlike disk drives

Language:PythonStargazers:1Issues:1Issues:0

Cancer-cell-analysis

This repository contains the workflow and data for a machine learning project aimed at developing a cancer diagnosis model in KNIME. We evaluate the performance of two popular machine learning models, Naiive Bayes and Support Vector Machine (SVM), using precision and recall metrics

Stargazers:0Issues:0Issues:0

LA-Metro-Bike-Data

This work is a demonstration of EDA and BI in Python Jupyter Notebook on the Los Angeles Metro Bike Share system's bicycle trip data. The project involved data cleaning, visualization, and statistical analysis to identify key trends, patterns, and actionable insights that could help improve bike share utilization and operational efficiency.

Language:Jupyter NotebookStargazers:0Issues:1Issues:0