There are 1 repository under adaboost-classifier topic.
Determining the important factors that influences the customer or passenger satisfaction of an airlines using CRISP-DM methodology in Python and RapidMiner.
Natural Language Processing for Multiclass Classification: A repository containing NLP techniques for multiclass classification of text data.
This repository contains a collection of fundamental topics and techniques in machine learning. It aims to provide a comprehensive understanding of various aspects of machine learning through simplified notebooks. Each topic is covered in a separate notebook, allowing for easy exploration and learning.
Classification on Unbalanced Datasets using Boost Techniques (AdaBoost M2, SMOTE Boost, RusBoost,..)
Assignments from Applied Machine Learning Class (UTD BUAN-6341)
Iris Species Classification usin various ML models.
A ML application(deployed on flask) to detect heart disease in patients based on medical features.
I contributed to a group project using the Life Expectancy (WHO) dataset from Kaggle where I performed regression analysis to predict life expectancy and classification to classify countries as developed or developing. The project was completed in Python using the pandas, Matplotlib, NumPy, seaborn, scikit-learn, and statsmodels libraries. The regression models were fitted on the entire dataset, along with subsets for developed and developing countries. I tested ordinary least squares, lasso, ridge, and random forest regression models. Random forest regression performed the best on all three datasets and did not overfit the training set. The testing set R2 was .96 for the entire dataset and developing country subset. The developed country subset achieved an R2 of .8. I tested seven different classification algorithms to classify a country as developing or developed. The models obtained testing set balanced accuracies ranging from 86% - 99%. From best to worst, the models included gradient boosting, random forest, Adaptive Boosting (AdaBoost), decision tree, k-nearest neighbors, support-vector machines, and naive Bayes. I tuned all the models' hyperparameters. None of the models overfitted the training set.
This project aims to predict breast cancer using machine learning and deep learning techniques.
This project aims to predict the occurrence of diabetes using machine learning techniques. The dataset used for this analysis is the "diabetes_prediction_dataset.csv" file, which contains various features related to an individual's health condition.
All-in-1 notebook which applies different clustering (K-means, hierarchical, fuzzy, optics) and classification (AdaBoost, RandomForest, XGBoost, Custom) techniques for the best model.
The project aims to predict the 10-year risk of future coronary heart disease (CHD) for patients in Framingham, Massachusetts. A dataset (3390,16) containing demographic, behavioral, and medical risk factors of patients is used to build a classification model.
ML Project implementing decision trees, boosting and svm classification from scratch.
Text classification of messages collected during and after a natural disaster. Deploy a Flask app on Heroku .
Machine learning binary classification algorithms for classifying mails as spam or ham.
Twitter Sentiment Analisys, comparing different models
Predicting the Car prices based on its features and to help maintain transparency between car sellers and buyers. We can use the website to check the car price without third party interaction.
Analyze the data of Visa applicants, build a predictive model to facilitate the process of visa approvals, and based on important factors that significantly influence the Visa status recommend a suitable profile for the applicants for whom the visa should be certified or denied.
This Prediction is a research analysis process on data using classification algorithms to compare the accuracy rate for each algorithm given below on this Monkey Pox data such as ( K-Neighbors Classifier, RandomForest Classifier, AdaBoost Classifier, Bagging Classifier, Gradient Boosting Classifier, Decision Tree Classifier )
This is about how to make Diabetes Prediction with Machine Learning. We are developing a machine learning model capable of predicting whether someone may have diabetes based on health data and specific parameters. Using the right machine learning algorithms, we will process this data to provide valuable predictions for patients and medical
Breast_Cancer_Prediction using XGBoostClassifier & AdaboostClassifier
Applied various algorithm models to solve a binary classification problem of predicting if a patient will suffer from a disease. Project done for Machine Learning course of Data Science Ms
Machine learning project done during Monsoon Semester 2023 in IIITD.
Machine learning series 2.3 on boosted trees
Predicting the churn in the last month using the data (features) from the first three months and identify customers at high risk of churn and the main indicators of churn.
This project is aimed at predicting the case of customer's default payments. This dataset (30000,25) contains information on default payments, demographic factors, credit data, history of payment, and bill statements of credit card clients in Taiwan is used to build a classification model.
A prediction model based on ML as well as DL and compare their performances to find Churned Customers
This project aims to predict bank customer churn using a dataset derived from the Bank Customer Churn Prediction dataset available on Kaggle. The dataset for this competition has been generated from a deep learning model trained on the original dataset, with feature distributions being similar but not identical to the original data.
Exploring a collection of Jupyter notebooks showcasing a variety of Natural Language Processing (NLP) projects.
Advanced Machine Learning
upGrad's Telecom Churn Case Study hosted on Kaggle platform
Interconnect seeks to forecast customer churn by analyzing package choices and contracts. If a customer plans to leave, they're offered unique codes and special packages to foster loyalty.
The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the datsaset
To analyze the provided cancer.csv data and predict whether or not a patient has breast cancer using ensemble techniques
This is the source code for the end project of Statistical Methods in AI, 5th Semester, IIITH, '22. The project involves implementation of a research paper. The research paper is the Paper of Viola Jones Algorithm
This project explores the predictive modeling workflow using the Kaggle competition "Titanic - Machine Learning from Disaster." It emphasizes key stages like data analysis and model evaluation, aiming to identify the optimal model. Through a real-world approach, we enhance our understanding of the workflow and emphasize rigorous model evaluation.