There are 3 repositories under datapreprocessing topic.
Simple tool to split COCO annotations into train/test datasets.
Roadmap for Data Engineering
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
Wind Power Forecasting Based on Hybrid CEEMDAN-EWT Deep Learning Method
Analyzing the HR Criteria of a Company and how they promote their Employees and keep Balance between them using Data Analytics, Data Visualizations, and Machine Learning Models for Classification Purposes.
Cereja is a bundle of useful functions we don't want to rewrite and .. just pure fun!
Utilizes a Convolutional-based Transformer architecture for accurate and efficient PV power forecasting.
Data Science RoadMap
Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical variables.
The project provides a real-world dataset focusing on supply chain analytics
Power BI exercises for courses on DataCamp's Data Analyst in Power BI Career Track
A helper package for Machine Learning and Deep Learning Algorithms
Data Visualization, EDA , Model Building and Deployment etc..
⚒️ Data preprocessing is the process of transforming raw data into an understandable format. It is also an important step in data mining as we cannot work with raw data. The quality of the data should be checked before applying machine learning or data mining algorithms
My learnings on different algorithms of Machine Learning with Python .
Predicting whether a person who has applied for a loan in a bank would get his/her loan approved or not using Classification Algorithms in Machine Learning, by looking at some common and useful attributes.
Automating the process of Data Preprocessing for Data Science
This repository contains a basic fraud detection system utilising supervised learning techniques to identify potentially fraudulent credit card transactions. The project establishes a baseline model that addresses the challenges of credit card fraud in financial institutions.
In this project we try to predict home credit default risk for clients. We try to predict, if the client will have payment difficulties or not.
Image Classification model for detecting and classifying *DIABETIC RETINOPATHY* using retina images
Python编写的处理法务邮单自动批量生成的脚本小工具-提取判决书内容免去手输填充邮单-Legal agency postal receipt automatically generate app
This repository focuses on two machine learning projects in the healthcare domain.
Resume Screening using Machine Learning and Python
ScrapySub is a Python library designed to recursively scrape website content, including subpages. It fetches the visible text from web pages and stores it in a structured format for easy access and analysis. This library is particularly useful for NLP and AI developers who need to gather large amounts of web content for their projects.
Preparing Mammography Images for the ResNet Algorithm and Performing Data Preprocessing for the Teknofest 2022 Healthcare Artificial Intelligence Competition.
Engineered an innovative project for epilepsy disorder classification by harnessing EEG signals, attaining an outstanding accuracy rate of 95% in identifying diverse seizure types. Implemented sophisticated signal processing techniques and advanced machine learning algorithms, enhancing the system's precision and efficiency in classification.
All my Machine Learning Projects from A to Z in (Python & R)
The Medical Chatbot is an AI-powered platform designed to assist users in obtaining accurate and reliable medical information. It aims to provide instant responses to user queries, offer symptom analysis, suggest potential diagnoses, and recommend appropriate next steps for seeking medical care.
The aim of this project is to create a custom dataset for sentiment analysis. Use the data to fine-tune a BERT model and deploy your NLP model as an API
ASHRAE - Great Energy Predictor III: How much energy will a building consume?
Heart disease is a major global health concern that affects millions of people around the world. Early detection and accurate prediction of heart disease can help to prevent the progression of the disease and save lives. In this project, we aim to develop a predictive model for heart disease using various machine learning algorithms.
Step towards maching learning
The algorithm utilizes Generative AI and Natural Language Processing (NLP) to analyze the nutritional content of packaged food products. The system considers personalized health conditions, such as allergies and dietary needs, to provide tailored recommendations, helping individuals make safer and more informed food choices.