preprocessing-data

Repositories listed under the preprocessing-data topic:

vanderschaarlab / hyperimpute
A framework for prototyping and benchmarking imputation methods
data-science imputation imputation-algorithm machine-learning machine-learning-prerequisites preprocessing-data python scikit-learn
Language:Python 179
Unstructured-IO / community
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
community data-pipeline deep-learning document-ai document-parsing machine-learning nlp-parsing ocr-python open-source preprocessing-data
29
ELHoussineT / AutoDataCleaner
Simple and automatic data cleaning in one line of code! It performs one-hot encoding, casts date and time columns to datetime dtype, detects binary columns, safely converts non-numeric columns to numeric dtypes, cleans dirty/empty values, normalizes values, and removes unwanted columns, all in one line of code. Get your data ready for model training and fitting quickly.
cleaning-data data-analysis data-science machine-learning preprocessing-data
Language:Python 19
dlite-tools / NLPiper
NLPiper is a package that aggregates different NLP tools and applies their transformations to the target document.
nlp nlp-library nlp-parsing preprocessing preprocessing-data text text-analysis text-processing
Language:Python 18
imyjk729 / Memristor
In-sensor reservoir computing for language learning via two-dimensional memristors
machine-learning memristive-crossbar memristor preprocessing-data
Language:Jupyter Notebook 17
weiglszonja / meeg-tools
EEG/MEG data preprocessing and analysis framework
pipeline eeg-preprocessing preprocessing-data eeg-analysis time-frequency-analysis connectivity-analysis
Language:Jupyter Notebook 12
data-analyst-praktikum / Projects
Jupyter Notebook Praktikum Projects. This is a repository of educational data analyst projects from Yandex.Praktikum.
ab-testing cohort-analysis data-analysis data-visualization exploratory-data-analysis funnels jupyter-notebook machine-learning matplotlib metrics numpy pandas plotly preprocessing-data python sql statistical-data-analysis unit-economics
Language:HTML 11
cecivieira / cotas-genero-eleicoes-e-proposicoes-legislativas
Data analysis of gender quotas and their impact on elections and legislative proposals in the Federal Chamber of Deputies between 1934 and 2021. Part of the final project (TCC) for the postgraduate program in Artificial Intelligence and Machine Learning at @pucminas
dataanalysis pandas preprocessing-data python randomforestclassifier
Language:Jupyter Notebook 9
UniFeat / unifeat
An open-source tool for performing feature selection in different areas of research
dimensionality-reduction feature-selection machine-learning multi-class-classification preprocessing-data supervised-learning unsupervised-learning
Language:Java 9
tuanio / backend-recommender-system-book
Flask REST API for Recommender System Book App on Android
flask python preprocessing-data
Language:Jupyter Notebook 7
bharadwaj-chukkala / Data-driven-motion-planning-using-various-machine-learning-algorithms
ENPM808A: Introduction to Machine Learning Final Project
data-structures data-visualization feature-engineering generalization hoeffding hyperparameter-tuning linear-regression machine-learning neural-network optimization predictive-modeling preprocessing-data python regularization scikitlearn-machine-learning tensorflow
Language:Jupyter Notebook 5
ArthurMangussi / pymdatagen
A Python Library for the Generation of Artificial Missing Data
machine-learning preprocessing-data amputation missing-data
Language:Python 4
ChristianGoueguel / specProc
The specProc package is a collection of preprocessing tools for spectroscopy data analysis.
preprocessing-data chemometrics emission-spectra
Language:R 4
FaezehAbedi2023 / Statistical-Analysis-in-Sensor-Data-Processing-with-Machine-Learning-Models
This project develops an activity recognition model for a mobile fitness app using statistical analysis and machine learning. By processing smartphone sensor data, it extracts features to train models that accurately recognize user activities.
matplotlib numpy pandas python scikit-learn accuracy-score confusion-matrix correlation-matrix decisiontreeclassifier f1-score histograms precision-score preprocessing-data randomforestclassifier recall-score smote-oversampler smote-sampling svm-classifier
Language:Jupyter Notebook 4
CCaribe9 / AdaptStdEPF
Code and experiments related to the paper: 'An adaptive standardisation methodology for Day-Ahead electricity price forecasting'
data-science electricity-price-forecasting machine-learning preprocessing-data time-series time-series-forecasting
Language:Jupyter Notebook 3
drleniaw / Analysis_Sentiment_Twitter_Free_Sex_In_Indonesian
Sentiment analysis of Twitter posts about free sex in Indonesia
collaboration crawling jupyter-notebook lda naive-bayes-classifier preprocessing-data python sentiment-analysis support-vector-machines twitter twitter-sentiment-analysis vader-lexicon word2vec wordcloud
Language:Jupyter Notebook 3
fezzibasma / Speed-Dating-Experiment
What attributes influence the selection of a romantic partner?
cleaning-data eda explanatory-data-analysis preprocessing-data python speed-dating visualization
Language:Jupyter Notebook 3
functorism / snapcrop
CLI for crop/resize of large amounts of images with configurable resolutions
cli crop crop-image cropper dataset-generation preprocessing preprocessing-data preprocessor resize resize-images
Language:Rust 3
kkmk11 / BLIGHT-VISION
This is an ML-based web app that aims to detect the presence of late blight or early blight on potato leaves, which are the primary causes of crop damage. Additionally, the system recommends appropriate precautions and pesticides to help farmers eliminate the blight, protect their crops, and increase their yields.
bootstrap5 cnn-classification firebase-auth firebase-database flask html-css-javascript kaggle-dataset preprocessing-data python-3 react-hooks react-router reactjs
Language:PureBasic 3
Navaneeth-Sharma / Speech_Recognition_of_Digits
This project recognizes spoken digits and converts them to text. It uses signal processing techniques such as MFCC, along with other advanced signal processing techniques, to preprocess the data. The preprocessed data is then used by neural network algorithms to learn the pattern or structure of the sound.
signal-processing preprocessing-data mfcc mfcc-images neural-networks machine-learning cnn-keras
Language:Jupyter Notebook 3
rifkyahmadsaputra / Hollywood-Movies-Visualizations-and-Recommender-System
In this project, I analyze and visualize IMDb data and then build a movie recommender system on it. I did this because I wanted to learn more about movies, especially Hollywood movies. The analysis and visualizations cover information such as who produced each movie, when it was released, its rating, and its budget and income. The recommender system then suggests the top 10 movies most similar to the one entered by the user.
preprocessing-data data-analysis exploratory-data-analysis movie-recommender data-visualization
Language:Jupyter Notebook 3
Shaheer-khan-github / Natural-Language-Processing-in-Python-DataCamp
datacamp-track machine-learning natural-language-processing preprocessing-data python nltk-python scikit-learn spacy-nlp
Language:Jupyter Notebook 3
XuanyiJennyMa / pupil_cloud_data_preprocessing_Phase_1
Scripts for pre-processing eye-tracker data from pupil cloud
eye-tracking preprocessing-data pupillometry
Language:Python 3
abduulrahmankhalid / Bondora-Financial-Risk-Prediction
End-to-End Machine Learning Project of Peer-to-Peer Lending Bondora systems.
data-analysis data-science data-visualization exploratory-data-analysis feature-engineering flask machine-learning numpy pandas pipelines preprocessing-data render sklearn web-application
Language:Jupyter Notebook 2
ALEXUSCR-27 / Amazon-Books-Genre-Classifier
This classifier predicts the genre of books based on titles or descriptions using a Machine Learning model trained on an Amazon books dataset.
data python preprocessing-data analytics gradio matplotlib sckiit-learn svm-classifier svm-model svm-training
Language:Jupyter Notebook 2
alvaro-concha / animal-behavior-preprocessing
animal-behavior-preprocessing is a Python repository to preprocess animal behavior data. It works on the output spreadsheets from video-tracking of animal body parts with LEAP or DeepLabCut. It applies a Median Filter, an Ensemble Kalman Filter, transforms data to joint angles and computes their Morlet Wavelet Spectra.
cleaning-data data-engineering feature-extraction filtering pipeline preprocessing-data
Language:Python 2
BirchKwok / spinesUtils
A library that provides template code for Python development to shorten the project development cycle.
data-science machine-learning machine-learning-algorithms preprocessing-data
Language:Python 2
caesarmario / data-warehouse-credit-card-applicant-using-pentaho
This repository contains the OLTP, ETL process (using Pentaho Data Integration), and OLAP for a credit card dataset. The dataset is taken from Kaggle (https://www.kaggle.com/rikdifos/credit-card-approval-prediction) and is part of the author's Capstone Project.
etl credit-card creditcard pentaho-data-integration pentaho oltp olap olap-database pentaho-kettle etl-process preprocessing-data
2
LuisFelipePoma / Machine_Learning
Learning about the algorithms used in machine learning, along with techniques for training and testing models.
backpropagation-learning-algorithm feature-engineering gradient-descent loss-functions metrics-visualization neuronal-networks nlp normalization-techniques optimizer-algorithms regression-models preprocessing-data data-science html ia learning python
Language:Jupyter Notebook 2
msche81 / 2-Jedha_Fullstack
450h Data Scientist training - Collect and store large amounts of data - Build prediction models in Machine Learning and Deep Learning - Deploy your models in real conditions
big-data big-data-analytics bigdata data-science data-visualization database deep-learning deeplearning eda machine-learning-algorithms machinelearning preprocessing preprocessing-data vizualisation vizualization vizualizations vizualize-data
Language:Jupyter Notebook 2
nlqthinh / WeaviateAnime
Explore your favorite anime with this interactive search app! 🚀 This project leverages Weaviate for vector search and Gradio for a seamless user interface. Using embeddings from a custom anime dataset, you can perform quick and accurate similarity searches for anime titles
anime docker gradio preprocessing-data python vectordb weaviate
Language:Python 2
r-a-j / Social-Scope
"SocialScope harnesses the power of data science to Instagram's vast content, providing insightful analytics and trend predictions for informed decision-making."
data data-analysis data-modeling data-science data-scraping data-visualization datacollection model-evaluation preprocessing-data
Language:SCSS 2
RafiQamar / HR-Analytics-Project
Cleaned and processed HR data using Python for analysis and visualization. Analyzed employee trends and performance using SQL and Python. Built an interactive Power BI dashboard connected to MySQL for dynamic insights.
exploratory-data-analysis mysql-database powerbi preprocessing-data python
Language:Jupyter Notebook 2
RafiQamar / IMDb-Movie-Analysis
This project involves web scraping, data preprocessing, database storage and visualization of IMDb movie data from the last decade (2014-2024). The dataset includes details of 10,000 movies such as name, release year, genre, ratings, metascore and more. The project culminates in an interactive Power BI dashboard for in-depth insights and reporting.
machine-learning mysql-database powerbi preprocessing-data python webscraping
Language:Jupyter Notebook 2
Sabaudian / Music_Genre_Classification_project
Audio Pattern Recognition project - Music Genres Classification
audio-analysis audio-classification audio-processing genre-classification genres-classification k-nearest-neighbours k-nn music-genre-classification music-information-retrieval neural-network preprocessing preprocessing-data python random-forest random-forest-classification svm svm-classifier artificial-intelligence machine-learning
Language:Python 2
Shakilgithub20 / News-Classification
nltk-python nltk-library sklearn naivebayes svm-model svm-kernel preprocessing-data preprocessing
Language:Jupyter Notebook 2
