There are 2 repositories under datamining-algorithms topic.
Course Project of Information Retrieval.
A porting to modern g++ and C+11 of the IBM Quest dataset generator
This is projects of Data Mining
Code for the paper "SPEck: Mining Statistically-significant Sequential Patterns Efficiently with Exact Sampling", by Steedman Jenkins, Stefan Walzer-Goldfeld, and Matteo Riondato, appearing in the Data Mining and Knowledge Discovery Special Issue for ECML PKDD'22.
FP Growth algorithm implemented using python
Implementing Gaussian Naive Bayes and KNN from scratch and evaluating their performances on heart dataset
A project for streaming algorithms: Bloom filtering, Flajolet-Martin Algorithm, Fixed-Size Sampling
KNN algorithm in big data for the detection of anomalies using Apache Spark and Scala
Implementation de l'algorithme de generation des items frequents en python et l'application sur une base de donnes diabetique
The dataset is about past loans. The loan_train.csv data set includes details of 346 customers whose loans are already paid off or defaulted.
The project or work which goals to extract the opinions, emotions, attitudes of public towards different object of interest. Sentiment analysis is a form of shallow semantic analysis of texts. In the project an automatic approach that involves supervised machine learning and text mining classification algorithms are used which includes the sentimental analysis in various applications. Various fields like twitter tweets, movie review tweets, election result tweets, digital libraries, life sciences, social media tweets and various other fields have been analyzed and the algorithms like line regression, Support vector machine (SVM), Naïve bayes are used in every content and a result is brought out through the means of various types of graphs. More specifically, MOVIE REVIEWS have been considered and the scrapping reviews of a particular movie after the predicting the sentiment of each review and on that basis, status of the movie review is understood. It is predicted that one can generalize the other applications using the specific content even.
Used clustering algorithms such as K-Means, Fuzzy C-Means, and Density-Based Algorithms like DBScan to cluster three datasets and reported result of the best algorithm after 200 random starting points.[part of my data mining course]
Practice codes for Machine Learning, Data Mining and NLP in Python
In this project we tried to solve a $10000 Kaggle competition. Starting with a dataset containing information about buying and selling used cars, we want to determine whether a purchase is a good or a bad purchase through the use of state-of-the-art Machine Learning and AI algorithms.
In this project, data mining and time series analysis algorithms are used to predict whether people are present in a room based on physical information such as CO2 or humidity levels in the air.
This is Final Capstone Project for ALY6040 Data Mining Fall 2021 CPS. Primarily to learn Data Analytics, Data Mining and Python. Residential and commercial properties were assessed in Boston. The Boston Globe reported in May 2021 that the competitive Boston housing market drives up costs. As the pandemic continues, people demand larger homes. Finding a home became more difficult as most property managers and realtors could not display their properties to several people. This post was written to help individuals, realtors, and real estate brokers find a property at a reasonable price. We selected to use a few basic machine learning concepts to help us determine the best selling price for the house based on the amount of rooms, location, design, and other characteristics about the bath and kitchen. We only focused on residential property because it was in demand. This study's goal was to improve on initial EDA work by constructing predictive models that solved our business concerns. Finally, optimizing the model's performance.
Experimenting with clustering, classification and association analysis with various csv files.
Data minnig GUI project to predict laptop prices,I uses most of ML algorithmes here
This contains all projects that I have done during my master degree.
Data Mining
some training, learning and TD/TP ressources
Decision Tree project based on ID3 Algorithm built on Jupytor Notebook with Python. Dataset taken: Tennis.csv
Datamining concepts
Course Code: CS626, MCS Batch-2019 (Final Year) Evening
Market basket analysis is a technique used mostly by retailers to identify which products clients purchase together most frequently. This involves analyzing point of sale (POS) transaction data to identify the correlations between different items according to their co-occurrence in the data.
Web Based application with various operations for data science
A Data Mining project which focuses on the comparison between different un-supervised clustering algorithms on geographical data
Data Mining for Research Diary at Indiana University
Data Mining Model For Detection of Fraudulent Behaviour
Python Implementation of data mining algorithms(Apriori, Eclact, FP Growth ).