Sabyasachi Datta's repositories
Breast-Cancer-Wisconsin-Dataset
Visualize the data in 2-D scatter plot and write the inferences, Make a boxplot for each feature and highlight the outlier, if any, then remove the outlier, make again box plot to show the outlier effect and write the inferences.
DieTanics-Dataset
Perform 1. Data input 2. Basic head view of the data 3. Data description 4. Data summary 5. Univariate analysis
Hbos-Technique
Hbos (Histogram based Outlier Score) Technique
Linear-Regression-Decision-Tree-Gaussian-Naive-Bayes
Perform Linear Regression on the given dataset. Also perform K-Fold cross-validation
Preprocessing-Hidden-Markov-Model
Preprocessing-Hidden-Markov-Model
Stroke-Prediction-Dataset
You need to download ‘Stroke Prediction Dataset’ data using the library Scikit learn; ref is given below. [5] 2. Divide the data randomly in training and testing with a 7:3 ratio 100 times, perform the following tasks with training data and test the performance on testing data. Testing data should remain unseen for all steps.
Wine-Data-Clusterring
1. You need to download “Wine” data from the kaggle Perform at least 5 Clustering methods with varying cluster sizes. Find correct cluster numbers for each method and show with line plot, how you finalized this cluster number.